cortex.cpp  by menloresearch

Local AI API platform for robots, optional cloud

Created 2 years ago
2,762 stars

Top 17.2% on SourcePulse

GitHubView on GitHub
Project Summary

Cortex.cpp is a local AI API platform designed for robotics and AI applications, enabling users to run various AI models (vision, speech, language, tabular, action) without cloud dependency. It targets developers and researchers seeking an efficient, self-contained AI inference solution.

How It Works

Cortex.cpp utilizes a multi-engine architecture, initially supporting llama.cpp, with extensibility for custom engines. It features automatic hardware acceleration for NVIDIA, AMD, and Intel GPUs, optimizing inference performance. The platform exposes an OpenAI-compatible API, simplifying integration with existing tools and workflows.

Quick Start & Requirements

  • Installation: Download platform-specific installers (exe, pkg, deb) or use curl -s https://raw.githubusercontent.com/menloresearch/cortex/main/engine/templates/linux/install.sh | sudo bash for other Linux distributions.
  • Prerequisites: None explicitly stated beyond OS compatibility.
  • Usage: Start server with cortex start, pull models with cortex pull <model_name>, and run models with cortex run <model_name>.
  • Docs: Cortex Docs, API Reference

Highlighted Details

  • Supports multiple AI model types including Llama 3, Phi-4, Mistral, and Gemma.
  • Offers automatic GPU detection and hardware optimization.
  • Provides an OpenAI-compatible API for seamless integration.
  • Includes nightly builds with advanced features like multiple quantizations and engine management.

Maintenance & Community

  • Actively under development with rapid improvements.
  • Community support available via Discord.
  • Issues can be reported on GitHub.

Licensing & Compatibility

  • License not explicitly stated in the README. Compatibility for commercial or closed-source use is not specified.

Limitations & Caveats

The project is under active development, indicating potential for rapid changes and breaking updates. Specific licensing details for commercial use are not provided.

Health Check
Last Commit

2 months ago

Responsiveness

1 week

Pull Requests (30d)
0
Issues (30d)
0
Star History
2 stars in the last 30 days

Explore Similar Projects

Starred by Eric Zhu Eric Zhu(Coauthor of AutoGen; Research Scientist at Microsoft Research), Elvis Saravia Elvis Saravia(Founder of DAIR.AI), and
15 more.

semantic-kernel by microsoft

0.3%
26k
SDK for building intelligent AI agents and multi-agent systems
Created 2 years ago
Updated 1 day ago
Feedback? Help us improve.