turbopilot by ravenscroftj

Self-hosted code completion engine (deprecated)

created 2 years ago
3,822 stars

Top 13.0% on sourcepulse

Project Summary

TurboPilot was an open-source, self-hosted code completion engine designed to run large language models locally on CPU, targeting developers seeking an alternative to cloud-based AI coding assistants. It aimed to provide efficient, private code suggestions by leveraging quantized models and the llama.cpp library.

How It Works

TurboPilot utilizes the llama.cpp library to run quantized versions of large language models, such as Salesforce Codegen, WizardCoder, and Starcoder, on consumer hardware. This approach allows for local inference, reducing reliance on external servers and enhancing privacy. The project supports various model formats and quantization levels, enabling users with limited RAM (as low as 4GB) to run capable models, while also offering GPU offloading for enhanced performance.
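For illustration, a minimal local setup might look like the sketch below. The Hugging Face download URL is hypothetical (the project's MODELS.md lists the real ones); the -m/-f flags and model filename come from the quick-start examples in the next section.

    # Fetch a 4-bit quantized model file (illustrative URL, not an official path)
    curl -L -o ./models/santacoder-q4_0.bin \
      https://huggingface.co/<user>/<repo>/resolve/main/santacoder-q4_0.bin

    # Start the server on CPU; llama.cpp handles inference over the quantized weights
    ./turbopilot -m starcoder -f ./models/santacoder-q4_0.bin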

Quick Start & Requirements

  • Install/Run: Download binaries or run via Docker. Example: ./turbopilot -m starcoder -f ./models/santacoder-q4_0.bin or docker run --rm -it -v ./models:/models -e THREADS=6 -e MODEL_TYPE=starcoder -e MODEL="/models/santacoder-q4_0.bin" -p 18080:18080 ghcr.io/ravenscroftj/turbopilot:latest.
  • Prerequisites: Models must be downloaded separately (e.g., from Hugging Face). CUDA 11/12 is required for GPU acceleration via the dedicated Docker images.
  • Resources: Can run on 4GB RAM for smaller models; GPU recommended for larger models and better performance.
  • Docs: MODELS.md for model catalog.

Highlighted Details

  • Supports multiple state-of-the-art local code completion models including WizardCoder, Starcoder, and Santacoder.
  • Offers CUDA inference support via Docker for GPU acceleration.
  • API is broadly compatible with OpenAI's format and usable with the vscode-fauxpilot plugin (see the request sketch after this list).
  • Refactored source code for easier extension and model integration.
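As a sketch of that API compatibility, a completion request might look like the following. The endpoint path and JSON fields are assumptions modeled on the OpenAI/fauxpilot completions format, and port 18080 is the default from the Docker example above; check the actual routes exposed by your build before relying on them.

    # Hypothetical OpenAI-style completion request against a local TurboPilot server
    curl -s http://localhost:18080/v1/engines/codegen/completions \
      -H "Content-Type: application/json" \
      -d '{"prompt": "def fibonacci(n):", "max_tokens": 64}'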

Maintenance & Community

TurboPilot is deprecated and archived as of September 30, 2023. The author recommends exploring more mature alternatives.

Licensing & Compatibility

The project's licensing is not explicitly stated in the README, but it relies on GGML and llama.cpp, which are typically under permissive licenses. Compatibility for commercial use or closed-source linking would require verification of the specific model licenses and the project's own licensing.

Limitations & Caveats

The project is explicitly marked as deprecated and archived. It was considered a proof-of-concept with potentially slow autocompletion and only supports one GPU device at a time.

Health Check

  • Last commit: 1 year ago
  • Responsiveness: 1 day
  • Pull Requests (30d): 0
  • Issues (30d): 0
  • Star History: 11 stars in the last 90 days

Explore Similar Projects

Starred by Andrej Karpathy (founder of Eureka Labs; formerly at Tesla and OpenAI; author of CS 231n), Nat Friedman (former CEO of GitHub), and 32 more.

llama.cpp by ggml-org

C/C++ library for local LLM inference

created 2 years ago
updated 14 hours ago
84k stars

Top 0.4% on sourcepulse