llm-vscode by huggingface

VSCode extension for LLM-powered code development

Created 2 years ago
1,303 stars

Top 30.6% on SourcePulse

Project Summary

This VSCode extension provides LLM-powered code completion and generation for developers who want to integrate large language models into their workflow. It offers "ghost-text" style inline completion and code generation, with the goal of improving productivity through AI-assisted coding.

How It Works

The extension uses llm-ls, a Language Server Protocol (LSP) implementation, as its backend. It sends code context and prompts to configurable backends, including the Hugging Face Inference API, Ollama, OpenAI-compatible endpoints, and Text Generation Inference (TGI). Prompts are dynamically sized with tokenizers to fit the model's context window, and a code attribution feature checks generated code against "The Stack" dataset for potential licensing issues.
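Backend selection is driven by the extension's settings. The fragment below is an illustrative sketch of a settings.json entry, not a verbatim copy of the extension's schema: the keys shown assume the `llm.*` namespace and may differ by extension version, so confirm exact names in the VSCode settings UI.

```jsonc
{
  // Which backend llm-ls should call (value names are illustrative;
  // check the extension's settings UI for the exact options)
  "llm.backend": "ollama",
  // Endpoint for a self-hosted backend such as Ollama or TGI
  "llm.url": "http://localhost:11434",
  // Model identifier sent with each completion request
  "llm.modelId": "codellama:7b"
}
```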

Quick Start & Requirements

  • Install via VSCode Marketplace.
  • Default inference uses the bigcode/starcoder model via the Hugging Face Inference API.
  • Requires a Hugging Face API token for certain features or to avoid rate limits. Log in via the VSCode command palette (Llm: Login).
  • Configuration options are exposed through VSCode settings; see the repository's official documentation for details.

Highlighted Details

  • Supports "ghost-text" code completion similar to GitHub Copilot.
  • Configurable backends: Hugging Face Inference API, Ollama, OpenAI-compatible, TGI.
  • Automatic prompt sizing to fit model context windows using tokenizers.
  • Code attribution feature checks generated code against "The Stack" dataset.
  • Supports various models like Code Llama, Phind-CodeLlama, and WizardCoder.
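The automatic prompt sizing above can be sketched in a few lines. llm-ls uses real model tokenizers; the snippet below is a minimal illustration that assumes the code context has already been tokenized, and the function name and budget split are illustrative, not part of llm-ls.

```python
def fit_prompt(tokens, context_window, reserved_output=128):
    """Keep the most recent tokens so that the prompt plus the
    generated output fits inside the model's context window."""
    budget = max(context_window - reserved_output, 0)
    # A zero budget means no prompt tokens can be sent at all.
    return tokens[-budget:] if budget else []

# With a 512-token window and 128 tokens reserved for generation,
# only the most recent 384 prompt tokens survive.
trimmed = fit_prompt(list(range(1000)), context_window=512)
```

Trimming from the front keeps the code nearest the cursor, which is usually the most relevant context for completion.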

Maintenance & Community

The project is maintained by Hugging Face. Related repositories include huggingface-vscode-endpoint-server and llm-vscode-inference-server.

Licensing & Compatibility

The extension itself is likely under a permissive license, but usage of specific models via the Hugging Face Inference API may be subject to Hugging Face's terms of service and pricing tiers (e.g., PRO plan for higher rate limits).

Limitations & Caveats

Using the Hugging Face Inference API's free tier may result in rate limiting. The code attribution feature relies on a Bloom filter, which can produce false positives; the project recommends a second pass with a dedicated search tool for complete accuracy. Local development requires building the llm-ls binary, which may raise platform-support issues when pre-built binaries are unavailable.
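The Bloom-filter caveat is easy to see in a toy implementation. This is a generic sketch, not llm-ls's actual data structure: a lookup answers either "definitely never added" or "possibly added", and the latter can be a false positive, which is why an exact second-pass search is recommended.

```python
import hashlib

class BloomFilter:
    """Minimal Bloom filter: set membership with possible false positives."""

    def __init__(self, size=1024, num_hashes=3):
        self.size = size
        self.num_hashes = num_hashes
        self.bits = [False] * size

    def _positions(self, item):
        # Derive num_hashes independent bit positions from SHA-256.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{item}".encode()).hexdigest()
            yield int(digest, 16) % self.size

    def add(self, item):
        for pos in self._positions(item):
            self.bits[pos] = True

    def might_contain(self, item):
        # True means "possibly present" (could be a false positive);
        # False is definitive: the item was never added.
        return all(self.bits[pos] for pos in self._positions(item))

bf = BloomFilter()
for snippet in ["def foo():", "print('hi')"]:
    bf.add(snippet)
```

Because different items can set overlapping bits, a query for code that was never indexed may still return True, so a match only flags a candidate for exact verification.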

Health Check
Last Commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
2 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen (author of "AI Engineering" and "Designing Machine Learning Systems"), Vincent Weisser (cofounder of Prime Intellect), and 15 more.

codellama by meta-llama

16k stars
Inference code for CodeLlama models
Created 2 years ago
Updated 1 year ago