llama_index  by run-llama

Data framework for building LLM-powered agents

created 2 years ago
43,410 stars

Top 0.6% on sourcepulse

GitHubView on GitHub
Project Summary

LlamaIndex is a Python data framework designed to augment Large Language Models (LLMs) with private data. It empowers developers to build LLM-powered applications by providing tools for data ingestion, structuring, and querying, enabling LLMs to access and reason over custom datasets.

How It Works

LlamaIndex offers a modular architecture with a core library and over 300 community-contributed integrations available on LlamaHub. It ingests data from various sources (APIs, PDFs, SQL) using data connectors, structures it into indices (e.g., vector stores, knowledge graphs) for efficient LLM access, and provides a query interface for knowledge-augmented responses. This approach allows for flexible customization of LLM, embedding, and vector store providers.

Quick Start & Requirements

  • Install core and integrations: pip install llama-index-core llama-index-llms-openai llama-index-embeddings-huggingface
  • Requires Python. OpenAI API key or Replicate API token for LLM/embedding models.
  • Example usage and detailed documentation are available.

Highlighted Details

  • High-level API allows data ingestion and querying in as few as 5 lines of Python.
  • Low-level APIs offer customization of data connectors, indices, retrievers, and more.
  • Supports in-memory storage and persistence to disk.
  • Integrates with frameworks like LangChain, Flask, and Docker.

Maintenance & Community

  • Active community with contributions encouraged.
  • Links to Discord, X (formerly Twitter), LinkedIn, and Reddit are provided.
  • LlamaHub serves as a community library for data loaders.

Licensing & Compatibility

  • The repository does not explicitly state a license in the provided README.

Limitations & Caveats

  • The README notes that it is not updated as frequently as the official documentation, recommending users refer to the documentation for the latest information.
Health Check
Last commit

15 hours ago

Responsiveness

1 day

Pull Requests (30d)
153
Issues (30d)
142
Star History
2,284 stars in the last 90 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems) and Elie Bursztein Elie Bursztein(Cybersecurity Lead at Google DeepMind).

LightRAG by HKUDS

1.0%
19k
RAG framework for fast, simple retrieval-augmented generation
created 10 months ago
updated 14 hours ago
Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), Tim J. Baek Tim J. Baek(Founder of Open WebUI), and
2 more.

llmware by llmware-ai

0.2%
14k
Framework for enterprise RAG pipelines using small, specialized models
created 1 year ago
updated 1 week ago
Feedback? Help us improve.