txtai  by neuml

All-in-one AI framework for semantic search, LLM orchestration, and language model workflows

Created 5 years ago
11,591 stars

Top 4.4% on SourcePulse

GitHubView on GitHub
Project Summary

txtai is an all-in-one AI framework designed for semantic search, LLM orchestration, and language model workflows. It targets developers and researchers building applications like autonomous agents, retrieval augmented generation (RAG) systems, and complex multi-model pipelines. The framework's core benefit is its unified approach to integrating various AI capabilities, simplifying the development of sophisticated AI-powered applications.

How It Works

txtai's foundation is an embeddings database that combines vector indexes (sparse and dense), graph networks, and relational databases. This architecture enables powerful vector search and acts as a knowledge source for LLM applications. It supports creating embeddings for diverse data types (text, audio, images, video) and orchestrating complex tasks through pipelines and workflows, which can be chained together and aggregated with business logic. Agents built on this framework can autonomously solve problems by connecting these components.

Quick Start & Requirements

Highlighted Details

  • Supports semantic search with SQL, object storage, topic modeling, graph analysis, and multimodal indexing.
  • Offers pipelines for LLM prompts, QA, labeling, transcription, translation, and summarization.
  • Enables agent-based autonomous problem-solving using the smolagents framework.
  • Provides Web and Model Context Protocol (MCP) APIs with bindings for JavaScript, Java, Rust, and Go.
  • Includes default configurations for rapid setup and can scale via container orchestration.

Maintenance & Community

  • Actively maintained with regular updates (versions 8.0, 7.0, etc. mentioned).
  • Extensive example notebooks and tutorials available on dev.to and Hashnode.
  • Contribution guide provided for community involvement.

Licensing & Compatibility

  • Licensed under the Apache 2.0 license.
  • Models recommended for commercial use are available.
  • Compatible with various LLM frameworks like llama.cpp and LiteLLM.

Limitations & Caveats

The framework supports Python 3.10+, and while it offers many default models, users may need to install optional dependencies for specific functionalities or advanced use cases. The breadth of features means a learning curve for mastering all capabilities.

Health Check
Last Commit

1 day ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
13
Star History
189 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Nir Gazit Nir Gazit(Cofounder of Traceloop), and
4 more.

llmware by llmware-ai

0.6%
14k
Framework for enterprise RAG pipelines using small, specialized models
Created 2 years ago
Updated 1 month ago
Starred by Peter Norvig Peter Norvig(Author of "Artificial Intelligence: A Modern Approach"; Research Director at Google), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
2 more.

Hands-On-Large-Language-Models by HandsOnLLM

1.4%
16k
Code examples for "Hands-On Large Language Models" book
Created 1 year ago
Updated 1 month ago
Feedback? Help us improve.