Framework for enterprise RAG pipelines using small, specialized models
Top 3.6% on sourcepulse
This library provides a unified framework for building enterprise Retrieval-Augmented Generation (RAG) pipelines using small, specialized LLMs. It targets developers and researchers seeking to deploy private, cost-effective, and adaptable LLM applications for tasks like fact-based question-answering, classification, and summarization. The core benefit is enabling rapid development of knowledge-based enterprise LLM applications with a focus on small, efficient models.
How It Works
The framework comprises two main components: a RAG Pipeline for managing the lifecycle of connecting knowledge sources to LLMs, and a catalog of over 50 small, specialized models (SLIM, BLING, DRAGON series) fine-tuned for enterprise tasks. It supports various model formats (GGUF, HuggingFace, Sentence Transformers) and integrates with multiple database backends (SQLite, MongoDB, PostgreSQL) and vector stores (Milvus, ChromaDB, FAISS, etc.). This approach allows for flexible deployment, from local laptops to scalable clusters, and emphasizes the use of smaller models for efficiency and privacy.
Quick Start & Requirements
pip3 install llmware
or pip3 install 'llmware[full]'
Highlighted Details
ModelCatalog
for easy access to various model types and formats.Library
class for parsing, chunking, and indexing diverse document types (PDF, DOCX, WAV, JPG, etc.).Agents
module (LLMfx
) for multi-model workflows and function calling.Maintenance & Community
The project is actively maintained with frequent releases and updates. Community engagement is encouraged via GitHub discussions.
Licensing & Compatibility
The project is released under the MIT License, permitting commercial use and integration with closed-source applications.
Limitations & Caveats
Some users have reported issues with PyTorch 2.3 and NumPy 2.0, recommending downgrading to compatible versions (PyTorch 2.1, NumPy < 2.0). Support for specific Linux versions might require raising an issue.
1 week ago
1 day