Open-source LLM toolkit for building trustworthy applications
Top 73.7% on sourcepulse
This toolkit provides an open-source framework for building trustworthy LLM applications, targeting developers who need to integrate retrieval, fine-tuning, and AI safety measures. It aims to bridge the gap between general LLMs and specific data stores, enabling customized AI systems aligned with unique intellectual property and safety requirements.
How It Works
The toolkit comprises four main components: TigerRAG for retrieval-augmented generation using embeddings, TigerTune for fine-tuning and evaluating text generation and classification models, TigerDA for data augmentation, and TigerArmor for AI safety evaluation. TigerRAG employs embeddings-based retrieval (EBR), RAG, and generation-augmented retrieval (GAR), utilizing BERT for embeddings and FAISS for indexing. TigerTune supports fine-tuning models like Llama2 and DistilBERT.
Quick Start & Requirements
pip install .
in tiger/TigerRAG
).generation_example.py
in TigerTune; notebooks are available as an alternative.movie_recs
) and TigerTune (classification_example.py
, generation_example.py
).Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The project relies on the OpenAI API, requiring an API key. CUDA GPU is necessary for certain fine-tuning examples, with notebooks provided as an alternative. The README does not specify the license, which may impact commercial adoption.
1 year ago
Inactive