French LLMs for instruction-following and chat
Top 62.4% on sourcepulse
Vigogne provides a suite of open-source French instruction-following and chat Large Language Models (LLMs), built upon foundational models like LLaMA, Llama-2, and Falcon. It aims to offer powerful, French-centric conversational AI capabilities for researchers and developers working with French language data.
How It Works
Vigogne models are fine-tuned using a combination of translated instruction datasets, self-instruct methods, and ChatGPT distillation techniques. The project emphasizes efficient training, incorporating methods like LoRA and QLoRA, and offers support for advanced acceleration libraries such as FlashAttention for improved performance and reduced memory usage.
Quick Start & Requirements
pip install .
after cloning the repository.deepspeed
, packaging
, ninja
, flash-attn
.Highlighted Details
Maintenance & Community
The project is actively updated, with recent additions including Llama-2 based models, FastChat/vLLM serving support, and LangChain integration. Links to community resources are not explicitly provided in the README.
Licensing & Compatibility
The README mentions commercial-friendly licenses for Falcon and MPT-based models, but the specific license for LLaMA/Llama-2 based models is not detailed. Compatibility for commercial use with all models should be verified.
Limitations & Caveats
Vigogne is under active development, and users should be aware of potential limitations, including the possibility of generating harmful or biased content, incorrect information, or unhelpful answers.
8 months ago
1 week