Chinese LLM engine for democratized access and instruction tuning
Top 6.4% on sourcepulse
BELLE is an open-source Chinese conversational large language model engine aiming to lower the barrier for research and application of LLMs, particularly in Chinese. It focuses on providing accessible instruction-following models and training data, enabling users to develop their own high-quality conversational AI.
How It Works
BELLE fine-tunes existing large language models, primarily LLaMA and BLOOMZ, using a substantial corpus of Chinese conversational data. The project emphasizes the impact of training data quality, quantity, and language distribution on model performance, exploring techniques like vocabulary expansion and efficient fine-tuning methods such as LoRA.
Quick Start & Requirements
Highlighted Details
Maintenance & Community
The project is actively maintained by the BELLEGroup, with regular updates on new models, research reports, and training code. Community engagement is encouraged via Discord and WeChat.
Licensing & Compatibility
Limitations & Caveats
Models may produce factually incorrect or harmful responses and require further improvement in reasoning, coding, and multi-turn dialogue. The project explicitly states models are for research purposes only and prohibits commercial or harmful use. The evaluation methodology has limitations, and reported scores may not fully reflect real-world user experience.
9 months ago
1 day