Discover and explore top open-source AI tools and projects—updated daily.
Open-source LLM family for Southeast Asian languages
Top 78.2% on SourcePulse
SEA-LION is a family of open-source Large Language Models (LLMs) specifically designed to understand and cater to the diverse linguistic and cultural contexts of Southeast Asia. It targets researchers, developers, and organizations working with or within the region, aiming to improve representation for under-represented populations and low-resource languages.
How It Works
SEA-LION models are built through a combination of continued pre-training (CPT) and supervised fine-tuning (SFT) on foundational models like Llama 3.1 and Gemma2. This approach leverages existing powerful architectures while adapting them to the specific nuances of Southeast Asian languages and cultures, as evaluated by their custom SEA-HELM benchmark.
Quick Start & Requirements
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
2 weeks ago
1 week