SEA-LION is a family of open-source Large Language Models (LLMs) specifically designed to understand and cater to the diverse linguistic and cultural contexts of Southeast Asia. It targets researchers, developers, and organizations working with or within the region, aiming to improve representation for under-represented populations and low-resource languages.
How It Works
SEA-LION models are built by applying continued pre-training (CPT) and supervised fine-tuning (SFT) to foundation models such as Llama 3.1 and Gemma 2. This approach leverages proven existing architectures while adapting them to the linguistic and cultural nuances of Southeast Asia; performance is evaluated with SEA-HELM, the project's custom benchmark.
Quick Start & Requirements
- Models are available via Hugging Face (links not provided in README).
- Requires standard LLM inference hardware (GPU recommended).
- Specific model variants may inherit licensing restrictions from their base models (e.g., Llama 3.1, Gemma 2).
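A minimal inference sketch with Hugging Face `transformers` is below. The model id is an assumption for illustration; check the official SEA-LION model cards on Hugging Face for exact names, and note that instruct variants expect chat-formatted prompts.

```python
# Minimal sketch: running a SEA-LION instruct model via Hugging Face transformers.
# MODEL_ID is an assumed example id -- verify against the official model cards.
MODEL_ID = "aisingapore/Llama-SEA-LION-v3-8B-IT"


def build_chat(prompt: str) -> list[dict]:
    """Wrap a user prompt in the chat-message format used by apply_chat_template."""
    return [{"role": "user", "content": prompt}]


if __name__ == "__main__":
    # Imported lazily so the helper above stays usable without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer  # pip install transformers

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID, device_map="auto")

    inputs = tokenizer.apply_chat_template(
        build_chat("Terjemahkan ke Bahasa Inggris: Selamat pagi."),
        add_generation_prompt=True,
        return_tensors="pt",
    ).to(model.device)

    outputs = model.generate(inputs, max_new_tokens=128)
    print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

As the hardware note above suggests, a GPU is recommended; `device_map="auto"` lets `transformers` place the weights on available devices.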
Highlighted Details
- Offers multiple model sizes (3B to 70B) and context lengths (up to 128K).
- v3.5 models are optimized for reasoning tasks.
- Evaluated using SEA-HELM, a custom benchmark focusing on English performance, SEA chat proficiency, instruction-following, and linguistic tasks.
- Models are available in Base, Instruct, and GGUF formats.
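For the GGUF format, one common route is local inference with llama.cpp. The sketch below assumes you have already downloaded a quantized GGUF file from a SEA-LION model page; the file name is hypothetical.

```shell
# Hypothetical file name -- download the actual GGUF from the model's
# Hugging Face page, then run it with llama.cpp's CLI:
./llama-cli -m gemma2-9b-cpt-sea-lionv3-instruct-Q4_K_M.gguf \
  -p "Apa ibu kota Indonesia?" \
  -n 128
```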
Maintenance & Community
- Anchored by AI Singapore's Products Pillar.
- Welcomes community contributions for bug reporting, documentation, evaluation tasks, and model training.
- Contact via GitHub issues or an inquiry form.
Licensing & Compatibility
- Primarily licensed under MIT, but the exact terms for each model depend on its base model.
- Llama-based variants may be subject to the Llama 3.1 Community License, which can restrict commercial use; Gemma-based variants may have different terms. Users must check the individual model cards.
Limitations & Caveats
- Commercial use restrictions may apply depending on the base model. Users must verify licensing for each specific SEA-LION model.