Ling by inclusionAI

Scalable Mixture-of-Experts LLMs for AI

Created 1 year ago

258 stars

Top 98.0% on SourcePulse

Project Summary

Summary

Ling is an open-source Mixture-of-Experts (MoE) Large Language Model (LLM) family from InclusionAI, offering scalable solutions for diverse NLP tasks. It provides two primary model sizes, Ling-lite and Ling-plus, designed for efficient inference and adaptability, fostering community-driven innovation.

How It Works

Ling employs a Mixture-of-Experts (MoE) architecture, enabling selective activation of parameters for efficient computation. It features Ling-lite (16.8B total parameters, 2.75B activated) with a 128K context window and Ling-plus (290B total parameters, 28.8B activated) with a 64K context window. This design allows for significant performance gains while managing computational resources, making it suitable for a wide range of applications.

Quick Start & Requirements

Primary Usage: Integration via Hugging Face Transformers (AutoModelForCausalLM, AutoTokenizer) and ModelScope.
Inference Frameworks: Supports vLLM (requires applying a specific patch) for batched inference and API services, and MindIE for Ascend hardware.
Prerequisites: Standard Python environment for Hugging Face/ModelScope. vLLM typically requires CUDA-enabled GPUs. MindIE deployment necessitates Ascend NPUs, specific drivers (CANN 8.0.0), and a detailed network/Docker setup.
Finetuning: Recommended via Llama-Factory.
Links: Ling Hugging Face, Ling ModelScope, Llama-Factory.

Highlighted Details

Performance: Ling-lite-1.5 demonstrates strong benchmark performance, notably achieving 74.33 on MMLU, 87.27 on HumanEval, and 82.62 on Math benchmarks, often competitive with or exceeding similarly sized open-source models.
Context Handling: Ling-lite variants offer an extended 128K context window, with Ling-lite-1.5 showing improved long-text generation capabilities.
Model Variants: Includes specialized versions like Ling-coder-lite-base for code generation tasks.
Scalability: Offers distinct model sizes (lite and plus) to accommodate varying computational budgets and performance requirements.

Maintenance & Community

The project is actively updated with releases like Ling-lite-1.5 (May 2025) and Ling-lite-0415 (April 2025). While the "Ling Team" is credited, explicit community channels (Discord, Slack) or detailed roadmaps are not provided in the README.

Licensing & Compatibility

The project is released under the permissive MIT License. This license allows for broad use, modification, and distribution, including in commercial and closed-source applications, with minimal restrictions.

Limitations & Caveats

The MindIE inference framework, while detailed, imposes a significant barrier to entry, requiring specialized Ascend hardware and a complex, multi-step setup involving Docker and specific drivers. vLLM integration requires applying a custom patch, suggesting potential instability or lack of upstream support. Some models may need weight format conversion for specific inference engines.

Ling by inclusionAI

Explore Similar Projects

Ling-V2 by inclusionAI

dFactory by inclusionAI

EXAONE-Deep by LG-AI-EXAONE

Kolosal by KolosalAI

snowflake-arctic by Snowflake-Labs

Seed-Coder by ByteDance-Seed

david-share by david-xinyuwei

tiny-llm-zh by wdndev

dbrx by databricks

guppylm by arman-bd

mlc-llm by mlc-ai

lm-evaluation-harness by EleutherAI