Ling  by inclusionAI

Scalable Mixture-of-Experts LLMs for AI

Created 1 year ago
255 stars

Top 98.7% on SourcePulse

GitHubView on GitHub
Project Summary

Summary

Ling is an open-source Mixture-of-Experts (MoE) Large Language Model (LLM) family from InclusionAI, offering scalable solutions for diverse NLP tasks. It provides two primary model sizes, Ling-lite and Ling-plus, designed for efficient inference and adaptability, fostering community-driven innovation.

How It Works

Ling employs a Mixture-of-Experts (MoE) architecture, enabling selective activation of parameters for efficient computation. It features Ling-lite (16.8B total parameters, 2.75B activated) with a 128K context window and Ling-plus (290B total parameters, 28.8B activated) with a 64K context window. This design allows for significant performance gains while managing computational resources, making it suitable for a wide range of applications.

Quick Start & Requirements

  • Primary Usage: Integration via Hugging Face Transformers (AutoModelForCausalLM, AutoTokenizer) and ModelScope.
  • Inference Frameworks: Supports vLLM (requires applying a specific patch) for batched inference and API services, and MindIE for Ascend hardware.
  • Prerequisites: Standard Python environment for Hugging Face/ModelScope. vLLM typically requires CUDA-enabled GPUs. MindIE deployment necessitates Ascend NPUs, specific drivers (CANN 8.0.0), and a detailed network/Docker setup.
  • Finetuning: Recommended via Llama-Factory.
  • Links: Ling Hugging Face, Ling ModelScope, Llama-Factory.

Highlighted Details

  • Performance: Ling-lite-1.5 demonstrates strong benchmark performance, notably achieving 74.33 on MMLU, 87.27 on HumanEval, and 82.62 on Math benchmarks, often competitive with or exceeding similarly sized open-source models.
  • Context Handling: Ling-lite variants offer an extended 128K context window, with Ling-lite-1.5 showing improved long-text generation capabilities.
  • Model Variants: Includes specialized versions like Ling-coder-lite-base for code generation tasks.
  • Scalability: Offers distinct model sizes (lite and plus) to accommodate varying computational budgets and performance requirements.

Maintenance & Community

The project is actively updated with releases like Ling-lite-1.5 (May 2025) and Ling-lite-0415 (April 2025). While the "Ling Team" is credited, explicit community channels (Discord, Slack) or detailed roadmaps are not provided in the README.

Licensing & Compatibility

The project is released under the permissive MIT License. This license allows for broad use, modification, and distribution, including in commercial and closed-source applications, with minimal restrictions.

Limitations & Caveats

The MindIE inference framework, while detailed, imposes a significant barrier to entry, requiring specialized Ascend hardware and a complex, multi-step setup involving Docker and specific drivers. vLLM integration requires applying a custom patch, suggesting potential instability or lack of upstream support. Some models may need weight format conversion for specific inference engines.

Health Check
Last Commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
10 stars in the last 30 days

Explore Similar Projects

Starred by Junyang Lin Junyang Lin(Core Maintainer at Alibaba Qwen), Hanlin Tang Hanlin Tang(CTO Neural Networks at Databricks; Cofounder of MosaicML), and
5 more.

dbrx by databricks

0%
3k
Large language model for research/commercial use
Created 2 years ago
Updated 2 years ago
Feedback? Help us improve.