Yi-1.5  by 01-ai

Yi-1.5: upgraded open-source language model series

created 1 year ago
554 stars

Top 58.7% on sourcepulse

GitHubView on GitHub
Project Summary

Yi-1.5 is a suite of large language models (LLMs) offering enhanced capabilities in coding, math, reasoning, and instruction following. Targeting developers and researchers, it provides improved performance over its predecessor, Yi, with models available in 34B, 9B, and 6B parameter sizes.

How It Works

Yi-1.5 is built upon a foundation of continuous pre-training on a 500 billion token corpus, followed by fine-tuning on 3 million diverse samples. This extensive training regimen aims to bolster its proficiency in complex cognitive tasks while retaining strong language understanding and commonsense reasoning abilities.

Quick Start & Requirements

  • Installation: pip install -r requirements.txt
  • Prerequisites: Python 3.10+, transformers library. Models can be downloaded from Hugging Face, ModelScope, or WiseModel.
  • Local Inference: Example provided using Hugging Face transformers for local execution on CUDA-enabled GPUs.
  • Ollama: Supports running Yi-1.5 models locally via ollama run yi:v1.5.
  • vLLM: Deployment via vLLM's OpenAI-compatible API server is supported.
  • Web Demo: Local web demo available via python demo/web_demo.py -c <your-model-path>.
  • Docs: Yi Cookbook, FAQ, Learning Hub

Highlighted Details

  • Enhanced performance in coding, math, reasoning, and instruction-following.
  • Available in 34B, 9B, and 6B parameter sizes.
  • OpenAI-compatible API available via Yi Platform, Replicate, and OpenRouter.
  • Supports fine-tuning with popular frameworks like LLaMA-Factory, Swift, XTuner, and Firefly.

Maintenance & Community

Licensing & Compatibility

  • Licensed under Apache 2.0.
  • Derivative works require attribution to 01.AI.

Limitations & Caveats

The README does not specify hardware requirements for each model size or provide explicit benchmarks comparing Yi-1.5 against other leading LLMs.

Health Check
Last commit

8 months ago

Responsiveness

1 week

Pull Requests (30d)
0
Issues (30d)
0
Star History
3 stars in the last 90 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), Simon Willison Simon Willison(Author of Django), and
4 more.

Yi by 01-ai

0.1%
8k
Open-source bilingual LLMs trained from scratch
created 1 year ago
updated 8 months ago
Feedback? Help us improve.