WizardLM  by nlpxucan

LLMs built using Evol-Instruct for complex instruction following

created 2 years ago
9,436 stars

Top 5.4% on sourcepulse

GitHubView on GitHub
Project Summary

WizardLM is a suite of large language models (LLMs) designed to excel at following complex instructions, with specialized versions for coding (WizardCoder) and mathematical reasoning (WizardMath). It targets researchers and developers seeking high-performance, instruction-following models that outperform many existing open-source alternatives and compete with leading proprietary models.

How It Works

The core innovation is Evol-Instruct, a method that uses LLMs to automatically generate diverse and complex instructions, progressively increasing difficulty. This approach enhances the model's ability to understand and execute intricate commands, leading to improved performance across various benchmarks.

Quick Start & Requirements

  • Models are available via Hugging Face 🤗 HF Repo.
  • Requires Python 3.9+.
  • Specific model requirements (e.g., GPU, VRAM) depend on the model size.
  • Refer to individual model directories for detailed setup and inference scripts.

Highlighted Details

  • WizardCoder-33B-V1.1 achieves SOTA OSS performance on EvalPlus Leaderboard, outperforming ChatGPT 3.5 and Gemini Pro on HumanEval benchmarks.
  • WizardMath-7B-V1.1 is a top-performing 7B math LLM, surpassing ChatGPT 3.5 and Gemini Pro on GSM8k.
  • WizardLM-70B-V1.0 demonstrates strong performance on MT-Bench and AlpacaEval, with competitive GSM8k and HumanEval scores.
  • Models are fine-tuned from various base models, including Llama and DeepSeek-Coder.

Maintenance & Community

  • Active development with recent releases of WizardCoder-33B-V1.1 and WizardMath-7B-V1.1.
  • Community engagement via Discord.
  • Researchers are encouraged to provide feedback on issues and suggestions.

Licensing & Compatibility

  • Code License: Apache 2.0.
  • Data License: CC BY-NC 4.0.
  • Model weights are subject to the Llama 2 License or specific non-commercial licenses for older versions.
  • Strictly academic research and non-commercial use.

Limitations & Caveats

  • Data used for training is not publicly released due to organizational policy and legal review.
  • Output accuracy is not guaranteed due to model randomness.
  • Older model versions (e.g., WizardLM-13B-V1.1, WizardLM-30B-V1.0, WizardLM-7B-V1.0) are explicitly marked as non-commercial.
Health Check
Last commit

1 month ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
1
Star History
77 stars in the last 90 days

Explore Similar Projects

Starred by George Hotz George Hotz(Author of tinygrad; Founder of the tiny corp, comma.ai), Calvin French-Owen Calvin French-Owen(Coounder of Segment), and
12 more.

StableLM by Stability-AI

0.0%
16k
Language models by Stability AI
created 2 years ago
updated 1 year ago
Feedback? Help us improve.