O1-Journey  by GAIR-NLP

Research paper on replicating O1 via "journey learning"

created 10 months ago
1,999 stars

Top 22.6% on sourcepulse

GitHubView on GitHub
Project Summary

This project documents a transparent, real-time replication effort of OpenAI's O1 model, focusing on a novel "journey learning" paradigm. It targets AI researchers and practitioners interested in understanding and reproducing advanced LLM capabilities, particularly in complex reasoning tasks. The primary benefit is the open sharing of methodologies, datasets, and findings for advancing AI research.

How It Works

The project introduces "journey learning," a paradigm emphasizing continuous progress through learning, reflection, and adaptation, mimicking human-like intelligence with capabilities for backtracking and refinement. This approach is applied to replicate O1, with Part 2 demonstrating that simple distillation from O1's API, combined with supervised fine-tuning, can surpass O1-preview performance on mathematical reasoning. Part 3 explores inference-time scaling for medical reasoning, showing significant performance gains with extended reasoning time.

Quick Start & Requirements

Highlighted Details

  • Introduces "journey learning," a paradigm for continuous AI progress with reflection and adaptation.
  • Demonstrates simple distillation from O1's API can outperform O1-preview on mathematical reasoning.
  • Shows inference-time scaling improves medical reasoning performance by 6%-11% with minimal training data.
  • Models exhibit improved generalization, reduced hallucination, and better safety.

Maintenance & Community

The core development team consists of undergraduate and PhD students from Shanghai Jiao Tong University's GAIR research group, guided by researchers from NYU and MBZUAI. Contact is available via email for those interested in joining.

Licensing & Compatibility

The README does not explicitly state a license. Given the nature of replicating proprietary models and the academic context, users should verify licensing for any derived works or commercial use.

Limitations & Caveats

The project is presented as an ongoing "journey" with resources gradually released. Specific implementation details and direct code for replication are not immediately available in the README, requiring users to consult the linked papers and potentially await further releases.

Health Check
Last commit

6 months ago

Responsiveness

1 week

Pull Requests (30d)
0
Issues (30d)
0
Star History
19 stars in the last 90 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems) and Shishir Patil Shishir Patil(Author of BFCL, Gorilla).

SkyThought by NovaSky-AI

0.2%
3k
Training recipes for Sky-T1 family of models
created 6 months ago
updated 3 weeks ago
Feedback? Help us improve.