Research paper on replicating O1 via "journey learning"
Top 22.6% on sourcepulse
This project documents a transparent, real-time replication effort of OpenAI's O1 model, focusing on a novel "journey learning" paradigm. It targets AI researchers and practitioners interested in understanding and reproducing advanced LLM capabilities, particularly in complex reasoning tasks. The primary benefit is the open sharing of methodologies, datasets, and findings for advancing AI research.
How It Works
The project introduces "journey learning," a paradigm emphasizing continuous progress through learning, reflection, and adaptation, mimicking human-like intelligence with capabilities for backtracking and refinement. This approach is applied to replicate O1, with Part 2 demonstrating that simple distillation from O1's API, combined with supervised fine-tuning, can surpass O1-preview performance on mathematical reasoning. Part 3 explores inference-time scaling for medical reasoning, showing significant performance gains with extended reasoning time.
Quick Start & Requirements
Highlighted Details
Maintenance & Community
The core development team consists of undergraduate and PhD students from Shanghai Jiao Tong University's GAIR research group, guided by researchers from NYU and MBZUAI. Contact is available via email for those interested in joining.
Licensing & Compatibility
The README does not explicitly state a license. Given the nature of replicating proprietary models and the academic context, users should verify licensing for any derived works or commercial use.
Limitations & Caveats
The project is presented as an ongoing "journey" with resources gradually released. Specific implementation details and direct code for replication are not immediately available in the README, requiring users to consult the linked papers and potentially await further releases.
6 months ago
1 week