SteptronOss  by stepfun-ai

AI-native LLM training framework for rapid iteration

Created 2 months ago
425 stars

Top 69.7% on SourcePulse

GitHubView on GitHub
Project Summary

A lightweight, AI-native training framework for large language models, StepTronOSS is designed for fast iteration and reproducible experiments. It targets researchers and engineers, offering modular configuration for Supervised Fine-Tuning (SFT), Reinforcement Learning from Human Feedback (RLHF), and evaluation workflows, with PyTorch as its sole core dependency.

How It Works

The framework employs a config-driven approach with dynamic validation tools like cfshow and sanity_check for rapid development and inspection. It supports multi-task orchestration and extensible data, optimizer, and model stacks for research agility. Key principles include stateless configurations, runtime state management, and cross-node linkage via Ref. Operator-level replacements enable performance acceleration through custom kernels.

Quick Start & Requirements

Installation requires redis-server and uv (recommended). Core commands involve uv sync install, uv run cfshow for config inspection, uv run torchrun for single-task experiments, and uv run tools/mp_run.py for multi-task setups, which requires STEPTRON_MEET_DIR to be set to a shared directory.

  • Launch Guide (EN): docs/LAUNCH_EXPERIMENTS.md
  • SFT Data Prep (EN): docs/SFT_DATA_PREPARATION_EN.md

Highlighted Details

  • Config-driven experiments with dynamic validation (cfshow, sanity_check).
  • Multi-task orchestration and flexible launch tooling.
  • Extensible data, optimizer, and model stacks for rapid research iteration.
  • Operator-level optimization support (e.g., flash-attn, grouped_gemm).
  • AI-native workflow with agent contribution guidance (AGENTS.md).
  • Reference configurations for Qwen3 8B and Step3.5 Flash.

Maintenance & Community

No specific community links, contributors, or roadmap details were found in the provided README. The project includes AGENTS.md to guide AI agents in contributing code, suggesting a focus on AI-assisted development.

Licensing & Compatibility

The provided README does not specify a license type or compatibility notes for commercial use.

Limitations & Caveats

The framework is actively under development. Evaluation, RLHF, and Triton kernel implementations are marked as incomplete. Users should note the absence of these features in the current release.

Health Check
Last Commit

2 days ago

Responsiveness

Inactive

Pull Requests (30d)
21
Issues (30d)
2
Star History
429 stars in the last 30 days

Explore Similar Projects

Starred by Zhiqiang Xie Zhiqiang Xie(Coauthor of SGLang), Eric Zhu Eric Zhu(Coauthor of AutoGen; Research Scientist at Microsoft Research), and
3 more.

Trace by microsoft

0.1%
711
AutoDiff-like tool for end-to-end AI agent training with general feedback
Created 1 year ago
Updated 3 months ago
Feedback? Help us improve.