oneflow  by Oneflow-Inc

Deep learning framework for user-friendly, scalable, efficient model development

created 8 years ago
9,361 stars

Top 5.5% on sourcepulse

GitHubView on GitHub
1 Expert Loves This Project
Project Summary

OneFlow is a deep learning framework designed for user-friendliness, scalability, and efficiency. It targets researchers and engineers looking to program models with a PyTorch-like API, scale them to n-dimensional parallel execution using Global Tensor, and accelerate deployment via its Graph Compiler.

How It Works

OneFlow utilizes a Global Tensor abstraction to manage distributed data across multiple devices and nodes, enabling seamless n-dimensional parallelism. Its Graph Compiler optimizes the computation graph for efficient execution, facilitating model acceleration and deployment. This approach aims to simplify distributed training and inference compared to traditional frameworks.

Quick Start & Requirements

  • Install: python3 -m pip install oneflow (stable CUDA), python3 -m pip install --pre oneflow -f https://oneflow-staging.oss-cn-beijing.aliyuncs.com/branch/master/cu118 (nightly CUDA).
  • Prerequisites: Python 3.7-3.11, CUDA arch 60+, CUDA Toolkit 10.0+, Nvidia driver 440.33+. Docker images are available.
  • Resources: Building from source requires libopenblas-dev, nasm, g++, gcc, python3-pip, cmake, autoconf, libtool.
  • Docs: QUICKSTART, API Reference.

Highlighted Details

  • PyTorch-like API for ease of use.
  • N-dimensional parallelism via Global Tensor.
  • Graph Compiler for deployment acceleration.
  • Includes Libai for large-scale Transformer models and FlowVision for computer vision tasks.

Maintenance & Community

  • Developed by OneFlow Inc and Zhejiang Lab.
  • Community channels include GitHub issues, QQ group (331883), WeChat, Discord, Twitter, LinkedIn, and Medium.

Licensing & Compatibility

  • License: Apache License 2.0.
  • Permissive license suitable for commercial use and integration into closed-source projects.

Limitations & Caveats

  • Some model zoo links (OneFlow-Models, OneFlow-Benchmark) are marked as outdated.
  • Building from source requires specific system dependencies and careful CMake configuration.
Health Check
Last commit

2 days ago

Responsiveness

1 day

Pull Requests (30d)
2
Issues (30d)
0
Star History
1,129 stars in the last 90 days

Explore Similar Projects

Starred by Tri Dao Tri Dao(Chief Scientist at Together AI), Stas Bekman Stas Bekman(Author of Machine Learning Engineering Open Book; Research Engineer at Snowflake), and
1 more.

oslo by tunib-ai

0%
309
Framework for large-scale transformer optimization
created 3 years ago
updated 2 years ago
Starred by Stas Bekman Stas Bekman(Author of Machine Learning Engineering Open Book; Research Engineer at Snowflake) and Zhiqiang Xie Zhiqiang Xie(Author of SGLang).

veScale by volcengine

0.1%
839
PyTorch-native framework for LLM training
created 1 year ago
updated 3 weeks ago
Starred by Jeff Hammerbacher Jeff Hammerbacher(Cofounder of Cloudera), Stas Bekman Stas Bekman(Author of Machine Learning Engineering Open Book; Research Engineer at Snowflake), and
6 more.

gpt-neox by EleutherAI

0.1%
7k
Framework for training large-scale autoregressive language models
created 4 years ago
updated 1 week ago
Starred by Aravind Srinivas Aravind Srinivas(Cofounder of Perplexity), Stas Bekman Stas Bekman(Author of Machine Learning Engineering Open Book; Research Engineer at Snowflake), and
12 more.

DeepSpeed by deepspeedai

0.2%
40k
Deep learning optimization library for distributed training and inference
created 5 years ago
updated 1 day ago
Starred by Peter Norvig Peter Norvig(Author of Artificial Intelligence: A Modern Approach; Research Director at Google), Aravind Srinivas Aravind Srinivas(Cofounder of Perplexity), and
45 more.

tensorflow by tensorflow

0.1%
191k
Open-source ML framework
created 9 years ago
updated 15 hours ago
Feedback? Help us improve.