Omega-AI by dromara

Java DL framework for model training/inference, supporting multi-GPU

Created 6 years ago

496 stars

Top 62.6% on SourcePulse

Project Summary

Omega-AI is a deep learning framework built in Java, designed to simplify the creation, training, and inference of neural networks for Java developers. It supports automatic differentiation, multi-threading, and GPU acceleration via CUDA and cuDNN, enabling rapid development of AI models.

How It Works

Omega-AI provides a comprehensive set of layers, optimizers, and loss functions, allowing users to build various neural network architectures. Its core advantage lies in its Java-native implementation, aiming to lower the barrier to entry for AI development within the Java ecosystem. The framework emphasizes performance through GPU acceleration and optimized CUDA kernels for operations like matrix multiplication and convolution.
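
The way a layer, a loss function, and an optimizer fit together can be sketched in plain Java. This is a minimal illustration, not Omega-AI's API: the class `TinyLinearRegression` and all of its methods are hypothetical names, the gradients are derived by hand rather than by automatic differentiation, and everything runs on the CPU.

```java
// Plain-Java illustration only; none of these names come from Omega-AI.
final class TinyLinearRegression {
    double w = 0.0; // weight of the single "layer"
    double b = 0.0; // bias of the single "layer"

    // Forward pass of the layer: y = w * x + b.
    double forward(double x) {
        return w * x + b;
    }

    // Loss function: mean squared error over a batch.
    double mseLoss(double[] xs, double[] ys) {
        double sum = 0.0;
        for (int i = 0; i < xs.length; i++) {
            double diff = forward(xs[i]) - ys[i];
            sum += diff * diff;
        }
        return sum / xs.length;
    }

    // Optimizer: one gradient-descent step using hand-derived MSE gradients.
    void sgdStep(double[] xs, double[] ys, double learningRate) {
        double gradW = 0.0;
        double gradB = 0.0;
        for (int i = 0; i < xs.length; i++) {
            double diff = forward(xs[i]) - ys[i];
            gradW += 2.0 * diff * xs[i] / xs.length;
            gradB += 2.0 * diff / xs.length;
        }
        w -= learningRate * gradW;
        b -= learningRate * gradB;
    }

    public static void main(String[] args) {
        double[] xs = {0.0, 1.0, 2.0, 3.0};
        double[] ys = {1.0, 3.0, 5.0, 7.0}; // generated by y = 2x + 1
        TinyLinearRegression model = new TinyLinearRegression();
        for (int epoch = 0; epoch < 2000; epoch++) {
            model.sgdStep(xs, ys, 0.05);
        }
        System.out.println("w=" + model.w + " b=" + model.b
                + " loss=" + model.mseLoss(xs, ys));
    }
}
```

A framework with automatic differentiation would compute `gradW` and `gradB` for the user; they are written out here only to keep the sketch self-contained.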

Quick Start & Requirements

Installation: Add the omega-engine-v4-gpu artifact to your Maven or Gradle project.
Prerequisites: Java Development Kit (JDK), CUDA Toolkit (version must match the omega-engine-v4-gpu dependency, e.g., CUDA 11.7 for win-cu11.7-v1.0-beta), cuDNN.
GPU Support: Requires NVIDIA GPU with CUDA support.
Documentation: Official website: https://omega-ai.dromara.org
Code: GitHub: https://github.com/dromara/Omega-AI, Gitee: https://gitee.com/dromara/omega-ai
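
Because the CUDA Toolkit version has to match the dependency, it can help to check the version tag before debugging a failed GPU start. The sketch below assumes the artifact version embeds the CUDA release as `cu<major>.<minor>`, a convention inferred only from the single example `win-cu11.7-v1.0-beta` above. `CudaVersionTag` and its methods are hypothetical helpers, not part of Omega-AI.

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper; the "cu<major>.<minor>" naming convention is an assumption.
final class CudaVersionTag {
    private static final Pattern CUDA_TAG = Pattern.compile("cu(\\d+)\\.(\\d+)");

    // Extracts e.g. "11.7" from "win-cu11.7-v1.0-beta", or empty if no tag is found.
    static Optional<String> cudaVersionOf(String artifactVersion) {
        Matcher m = CUDA_TAG.matcher(artifactVersion);
        return m.find() ? Optional.of(m.group(1) + "." + m.group(2)) : Optional.empty();
    }

    // True only when the tag is present and equals the installed toolkit version.
    static boolean matchesToolkit(String artifactVersion, String installedToolkit) {
        return cudaVersionOf(artifactVersion).map(installedToolkit::equals).orElse(false);
    }

    public static void main(String[] args) {
        String artifact = "win-cu11.7-v1.0-beta";
        System.out.println(cudaVersionOf(artifact).orElse("unknown")); // prints 11.7
        System.out.println(matchesToolkit(artifact, "11.7"));          // prints true
    }
}
```

The installed toolkit version is passed in as a string here; how it is obtained (for example from `nvcc --version`) is left to the caller.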

Highlighted Details

Supports a wide range of models including BP, CNN, RNN, VGG16, ResNet, YOLO, Transformer, GPT, LLaMA, and Diffusion models.
Features automatic differentiation and multi-threaded CPU/GPU computation.
Includes demos for various tasks like image classification (MNIST, CIFAR-10), object detection (YOLOv1, v3, v7), text generation (RNN, GPT), and image generation (GAN, Diffusion).
Provides implementations for common layers (Convolution, Pooling, Fully Connected, RNN, LSTM, Transformer blocks) and activation/normalization functions (ReLU, Leaky ReLU, BN, LN).
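
For readers unfamiliar with the activation and normalization functions named above, the following plain-Java sketch shows the textbook definitions of ReLU, Leaky ReLU, and layer normalization (LN). It is not taken from Omega-AI: `ActivationSketch` is a hypothetical class, it works on `double[]` on the CPU, and the layer normalization omits the learnable scale and shift parameters that full implementations usually include.

```java
// Textbook definitions for illustration; not Omega-AI code.
final class ActivationSketch {
    // ReLU: passes positive inputs through, clamps negatives to zero.
    static double relu(double x) {
        return Math.max(0.0, x);
    }

    // Leaky ReLU: like ReLU, but negatives are scaled by a small slope.
    static double leakyRelu(double x, double slope) {
        return x > 0.0 ? x : slope * x;
    }

    // Layer normalization over one feature vector: subtract the mean and
    // divide by the standard deviation (eps avoids division by zero).
    static double[] layerNorm(double[] x, double eps) {
        double mean = 0.0;
        for (double v : x) {
            mean += v;
        }
        mean /= x.length;

        double variance = 0.0;
        for (double v : x) {
            variance += (v - mean) * (v - mean);
        }
        variance /= x.length;

        double invStd = 1.0 / Math.sqrt(variance + eps);
        double[] out = new double[x.length];
        for (int i = 0; i < x.length; i++) {
            out[i] = (x[i] - mean) * invStd;
        }
        return out;
    }

    public static void main(String[] args) {
        System.out.println(relu(-2.0));           // prints 0.0
        System.out.println(relu(3.0));            // prints 3.0
        System.out.println(leakyRelu(-2.0, 0.1)); // prints -0.2
        System.out.println(java.util.Arrays.toString(layerNorm(new double[] {1, 2, 3}, 1e-5)));
    }
}
```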

Maintenance & Community

Development has included regular updates and new model implementations, though the most recent commit was 3 months ago (see Health Check).
Community discussion via QQ group: 119593195.
Contact: 465973119@qq.com.

Licensing & Compatibility

The README does not explicitly state a license. Hosting on Gitee and GitHub does not by itself indicate a permissive open-source license, so verify the license terms in the repository before any commercial use.

Limitations & Caveats

The project is primarily Java-focused, which may limit its adoption by users accustomed to Python-based deep learning ecosystems.
GPU support relies on JCuda and specific CUDA/cuDNN versions, requiring careful environment setup.
Some demos and model implementations might be in early stages or require specific dataset formats.

Health Check

Last Commit: 3 months ago
Responsiveness: Inactive
Pull Requests (30d): 0
Issues (30d): 1

Star History

1 star in the last 30 days

Explore Similar Projects

Starred by Vincent Weisser (Cofounder of Prime Intellect), Wing Lian (Founder of Axolotl AI), and 1 more.

varuna by microsoft

Tool for efficient large DNN model training on commodity hardware

Created 4 years ago

Updated 1 year ago

MPP-LLaVA by Coobiw

MLLM for training LLaVA-like models on limited hardware

Created 2 years ago

Updated 10 months ago

awesome-cuda-and-hpc by coderonion

Curated list of CUDA and HPC resources

Created 2 years ago

Updated 5 months ago

Starred by Jeff Hammerbacher (Cofounder of Cloudera).

femtoGPT by keyvank

Rust library for minimal Generative Pretrained Transformer (GPT) models

Created 2 years ago

Updated 2 months ago

app_deep_learning by jeffheaton

PyTorch course for deep learning applications

Created 2 years ago

Updated 17 hours ago

neural-api by joaopauloschuler

Pascal-based deep learning API for AVX/OpenCL-capable devices

Created 6 years ago

Updated 2 weeks ago

Starred by Jeff Hammerbacher (Cofounder of Cloudera).

zero_to_gpt by VikParuchuri

Course for training your own GPT model from scratch

Created 3 years ago

Updated 1 year ago

KuiperInfer by zjhellofss

Deep learning inference library for model deployment

Created 3 years ago

Updated 6 months ago

Starred by Omar Sanseviero (DevRel at Google DeepMind), Jeff Hammerbacher (Cofounder of Cloudera), and 3 more.

mindspore by mindspore-ai

Deep learning framework for mobile, edge, and cloud training/inference

Created 6 years ago

Updated 1 year ago

Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems").

MNN by alibaba

Lightweight deep learning framework for on-device inference and training

Created 6 years ago

Updated 3 days ago

Starred by Peter Norvig (Author of "Artificial Intelligence: A Modern Approach"; Research Director at Google), Alexey Milovidov (Cofounder of Clickhouse), and 29 more.

llm.c by karpathy

LLM training in pure C/CUDA, no PyTorch needed

Created 1 year ago

Updated 6 months ago

Starred by Benjamin Bolte (Cofounder of K-Scale Labs), Pawel Garbacki (Cofounder of Fireworks AI), and 11 more.

DeepLearningExamples by NVIDIA

Deep learning examples for training and deployment

Created 7 years ago

Updated 1 year ago
