ludwig  by ludwig-ai

Low-code framework for custom AI models (LLMs, neural networks)

Created 6 years ago
11,585 stars

Top 4.4% on SourcePulse

GitHubView on GitHub
Project Summary

Ludwig is a low-code, declarative deep learning framework designed for building custom AI models, including LLMs and multimodal systems. It targets researchers and engineers seeking to streamline model development, training, and productionization without extensive boilerplate code. Ludwig offers significant efficiency gains through automated optimizations and distributed training capabilities.

How It Works

Ludwig utilizes a declarative YAML configuration system to define model architecture, data preprocessing, and training parameters. This approach abstracts away complex coding, allowing users to specify model components and their interconnections. It supports multi-modal and multi-task learning by composing different feature types and encoders/decoders, enabling flexible experimentation with state-of-the-art architectures.

Quick Start & Requirements

  • Install: pip install ludwig or pip install ludwig[full]
  • Prerequisites: Python 3.8+. GPU with at least 12 GiB VRAM recommended for LLM fine-tuning (e.g., Nvidia T4). HuggingFace API token required for Llama-2 access.
  • Resources: LLM fine-tuning examples suggest a GPU with 12GB VRAM.
  • Docs: Official Getting Started Guide, Examples

Highlighted Details

  • Declarative YAML configuration for defining models and training pipelines.
  • Supports LLM fine-tuning with PEFT, QLoRA, and distributed training (DDP, DeepSpeed).
  • Native integration with HuggingFace Transformers and easy export to Torchscript/Triton.
  • Automated hyperparameter optimization, explainability, and metric visualization.

Maintenance & Community

Ludwig is hosted by the Linux Foundation AI & Data. Active community engagement via Discord and X.

Licensing & Compatibility

Licensed under Apache 2.0, permitting commercial use and integration with closed-source projects.

Limitations & Caveats

While designed for ease of use, advanced customization may still require understanding deep learning concepts. The README focuses heavily on LLM fine-tuning and tabular data, with less detail on other modalities like audio.

Health Check
Last Commit

1 week ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
0
Star History
29 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Jeff Hammerbacher Jeff Hammerbacher(Cofounder of Cloudera), and
2 more.

towhee by towhee-io

0.0%
3k
Framework for neural data processing pipelines
Created 4 years ago
Updated 11 months ago
Starred by Tobi Lutke Tobi Lutke(Cofounder of Shopify), Li Jiang Li Jiang(Coauthor of AutoGen; Engineer at Microsoft), and
26 more.

ColossalAI by hpcaitech

0.1%
41k
AI system for large-scale parallel training
Created 3 years ago
Updated 12 hours ago
Feedback? Help us improve.