simply  by google-deepmind

Minimal JAX codebase for accelerating frontier AI research

Created 1 year ago
490 stars

Top 63.1% on SourcePulse

GitHubView on GitHub
Project Summary

Summary

Simply is a minimal, scalable JAX codebase for rapid iteration on frontier LLM and autoregressive model research. It targets researchers and AI agents, aiming to drastically reduce implementation time for new ideas via a simple, self-contained environment.

How It Works

Built on JAX for accelerated computation, Simply uses Orbax for checkpointing and Grain for data pipelines. Its design emphasizes minimal abstractions and dependencies for a hackable codebase. A key feature is AI agent integration, enabling autonomous research cycles where agents propose, run, and iterate experiments.

Quick Start & Requirements

Install via pip install .. JAX installation is environment-specific (CPU, GPU CUDA 13, TPU), see https://docs.jax.dev/en/latest/installation.html. Optional dependencies: [tfds], [math-eval], [dev]. Assets downloaded via python setup/setup_assets.py. Example local test: python -m simply.main --experiment_config lm_test --experiment_dir /tmp/${EXP} --alsologtostderr. Cloud TPU support is detailed in guides.

Highlighted Details

  • Facilitates rapid iteration and implementation of novel research ideas in LLMs and autoregressive models.
  • Enables automated AI research through agent integration for autonomous experiment design, execution, and iteration.
  • Minimalist design with few dependencies for ease of understanding and modification.
  • Scalable architecture leveraging JAX, Orbax, and Grain for efficient training and data handling.

Maintenance & Community

Contributors include Alex Zhai, Xingjian Zhang, Jiaxi Tang, Lizhang Chen, and Ran Tian. No specific community channels or roadmap links are provided.

Licensing & Compatibility

Software components are licensed under Apache License 2.0 (Apache 2.0), permissive for commercial use. Other materials use Creative Commons Attribution 4.0 International License (CC-BY), requiring attribution.

Limitations & Caveats

As a research codebase for rapid iteration, it may be experimental. The README notes limited initial datasets and models for testing, with more planned. No explicit mention of alpha/beta status or known bugs is present.

Health Check
Last Commit

2 days ago

Responsiveness

Inactive

Pull Requests (30d)
4
Issues (30d)
0
Star History
440 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Vincent Weisser Vincent Weisser(Cofounder of Prime Intellect), and
1 more.

AgentLaboratory by SamuelSchmidgall

0.5%
5k
Agentic framework for autonomous research workflows
Created 1 year ago
Updated 6 months ago
Feedback? Help us improve.