Discover and explore top open-source AI tools and projects—updated daily.
kengzModular RL framework for agent training
Top 29.7% on SourcePulse
Summary
SLM Lab is a modular deep reinforcement learning (RL) framework in PyTorch, targeting researchers and engineers. It simplifies RL experimentation by offering ready-to-use algorithms, easy configuration via JSON, and robust reproducibility features. The framework aims to streamline research workflows and facilitate scalable training through cloud integrations.
How It Works
The framework's core is its modular design, enabling experiment definition solely through JSON spec files, eliminating code changes. Built on PyTorch and compatible with Gymnasium environments, it ensures reproducibility by automatically logging experiment specs and git SHAs. Integrated analysis tools provide training curves and TensorBoard logging, while dstack and HuggingFace support cloud-based training and results sharing.
Quick Start & Requirements
Installation uses uv: uv tool install --editable .. Run demos with slm-lab run (add --render for visualization). Custom experiments: slm-lab run spec.json spec_name train (local) or slm-lab run-remote spec.json spec_name train (cloud via dstack). Cloud setup requires copying .env.example to .env (for HuggingFace token) and installing dstack. For the "Foundations of Deep Reinforcement Learning" book, checkout v4.1.1. v5.0 updates dependencies.
Highlighted Details
Maintenance & Community
The repository is hosted on GitHub. No specific details on active maintenance, contributors, or community channels (e.g., Discord, Slack) are provided in the README excerpt.
Licensing & Compatibility
Released under the MIT License, permitting broad use, including commercial applications and integration into closed-source projects.
Limitations & Caveats
Version 5.0 updates dependencies to Gymnasium and modern tooling, potentially requiring environment adjustments. Users referencing the companion book must checkout tag v4.1.1 to match its code examples.
1 day ago
Inactive
allenai
SWE-Gym
inoryy
PrimeIntellect-ai
mosaicml
higgsfield-ai