marc by ekinakyurek

Research paper implementation for abstract reasoning via test-time training

Created 1 year ago

340 stars

Top 81.1% on SourcePulse

View on GitHub

1 Expert Loves This Project

Shizhe Diao

Author of LMFlow; Research Scientist at NVIDIA

Project Summary

This repository provides the official implementation for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning," focusing on applying Test-Time Training (TTT) to large language models for abstract reasoning tasks. It is intended for researchers and practitioners interested in advancing AI capabilities in complex problem-solving.

How It Works

The project leverages a modified version of the torchtune library for its Test-Time Training pipeline. It fine-tunes large language models (specifically Llama-3 variants) and then applies TTT to adapt these models to specific abstract reasoning tasks during inference. This approach aims to improve performance by allowing the model to learn from the test data distribution without explicit retraining.

Quick Start & Requirements

Installation: Requires cloning the repository recursively, setting up a conda environment (Python 3.10), installing a specific fork of torchtune in editable mode, and then installing other dependencies via pip install torch torchao --pre --upgrade --index-url https://download.pytorch.org/whl/nightly/cu121 and pip install -r requirements.txt.
Data: Requires downloading the ARC dataset from Kaggle.
Models: Pre-trained and fine-tuned models, as well as TTT checkpoints and LoRA adapters, are available on Hugging Face.
Hardware: Requires a CUDA-enabled GPU (cu121 specified).
Docs: https://github.com/ekinakyurek/marc

Highlighted Details

Official implementation of a paper demonstrating TTT effectiveness for abstract reasoning.
Supports Llama-3 and Llama-3.1/3.2 models with specific vLLM compatibility instructions.
Provides LoRA adapters for efficient fine-tuning and TTT.
Includes scripts for both TTT training and inference.

Maintenance & Community

The repository is marked as "in progress" with a caution to report errors. No specific community channels or roadmap are explicitly mentioned in the README.

Licensing & Compatibility

The README does not explicitly state a license. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

The repository is explicitly stated to be in progress and should be used with caution. Some functionalities, like lora_to_output, may not apply to all model versions. Separate vLLM environments are required for different Llama versions due to compatibility issues.

marc by ekinakyurek

Explore Similar Projects

SFTvsRL by LeslieTrue

ReasonFlux by Gen-Verse

MetaMath by meta-math

OpenLTM by thuml

Light-R1 by Qihoo360

Seed-Coder by ByteDance-Seed

alpaca_farm by tatsu-lab

PandaLM by WeOpenML

LlamaAcademy by danielgross

OLMoE by allenai

OpenMoE by XueFuzhao

lingua by facebookresearch