entropix by xjdr-alt

Research project for entropy-based context-aware sampling & parallel CoT decoding

Created 11 months ago
3,422 stars

Top 14.1% on SourcePulse

Project Summary

This project explores entropy-based sampling for large language models, aiming to improve inference quality by making sampling context-aware. It targets researchers and developers seeking to enhance LLM reasoning and output through novel sampling techniques, potentially simulating advanced CoT capabilities.

How It Works

Entropix uses entropy and "varentropy" (the variance of per-token surprisal under the predicted distribution, i.e., how unevenly uncertainty is spread across candidate tokens) as signals to guide the sampling process. Low entropy indicates a confident, predictable next token, while high entropy suggests uncertainty and potential for exploration. The sampler switches strategies based on these states, aiming for more nuanced and contextually relevant text generation, akin to advanced chain-of-thought prompting.
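The idea above can be sketched in a few lines. This is an illustrative reconstruction, not the project's actual sampler: the threshold values and branch names (`argmax`, `sample`, `explore`) are hypothetical placeholders for the kind of dispatch the entropy/varentropy signals enable.

```python
import numpy as np

def entropy_varentropy(logits):
    """Shannon entropy of the next-token distribution, plus its
    "varentropy": the variance of per-token surprisal -log p under p."""
    logits = logits - logits.max()               # numerical stability
    probs = np.exp(logits) / np.exp(logits).sum()
    surprisal = -np.log(probs + 1e-12)           # per-token -log p
    ent = float((probs * surprisal).sum())
    varent = float((probs * (surprisal - ent) ** 2).sum())
    return ent, varent

def choose_strategy(logits, ent_thresh=2.0, varent_thresh=2.0):
    """Hypothetical dispatch on the two signals; thresholds are made up
    for illustration, not taken from the entropix codebase."""
    ent, varent = entropy_varentropy(logits)
    if ent < ent_thresh and varent < varent_thresh:
        return "argmax"   # confident and evenly so: take the greedy token
    if ent > ent_thresh and varent > varent_thresh:
        return "explore"  # uncertain and unevenly so: branch / raise temperature
    return "sample"       # otherwise: ordinary temperature sampling
```

For a uniform distribution the entropy is maximal and the varentropy is zero (every token carries the same surprisal); for a sharply peaked distribution both are near zero, so the greedy branch fires.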

Quick Start & Requirements

  • Install: poetry install
  • Prerequisites: Python 3.x, Poetry, Rust (for tiktoken), Hugging Face CLI (for model weights), CUDA (implied for GPU usage).
  • Setup: Requires downloading model weights and tokenizer files.
  • Run: PYTHONPATH=. poetry run python entropix/main.py (JAX) or PYTHONPATH=. poetry run python entropix/torch_main.py (PyTorch).
  • Docs: no dedicated documentation is linked; the README is the primary reference.

Highlighted Details

  • Supports Llama 3.1+ models, with plans for DeepSeek V2 and Mistral Large.
  • Offers both JAX (for TPU) and PyTorch (for GPU) implementations.
  • Includes notes on disabling JAX JIT for faster iteration and managing VRAM.
  • Future plans include splitting into entropix-local (single GPU, Metal) and entropix (multi-GPU, TPU) repos, plus a training component.

Maintenance & Community

  • The project is described as a research work-in-progress with active development and plans for significant restructuring.
  • Author is active on X (@_xjdr).
  • Acknowledges contributions from several individuals for compute and development support.

Licensing & Compatibility

  • No license is explicitly stated in the README.
  • Compatibility for commercial use or closed-source linking is undetermined.

Limitations & Caveats

The project is explicitly labeled "HERE BE DRAGONS!!!! THIS IS NOT A FINISHED PRODUCT AND WILL BE UNSTABLE AS HELL RIGHT NOW." Significant restructuring is planned, and PRs are temporarily discouraged. The current state may be partially broken with an unmerged backlog.

Health Check
Last Commit

10 months ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
22 stars in the last 30 days

Explore Similar Projects

Starred by Yineng Zhang (Inference Lead at SGLang; Research Scientist at Together AI).

dots.llm1 by rednote-hilab

0.2%
462
MoE model for research
Created 4 months ago
Updated 4 weeks ago
Starred by Yineng Zhang (Inference Lead at SGLang; Research Scientist at Together AI), Nikola Borisov (Founder and CEO of DeepInfra), and 3 more.

tensorrtllm_backend by triton-inference-server

0.2%
889
Triton backend for serving TensorRT-LLM models
Created 2 years ago
Updated 1 day ago
Starred by Shizhe Diao (Author of LMFlow; Research Scientist at NVIDIA), Yineng Zhang (Inference Lead at SGLang; Research Scientist at Together AI), and 8 more.

EAGLE by SafeAILab

10.6%
2k
Speculative decoding research paper for faster LLM inference
Created 1 year ago
Updated 1 week ago
Starred by Andrej Karpathy (Founder of Eureka Labs; formerly at Tesla and OpenAI; author of CS 231n), Tim J. Baek (Founder of Open WebUI), and 7 more.

gemma.cpp by google

0.1%
7k
C++ inference engine for Google's Gemma models
Created 1 year ago
Updated 1 day ago