Discover and explore top open-source AI tools and projects—updated daily.
lillian039JAX implementation of continuous diffusion language models
New!
Top 45.8% on SourcePulse
ELF: Embedded Language Flows is a JAX implementation of continuous diffusion language models based on Flow Matching. It addresses the challenge of generating discrete text by operating predominantly within a continuous embedding space, simplifying the integration of techniques from image diffusion models like classifier-free guidance. This approach allows for more straightforward adaptation and potentially more fluent and efficient text generation, targeting researchers and practitioners in the LLM space.
How It Works
ELF utilizes continuous-time Flow Matching, a class of continuous diffusion models. The core innovation is maintaining data representation in a continuous embedding space throughout the diffusion process, only mapping to discrete tokens at the final time step via a shared-weight network. This design facilitates the direct application of established image diffusion model techniques, such as classifier-free guidance (CFG), and enables the model to progressively refine ungrammatical sequences into fluent text by denoising trajectories in the continuous space.
Quick Start & Requirements
pip install -r requirements.txtembedded-language-flows/) via the --checkpoint_path argument.configs/) and Python scripts (src/eval.py, src/train.py) provide detailed usage examples.Highlighted Details
Maintenance & Community
No specific details regarding maintainers, community channels (e.g., Discord, Slack), or active development signals are present in the provided README.
Licensing & Compatibility
The project is released under the MIT License, which generally permits commercial use and integration into closed-source projects without significant restrictions.
Limitations & Caveats
The implementation is primarily optimized and tested for TPUs; performance on other hardware accelerators may vary. A PyTorch version is planned but not yet available. Reported metrics may show slight variations depending on the specific compute setup used for evaluation.
1 week ago
Inactive
Aleph-Alpha-Research
facebookresearch
ZHZisZZ
mlfoundations