DLLM-Survey  by LiQiiiii

Survey of discrete diffusion models for LLMs and multimodal applications

Created 3 months ago
282 stars

Top 92.5% on SourcePulse

GitHubView on GitHub
Project Summary

This repository provides a comprehensive survey of discrete diffusion models applied to large language and multimodal models. It aims to consolidate research in this rapidly evolving field, offering a structured overview of key concepts, techniques, and applications for researchers and practitioners interested in generative AI.

How It Works

The survey categorizes discrete diffusion models based on their core methodologies, including discrete denoising diffusion probabilistic models, reparameterized discrete diffusion models, and concrete score matching. It details various training techniques such as initialization, masking strategies, and addressing training-testing discrepancies, as well as inference techniques like unmasking, remasking, prefilling, and caching. The paper also explores guidance techniques and categorizes applications across text generation, editing, summarization, sentiment analysis, knowledge reasoning, and multimodal tasks.

Quick Start & Requirements

This is a survey paper, not a software repository. No installation or execution is required. The primary resource is the linked arXiv paper.

Highlighted Details

  • Comprehensive coverage of discrete diffusion models in LLMs and multimodal contexts.
  • Detailed breakdown of training and inference techniques.
  • Extensive categorization of applications, from text generation to biological discovery.
  • Includes a timeline of dLLMs and dMLLMs and links to relevant papers.

Maintenance & Community

The repository is maintained by the authors of the survey paper. Contributions for adding new papers or updating details are welcomed via Pull Requests or email.

Licensing & Compatibility

The content of the repository is for informational purposes. The survey paper itself is likely available under a Creative Commons license or similar, common for academic preprints. Specific licensing for any code snippets or linked resources would depend on their original sources.

Limitations & Caveats

As a survey, this repository does not provide executable code or models. The field is rapidly advancing, and new research may not be immediately reflected. The "2025" in the citation suggests a future publication date, indicating the survey's forward-looking scope.

Health Check
Last Commit

3 weeks ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
1
Star History
72 stars in the last 30 days

Explore Similar Projects

Starred by Peter Norvig Peter Norvig(Author of "Artificial Intelligence: A Modern Approach"; Research Director at Google), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
2 more.

Hands-On-Large-Language-Models by HandsOnLLM

1.4%
16k
Code examples for "Hands-On Large Language Models" book
Created 1 year ago
Updated 1 month ago
Feedback? Help us improve.