Awesome-LLM-Long-Context-Modeling by Xnhyacinth

Curated list of papers/blogs on LLM long context modeling

Created 2 years ago

1,871 stars

Top 23.0% on SourcePulse

View on GitHub

1 Expert Loves This Project

Binyuan Hui

Research Scientist at Alibaba Qwen

Project Summary

This repository serves as a curated collection of papers and blog posts focused on Large Language Model (LLM) based Long Context Modeling. It aims to be a comprehensive resource for researchers and practitioners interested in techniques that enable LLMs to process and understand extended sequences of text, covering areas like efficient attention mechanisms, memory augmentation, and length extrapolation.

How It Works

The repository organizes a vast and rapidly evolving field by categorizing relevant research papers and articles. It covers key sub-topics such as sparse and linear attention, recurrent transformers, state space models, retrieval-augmented generation (RAG), and various methods for compressing context or extending model context windows. The collection is regularly updated with recent publications, providing a dynamic overview of advancements in long context modeling.

Quick Start & Requirements

This is a curated list of research resources, not a software library. No installation or execution is required. The primary requirement is access to academic papers and online articles.

Highlighted Details

Extensive categorization of papers across 16 major themes related to long context modeling.
Regularly updated "News" section highlighting recently published papers.
Includes links to a comprehensive survey paper and its associated repository.
Provides bibtex citation for the survey paper.

Maintenance & Community

The repository is actively maintained by Xnhyacinth, with contributions welcomed via pull requests. It links to a related GitHub repository for further collaboration.

Licensing & Compatibility

The repository is licensed under the MIT License, allowing for broad use and distribution.

Limitations & Caveats

As a collection of links and summaries, the repository itself does not implement any models or code. The quality and accessibility of the linked papers depend on their original sources.

Awesome-LLM-Long-Context-Modeling by Xnhyacinth

Explore Similar Projects

long-llms-learning by Strivin0311

NBCE by bojone

Selective_Context by liyucheng09

LongRoPE by microsoft

AutoCompressors by princeton-nlp

Samba by microsoft

LongLM by datamllab

unlimiformer by abertsch72

DeepSeek-V3.2-Exp by deepseek-ai

RecurrentGPT by aiwaves-cn

EAGLE by SafeAILab

LongLoRA by JIA-Lab-research