This repository is a curated collection of text summarization resources for NLP researchers and practitioners. It provides a comprehensive guide to the field, covering key research topics, essential papers, available models, and datasets.
How It Works
The repository organizes information into logical sections, starting with fundamental definitions and task categories: extractive summarization, which selects salient sentences verbatim from the source, versus abstractive summarization, which generates new text. It then covers main research topics such as multi-document and long-document summarization, performance improvements through transfer learning and knowledge enhancement, and post-editing techniques. The project also addresses challenges such as data scarcity and evaluation metrics, and explores controllable text generation and aspect-based summarization.
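To make the extractive/abstractive distinction concrete, here is a minimal, self-contained sketch of an extractive summarizer that scores sentences by word frequency and returns the top-scoring ones verbatim. This is an illustrative toy, not any method from the repository; all names are hypothetical, and real extractive systems (e.g., graph-based rankers) are considerably more sophisticated. Abstractive summarization, by contrast, requires a generation model and cannot be sketched this simply.

```python
import re
from collections import Counter

def extractive_summary(text, k=2):
    """Toy extractive summarizer: return the k highest-scoring
    sentences, verbatim and in original order. Sentences are scored
    by the average corpus frequency of their words."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    freq = Counter(re.findall(r"\w+", text.lower()))

    def score(sent):
        tokens = re.findall(r"\w+", sent.lower())
        return sum(freq[t] for t in tokens) / max(len(tokens), 1)

    top = sorted(sentences, key=score, reverse=True)[:k]
    # An extractive summary copies source sentences unchanged.
    return [s for s in sentences if s in top]

text = ("Text summarization condenses a document. "
        "Extractive methods select sentences from the document. "
        "Abstractive methods generate new sentences. "
        "Both approaches aim to preserve the document's key content.")
print(extractive_summary(text, k=2))
```

The point of the sketch is the contract, not the scoring: an extractive summary is a subset of the input sentences, whereas an abstractive model may produce sentences that never appear in the source.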
Quick Start & Requirements
- Installation: There is nothing to install; this is a resource repository rather than a software package. Links to code repositories for specific models (e.g., KoBART, KoBertSum) are provided instead.
- Prerequisites: Familiarity with NLP concepts, embeddings, transfer learning, and Transformer/BERT architectures is recommended for deeper engagement.
- Resources: Links to various datasets (e.g., AIHub, WikiLingua, MLSUM) and pre-trained models (e.g., BERT, KoBART, KcBERT) are provided.
Highlighted Details
- Comprehensive categorization of summarization tasks and main research topics.
- Detailed lists of "Must-read Papers" with keywords and brief descriptions, spanning from classic methods like TextRank to modern approaches like BART and PEGASUS.
- Extensive compilation of Korean and English datasets, including details on domain, length, volume, and licensing.
- A thorough list of pre-trained models, with a focus on Korean language models.
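Among the classic methods listed above, TextRank ranks sentences by running PageRank over a sentence-similarity graph. The sketch below is a simplified, self-contained illustration of that idea (word-overlap similarity plus power iteration), not the exact formulation from the original paper; the example text and all function names are made up here.

```python
import math
import re
from itertools import combinations

def textrank_sentences(text, d=0.85, iters=50):
    """Simplified TextRank sketch: build a sentence graph weighted by
    word overlap (normalized by log sentence lengths), then run
    PageRank-style power iteration with damping factor d."""
    sents = re.split(r"(?<=[.!?])\s+", text.strip())
    tokens = [set(re.findall(r"\w+", s.lower())) for s in sents]
    n = len(sents)
    w = [[0.0] * n for _ in range(n)]
    for i, j in combinations(range(n), 2):
        shared = len(tokens[i] & tokens[j])
        if shared:
            norm = math.log(len(tokens[i]) + 1) + math.log(len(tokens[j]) + 1)
            w[i][j] = w[j][i] = shared / norm
    # Power iteration: each sentence's score flows along its edges.
    scores = [1.0] * n
    for _ in range(iters):
        scores = [(1 - d) + d * sum(w[j][i] * scores[j] / (sum(w[j]) or 1.0)
                                    for j in range(n))
                  for i in range(n)]
    return sorted(zip(scores, sents), reverse=True)

text = ("Summarization systems condense long documents. "
        "Extractive systems copy important sentences from the document. "
        "Graph methods rank sentences by their links to other sentences. "
        "The weather today is sunny.")
for score, sent in textrank_sentences(text):
    print(f"{score:.3f}  {sent}")
```

Sentences that share vocabulary with many others accumulate score, while off-topic sentences (like the weather remark above) rank low. Modern abstractive models such as BART and PEGASUS replace this selection step entirely with sequence-to-sequence generation.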
Maintenance & Community
- The repository is maintained by uoneway.
- Links to related resources and other "awesome" lists are provided for further exploration.
Licensing & Compatibility
- Licenses vary by dataset and model. Some datasets are available under CC-BY-SA-4.0 or MIT, while others are restricted to non-commercial research use. Pre-trained models likewise carry different licenses (e.g., Apache 2.0, MIT).
Limitations & Caveats
- This repository is a curated list of resources and does not provide a unified framework or tool for text summarization. Users will need to refer to individual model repositories for implementation and usage details.