Text classification resource survey, covering shallow/deep learning models
Top 54.7% on sourcepulse
This repository serves as a comprehensive survey and resource hub for text classification, targeting NLP researchers and practitioners. It consolidates papers, models, datasets, and evaluation metrics, primarily drawing from the survey paper "A Survey on Text Classification: From Shallow to Deep Learning," offering a structured overview of the field's evolution and key components.
How It Works
The repository categorizes text classification approaches from traditional shallow learning models (e.g., SVM, Random Forest) to state-of-the-art deep learning architectures (e.g., BERT, RoBERTa, TextGCN). It details various model architectures, their core mechanisms (like attention, graph convolutions, or span masking), and their performance on benchmark datasets, providing a historical and technical progression of the field.
Quick Start & Requirements
This repository is primarily a curated collection of information and links to external resources (papers, GitHub repositories). There is no direct installation or execution command for the repository itself. Users are directed to individual model repositories for setup and usage.
Highlighted Details
Maintenance & Community
The repository is marked as "updating," indicating ongoing curation. Specific contributors or community links (like Discord/Slack) are not explicitly mentioned in the README.
Licensing & Compatibility
The repository itself does not specify a license. However, it links to numerous external projects, each with its own license. Users must consult the licenses of the individual linked repositories for usage and compatibility.
Limitations & Caveats
This repository is a survey and does not provide executable code or pre-trained models directly. Users must navigate to linked external repositories for implementation details and usage. The "updating" status suggests potential for changes and additions.
3 years ago
Inactive