Toolkit for text error correction, supports multiple models for Chinese
Top 8.6% on sourcepulse
This toolkit provides a comprehensive solution for Chinese text error correction, targeting developers and researchers working with Chinese NLP. It offers a unified platform to evaluate and apply various models for correcting spelling, phonetic, and grammatical errors, significantly improving text quality.
How It Works
The project implements a diverse range of models for text correction, including statistical methods like KenLM (n-gram language models) and deep learning approaches such as Seq2Seq, T5, BERT variants (MacBERT, ERNIE), and large language models (ChatGLM3, Qwen2.5). This multi-model strategy allows for comparison and selection of the best-performing approach based on specific error types and performance requirements.
Quick Start & Requirements
pip install -U pycorrector
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
3 weeks ago
Inactive