Chinese spell checker for detecting/correcting substitution errors
FASPell is a Chinese spell checker designed to detect and correct substitution errors in simplified and traditional Chinese text. It targets researchers and developers working with noisy Chinese user-generated text, offering state-of-the-art performance as of early 2019.
How It Works
FASPell follows a DAE-decoder paradigm. A denoising autoencoder (DAE), implemented as a fine-tuned BERT masked language model, generates candidate corrections for each character. A decoder then filters those candidates by combining character similarity (based on visual and phonological features) with the language model's confidence scores, and selects the best correction. The design aims to make spell checking fast, adaptable, and powerful.
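The filtering step can be illustrated with a minimal sketch. This is not FASPell's actual code: the names `pick_correction` and `toy_similarity`, the similarity table, and the simple "confidence plus similarity must reach a threshold" rule are all assumptions made for illustration. The real filter and its scoring may differ.

```python
from typing import Callable, Dict, List, Optional, Tuple

# (candidate character, language-model confidence in [0, 1])
Candidate = Tuple[str, float]

# Hypothetical similarity scores, for illustration only.
# "再" and "在" share a pronunciation, so the pair gets a high score.
TOY_SIMILARITY: Dict[Tuple[str, str], float] = {
    ("再", "在"): 0.8,
}


def toy_similarity(original: str, candidate: str) -> float:
    """Return an assumed similarity in [0, 1]; unknown pairs score 0."""
    if original == candidate:
        return 1.0
    return TOY_SIMILARITY.get((original, candidate), 0.0)


def pick_correction(
    original: str,
    candidates: List[Candidate],
    similarity: Callable[[str, str], float],
    min_score: float = 1.2,  # assumed threshold, not a FASPell default
) -> Optional[str]:
    """Return a replacement character, or None to keep the original."""
    # Visit candidates from most to least confident.
    for char, confidence in sorted(candidates, key=lambda c: c[1], reverse=True):
        if char == original:
            # The model agrees with the text: treat it as correct.
            return None
        # Accept only candidates that are both likely and similar.
        if confidence + similarity(original, char) >= min_score:
            return char
    return None


if __name__ == "__main__":
    # Confident and similar candidate: accepted. Prints 在
    print(pick_correction("再", [("在", 0.85), ("再", 0.10)], toy_similarity))
    # Confident but dissimilar candidate: rejected. Prints None
    print(pick_correction("再", [("是", 0.70), ("再", 0.20)], toy_similarity))
```

Requiring both signals keeps the language model from "correcting" a character into something fluent but unrelated to what the user typed, which is the point of combining confidence with similarity.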
Quick Start & Requirements
pip install -r requirements.txt (requires Python 3.6, TensorFlow >= 1.7, matplotlib, tqdm).
apted.jar is required for the tree edit distance similarity calculation.
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
2 years ago
Inactive