DSXiangLi: Advanced Chinese NER toolkit
Top 84.6% on SourcePulse
This repository offers a comprehensive toolkit for Chinese Named Entity Recognition (NER), with models ranging from traditional BiLSTM-CRF to BERT and Transformer architectures. It targets NLP researchers and practitioners who need flexible solutions for Chinese text analysis, and it simplifies implementing and experimenting with diverse NER techniques.
How It Works
The project implements several NER approaches: character-based, lexicon-enhanced, multi-task learning (MTL), and Transformer models. It also supports adversarial transfer learning and an MRC (Machine Reading Comprehension) paradigm for NER. Data is converted to TFRecords with model-specific input handling (WordPiece tokenization for BERT, pre-trained word vectors for the other models), and the project includes data augmentation strategies to improve robustness.
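To make the character-based and MRC framings concrete, here is a minimal sketch in plain Python. It is not taken from the repository: the function names, the BIO label scheme, the sample sentence, and the query strings are all illustrative assumptions, and the project's actual preprocessing (which writes TFRecords) may differ.

```python
from typing import Dict, List, Tuple

# (start, end_exclusive, entity_type) over character offsets; hypothetical format
Span = Tuple[int, int, str]


def spans_to_bio(text: str, spans: List[Span]) -> List[str]:
    """Character-based framing: assign one BIO tag per character."""
    tags = ["O"] * len(text)
    for start, end, etype in spans:
        tags[start] = f"B-{etype}"          # first character of the entity
        for i in range(start + 1, end):
            tags[i] = f"I-{etype}"          # remaining characters of the entity
    return tags


def spans_to_mrc(text: str, spans: List[Span], queries: Dict[str, str]) -> List[dict]:
    """MRC framing: one (query, context) example per entity type,
    with the answers given as character spans in the context."""
    examples = []
    for etype, query in queries.items():
        answers = [(s, e) for s, e, t in spans if t == etype]
        examples.append({"query": query, "context": text, "answers": answers})
    return examples


# Illustrative sentence: "Li Ming works in Beijing"
text = "李明在北京工作"
spans = [(0, 2, "PER"), (3, 5, "LOC")]

bio_tags = spans_to_bio(text, spans)

# Assumed queries: "find the person names" / "find the place names"
mrc_examples = spans_to_mrc(
    text, spans, {"PER": "找出文中的人名", "LOC": "找出文中的地名"}
)

for ch, tag in zip(text, bio_tags):
    print(ch, tag)
```

In the sequence-labeling form, a model predicts one tag per character. In the MRC form, the same sentence is paired with a query for each entity type and the model extracts answer spans, which lets the query text carry information about the entity type.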
Quick Start & Requirements
Dependencies: requirement.txt
Also required: tensorflow/serving_model:1.14.0, pre-trained models (download links in folder READMEs), and datasets (preprocess scripts in data). Specific models may require pre-trained word vectors or BERT vocab files.
Highlighted Details
Maintenance & Community
The repository encourages community contributions via Pull Requests. No specific details on active maintainers, community channels, or a roadmap are provided in the README.
Licensing & Compatibility
The license type is not explicitly stated in the provided README content.
Limitations & Caveats
The code is noted as "not rigorously tested," indicating potential instability or bugs. The README does not detail specific limitations regarding unsupported platforms or known issues beyond this general testing caveat.
2 years ago
Inactive