tner  by asahi417

NER tool for language model fine-tuning with cross-domain evaluation

Created 5 years ago
392 stars

Top 73.4% on SourcePulse

GitHubView on GitHub
Project Summary

T-NER is a Python library for Named Entity Recognition (NER) using transformer-based language models. It offers an easy-to-use interface for fine-tuning models, evaluating them across diverse datasets, and deploying them via a web application. The library is suitable for researchers and practitioners looking to streamline NER tasks and explore model generalization.

How It Works

T-NER leverages the PyTorch framework and integrates seamlessly with Hugging Face's Transformers library. It provides a unified API for accessing and processing numerous public NER datasets, as well as custom datasets formatted in the CoNLL IOB format. The library supports a two-stage parameter search for fine-tuning, optimizing configurations like learning rate, batch size, and CRF layer usage to identify high-performing models.

Quick Start & Requirements

Highlighted Details

  • Supports fine-tuning with a robust parameter search strategy.
  • Integrates over 100 pre-trained NER models and datasets from Hugging Face.
  • Offers a web API for interactive model prediction and visualization.
  • Includes functionality for cross-domain and multilingual NER evaluation.

Maintenance & Community

The project is associated with EACL 2021 and AACL 2022 publications. Further details and community interaction can be found via the GitHub repository.

Licensing & Compatibility

The library is released under the MIT License, permitting commercial use and integration with closed-source projects.

Limitations & Caveats

While T-NER simplifies many aspects of NER, cross-domain generalization remains a challenge, even with large pre-trained models. The fine-tuning process, especially with extensive parameter search, can be computationally intensive.

Health Check
Last Commit

2 years ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
2 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Gabriel Almeida Gabriel Almeida(Cofounder of Langflow), and
5 more.

lit by PAIR-code

0.1%
4k
Interactive ML model analysis tool for understanding model behavior
Created 5 years ago
Updated 3 weeks ago
Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Travis Fischer Travis Fischer(Founder of Agentic), and
2 more.

modelscope by modelscope

0.2%
8k
Model-as-a-Service library for model inference, training, and evaluation
Created 3 years ago
Updated 1 day ago
Starred by Aravind Srinivas Aravind Srinivas(Cofounder of Perplexity), François Chollet François Chollet(Author of Keras; Cofounder of Ndea, ARC Prize), and
42 more.

spaCy by explosion

0.1%
32k
NLP library for production applications
Created 11 years ago
Updated 3 months ago
Feedback? Help us improve.