tner  by asahi417

NER tool for language model fine-tuning with cross-domain evaluation

created 5 years ago
389 stars

Top 74.9% on sourcepulse

Project Summary

T-NER is a Python library for Named Entity Recognition (NER) using transformer-based language models. It offers an easy-to-use interface for fine-tuning models, evaluating them across diverse datasets, and deploying them via a web application. The library is suitable for researchers and practitioners looking to streamline NER tasks and explore model generalization.

How It Works

T-NER is built on PyTorch and Hugging Face's Transformers library. It provides a unified API for loading and processing many public NER datasets, as well as custom datasets in CoNLL IOB format. For fine-tuning, the library supports a two-stage parameter search over settings such as learning rate, batch size, and CRF layer usage to identify high-performing models.
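
To make the custom dataset format concrete, here is a minimal sketch of reading IOB-tagged text and recovering entity spans. It assumes one whitespace-separated `token tag` pair per line with a blank line between sentences, which is a common CoNLL-style layout; the exact layout T-NER expects may differ. The helpers `read_iob` and `iob_to_spans` are hypothetical names for illustration and are not part of T-NER.

```python
from typing import List, Tuple

# Assumed layout: "token tag" per line, blank line between sentences.
SAMPLE = """Jacob B-PER
Collier I-PER
lives O
in O
London B-LOC

EU B-ORG
rejects O
"""


def read_iob(text: str) -> List[Tuple[List[str], List[str]]]:
    """Parse IOB text into one (tokens, tags) pair per sentence."""
    sentences, tokens, tags = [], [], []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            # A blank line closes the current sentence, if any.
            if tokens:
                sentences.append((tokens, tags))
                tokens, tags = [], []
            continue
        parts = line.split()
        tokens.append(parts[0])   # first column: token
        tags.append(parts[-1])    # last column: IOB tag
    if tokens:
        sentences.append((tokens, tags))
    return sentences


def iob_to_spans(tags: List[str]) -> List[Tuple[str, int, int]]:
    """Convert IOB tags to (label, start, end) spans, end exclusive."""
    spans = []
    start, label = None, None
    for i, tag in enumerate(tags):
        if tag.startswith("B-") or (tag.startswith("I-") and tag[2:] != label):
            # A new entity starts here; close any open one first.
            if label is not None:
                spans.append((label, start, i))
            start, label = i, tag[2:]
        elif tag == "O":
            if label is not None:
                spans.append((label, start, i))
            start, label = None, None
        # Otherwise the tag is an I- tag continuing the open entity.
    if label is not None:
        spans.append((label, start, len(tags)))
    return spans


sentences = read_iob(SAMPLE)
first_tokens, first_tags = sentences[0]
first_spans = iob_to_spans(first_tags)
```

For the first sample sentence, `first_spans` holds one `PER` span covering "Jacob Collier" and one `LOC` span covering "London".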

Quick Start & Requirements

Highlighted Details

  • Supports fine-tuning with a two-stage parameter search.
  • Integrates over 100 pre-trained NER models and datasets from Hugging Face.
  • Offers a web API for interactive model prediction and visualization.
  • Includes functionality for cross-domain and multilingual NER evaluation.
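
The two-stage parameter search mentioned above can be sketched generically. The summary does not say how the stages are split, so this sketch assumes one plausible reading: score every configuration after a short run, then spend the full training budget only on the top few. The function `two_stage_search`, the scoring function `toy_evaluate`, and all parameter names are hypothetical; a real run would fine-tune a model and return a validation metric instead of the toy formula.

```python
from itertools import product
from typing import Callable, Dict, List, Tuple

Config = Dict[str, object]


def two_stage_search(
    grid: Dict[str, list],
    evaluate: Callable[[Config, int], float],
    short_epochs: int = 2,
    full_epochs: int = 10,
    top_k: int = 2,
) -> Tuple[Config, float]:
    """Return the best config and its score after a two-stage search."""
    keys = sorted(grid)
    configs: List[Config] = [
        dict(zip(keys, values)) for values in product(*(grid[k] for k in keys))
    ]
    # Stage 1: cheap, short runs over every configuration.
    ranked = sorted(configs, key=lambda c: evaluate(c, short_epochs), reverse=True)
    finalists = ranked[:top_k]
    # Stage 2: full-length runs for the finalists only.
    scored = [(evaluate(c, full_epochs), c) for c in finalists]
    best_score, best_config = max(scored, key=lambda pair: pair[0])
    return best_config, best_score


def toy_evaluate(config: Config, epochs: int) -> float:
    """Stand-in for 'fine-tune, then score on validation data' (toy formula)."""
    score = 0.5 + 0.02 * epochs
    score -= abs(config["lr"] - 1e-5) * 1000       # toy optimum at lr = 1e-5
    score += 0.01 if config["crf"] else 0.0        # toy bonus for a CRF layer
    score -= 0.001 * (config["batch_size"] // 16)  # toy penalty for large batches
    return score


grid = {"lr": [1e-4, 1e-5], "batch_size": [16, 32], "crf": [True, False]}
best_config, best_score = two_stage_search(grid, toy_evaluate)
```

With this grid, stage 1 covers all eight configurations and stage 2 only two, which is where the savings over a full-length exhaustive search would come from.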

Maintenance & Community

The project is associated with EACL 2021 and AACL 2022 publications. Further details and community interaction can be found via the GitHub repository.

Licensing & Compatibility

The library is released under the MIT License, permitting commercial use and integration with closed-source projects.

Limitations & Caveats

While T-NER simplifies many aspects of NER, cross-domain generalization remains a challenge, even with large pre-trained models. The fine-tuning process, especially with extensive parameter search, can be computationally intensive.
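
A common way to quantify the cross-domain gap is to compare entity-level F1 on an in-domain test set with F1 on a test set from another domain. The sketch below assumes exact-match scoring over `(label, start, end)` spans with micro-averaging; the function name `micro_f1` is hypothetical, and the spans are made-up toy data, not measured results from T-NER.

```python
from typing import List, Set, Tuple

Span = Tuple[str, int, int]  # (label, start, end), end exclusive


def micro_f1(gold: List[Set[Span]], pred: List[Set[Span]]) -> float:
    """Micro-averaged F1 over exactly matching entity spans."""
    true_pos = sum(len(g & p) for g, p in zip(gold, pred))
    n_pred = sum(len(p) for p in pred)
    n_gold = sum(len(g) for g in gold)
    if true_pos == 0:
        return 0.0
    precision = true_pos / n_pred
    recall = true_pos / n_gold
    return 2 * precision * recall / (precision + recall)


# Toy in-domain example: the prediction matches the gold spans exactly.
in_gold = [{("PER", 0, 2), ("LOC", 4, 5)}]
in_pred = [{("PER", 0, 2), ("LOC", 4, 5)}]

# Toy out-of-domain example: one span is found but given the wrong label.
out_gold = [{("ORG", 0, 1), ("PER", 3, 5)}]
out_pred = [{("ORG", 0, 1), ("LOC", 3, 5)}]

in_domain_f1 = micro_f1(in_gold, in_pred)
cross_domain_f1 = micro_f1(out_gold, out_pred)
```

Here a mislabeled span counts as both a false positive and a false negative, so the toy cross-domain score drops to half the in-domain score.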

Health Check

  • Last commit: 2 years ago
  • Responsiveness: Inactive
  • Pull requests (30d): 0
  • Issues (30d): 0
  • Star history: 3 stars in the last 90 days
