luke by studio-ousia

Transformer model for language understanding with knowledge-based embeddings

created 6 years ago
724 stars

Top 48.6% on sourcepulse

Project Summary

LUKE (Language Understanding with Knowledge-based Embeddings) is a transformer-based language model that incorporates knowledge-based entity representations. It targets NLP researchers and practitioners seeking state-of-the-art performance on tasks like named entity recognition, relation classification, and question answering, offering improved contextual understanding through entity-aware self-attention.

How It Works

LUKE enhances transformer models by integrating entity information directly into the self-attention mechanism. Its entity-aware self-attention treats words and entities in the input as independent tokens and computes attention scores with query matrices that depend on whether the attending and attended tokens are words or entities. This approach, detailed in the EMNLP 2020 paper, enables LUKE to capture richer contextual representations that benefit downstream NLP tasks.
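A minimal sketch of this interface, assuming the Hugging Face Transformers integration and the studio-ousia/luke-base checkpoint; the sentence and entity spans are purely illustrative:

    # Minimal sketch, assuming the Hugging Face Transformers integration and
    # the studio-ousia/luke-base checkpoint; sentence and spans are illustrative.
    from transformers import LukeModel, LukeTokenizer

    tokenizer = LukeTokenizer.from_pretrained("studio-ousia/luke-base")
    model = LukeModel.from_pretrained("studio-ousia/luke-base")

    text = "Beyoncé lives in Los Angeles."
    entity_spans = [(0, 7), (17, 28)]  # character spans of "Beyoncé" and "Los Angeles"

    # The tokenizer adds entity tokens alongside the word tokens.
    inputs = tokenizer(text, entity_spans=entity_spans, return_tensors="pt")
    outputs = model(**inputs)

    # Contextualized representations from the entity-aware self-attention layers.
    word_states = outputs.last_hidden_state            # (batch, num_word_tokens, hidden)
    entity_states = outputs.entity_last_hidden_state   # (batch, num_entities, hidden)

The entity representations returned here are what the task-specific heads (NER, relation classification, entity typing) build on.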

Quick Start & Requirements

  • Installation: poetry install (with optional extras for pretraining: pretraining, opennlp, and icu).
  • PyTorch: Users may need to install a compatible PyTorch version manually (e.g., poetry run pip3 install torch torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/cu113 for CUDA 11.3); a quick environment check follows this list.
  • Dependencies: AllenNLP and Hugging Face Transformers are used for fine-tuning examples.
  • Resources: Pretrained models range from 125M to 868M parameters. Lite versions offer reduced memory footprints.
  • Documentation: LUKE and mLUKE
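A quick environment check after installing PyTorch (a minimal sketch; it only assumes the PyTorch install described above):

    # Sanity check, assuming PyTorch was installed per the command above: confirm
    # the interpreter sees a CUDA-enabled build before launching pretraining or
    # fine-tuning runs.
    import torch

    print(torch.__version__)           # should report the expected build, e.g. a +cu113 wheel
    print(torch.cuda.is_available())   # True only if the CUDA runtime matches the driver
    if torch.cuda.is_available():
        print(torch.cuda.get_device_name(0))

Run it with poetry run python so it executes inside the project's virtual environment.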

Highlighted Details

  • Achieves state-of-the-art results on SQuAD v1.1, CoNLL-2003, ReCoRD, TACRED, and Open Entity benchmarks.
  • Offers "lite" model versions with reduced parameter counts for easier fine-tuning on smaller GPUs.
  • Provides fine-tuning examples for NER, relation classification, entity typing, and entity disambiguation (an inference sketch follows this list).
  • Includes Japanese language models (LUKE-Japanese) with strong performance on JGLUE benchmarks.
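As an illustration of the NER-style interface, here is a hedged inference sketch via Hugging Face Transformers; the studio-ousia/luke-large-finetuned-conll-2003 checkpoint name and the example spans are assumptions to verify against the documentation:

    # Hedged sketch of NER-style inference; the checkpoint name and example
    # spans are assumptions to check against the LUKE documentation.
    from transformers import LukeForEntitySpanClassification, LukeTokenizer

    name = "studio-ousia/luke-large-finetuned-conll-2003"
    tokenizer = LukeTokenizer.from_pretrained(name, task="entity_span_classification")
    model = LukeForEntitySpanClassification.from_pretrained(name)

    text = "Beyoncé lives in Los Angeles."
    entity_spans = [(0, 7), (17, 28)]  # candidate mention spans to classify

    inputs = tokenizer(text, entity_spans=entity_spans, return_tensors="pt")
    logits = model(**inputs).logits                  # (batch, num_spans, num_labels)
    predictions = logits.argmax(dim=-1)[0].tolist()  # best label per span

    for span, label_id in zip(entity_spans, predictions):
        print(text[span[0]:span[1]], "->", model.config.id2label[label_id])

The same tokenizer call shown earlier carries the candidate spans, so NER reduces to classifying each span's entity representation.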

Maintenance & Community

The repository received its most recent substantial updates in late 2022, adding Japanese models and fine-tuning code; commit activity has since slowed (see the Health Check below). LUKE is integrated into the Hugging Face Transformers library, indicating strong community adoption and support.

Licensing & Compatibility

The repository does not explicitly state a license. Its integration with Hugging Face Transformers suggests it is intended for broad use, including research and potentially commercial applications, but users should verify licensing details before adopting it.

Limitations & Caveats

The primary installation method uses Poetry, which may be unfamiliar to some users. Manual PyTorch installation is often required to match specific hardware configurations. The README does not specify a license, which could be a concern for commercial use.

Health Check

  • Last commit: 1 year ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 0
  • Issues (30d): 0
  • Star History: 2 stars in the last 90 days
