LAMA by facebookresearch

Language model probe for factual/commonsense knowledge analysis (research paper)

Created 7 years ago

1,390 stars

Top 28.7% on SourcePulse

6 Experts Love This Project

aravindsrinivas

Aravind Srinivas

Cofounder of Perplexity

shizhediao

Author of LMFlow; Research Scientist at NVIDIA

hammer

Jeff Hammerbacher

Cofounder of Cloudera

codekansas

Cofounder of K-Scale Labs

and 2 more!

Project Summary

LAMA is a probe for analyzing factual and commonsense knowledge within pretrained language models. It offers a unified interface to query models like BERT, RoBERTa, ELMo, and Transformer-XL, enabling researchers and practitioners to assess model capabilities and extract knowledge.

How It Works

LAMA operates by presenting language models with cloze-style prompts (e.g., "The capital of France is [MASK].") and analyzing their predictions. It leverages a dataset of such prompts designed to test specific factual and commonsense knowledge. The project provides connectors to various popular language model architectures, abstracting away model-specific APIs for consistent analysis.

Quick Start & Requirements

Install: pip install -r requirements.txt (after cloning and setting up a conda environment with Python 3.7).
Prerequisites: Requires downloading a ~55 GB models archive (download_models.sh), spaCy (python3 -m spacy download en), and the LAMA dataset (data.zip).
Links: Dataset, Models, BERT conversion.

Highlighted Details

Supports BERT, RoBERTa, ELMo, and Transformer-XL.
Includes scripts for generating contextual embeddings and filling masked tokens.
Offers functionality to create data for LAMA-UHN and Negated-LAMA evaluations.
Can be installed as an editable package (pip install -e git+https://github.com/facebookresearch/LAMA#egg=LAMA).

Maintenance & Community

Developed by Facebook AI Research.
References several key NLP papers and libraries, indicating community engagement.

Licensing & Compatibility

License: CC-BY-NC 4.0 (Creative Commons Attribution-NonCommercial 4.0 International).
Restrictions: Non-commercial use only.

Limitations & Caveats

The CC-BY-NC 4.0 license restricts commercial use.
Requires significant disk space (~55 GB) for models.
Setup involves downloading and unzipping large archives.

Health Check

Last Commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)

0

Issues (30d)

0

Star History

1 stars in the last 30 days

Explore Similar Projects

fltr by moritztng

CLI tool for natural language question answering over text files

Created 2 years ago

Updated 1 year ago

LEval by OpenLMLab

Benchmark for long-context language model evaluation

Created 2 years ago

Updated 1 year ago

Starred by

Ying Sheng

Ying Sheng(Coauthor of SGLang),

Casper Hansen

Casper Hansen(Author of AutoAWQ), and

1 more.

InfiniteBench by OpenBMB

Benchmark for evaluating language models on super-long contexts (100k+ tokens)

Created 2 years ago

Updated 1 year ago

nlp_made_easy by Kyubyong

Code notes explaining NLP building blocks

Created 7 years ago

Updated 6 years ago

Starred by

Simon Willison

Simon Willison(Coauthor of Django),

Meng Zhang

Meng Zhang(Cofounder of TabbyML), and

1 more.

LaMini-LM by mbzuai-nlp

Small, efficient language models distilled from ChatGPT for research

Created 2 years ago

Updated 2 years ago

langtest by Pacific-AI-Corp

NLP testing SDK for model safety and effectiveness

Created 3 years ago

Updated 6 days ago

llms by IbrahimSobh

Collection of resources for large language models

Created 2 years ago

Updated 4 months ago

nlp-paper by changwookjun

Created 6 years ago

Updated 3 weeks ago

Starred by

Chip Huyen

Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"),

Didier Lopes

Didier Lopes(Founder of OpenBB), and

2 more.

RULER by NVIDIA

Evaluation suite for long-context language models research paper

Created 1 year ago

Updated 3 months ago

NLP-BERT--ChineseVersion by Y1ran

PyTorch BERT implementation for Chinese readers, mirroring the original Google AI paper

Created 7 years ago

Updated 7 years ago

Starred by

Elvis Saravia

Elvis Saravia(Founder of DAIR.AI).

BERT-related-papers by tomohideshibata

List of BERT-related research papers

Created 6 years ago

Updated 2 years ago

Starred by

Alexander Wu

Alexander Wu(Founder of MetaGPT).

GLM by THUDM

General language model for NLU, generation, and blank-filling tasks

Created 5 years ago

Updated 2 years ago

Feedback? Help us improve.