NLI benchmark dataset for natural language understanding research
Top 73.8% on sourcepulse
The ANLI dataset provides a benchmark for Natural Language Understanding (NLU) by presenting adversarial examples designed to challenge NLI models. It targets researchers in linguistics, machine learning, cognitive science, and psychology, offering a robust evaluation suite and pre-trained models for NLI tasks.
How It Works
ANLI is an adversarial dataset, meaning examples are specifically crafted to fool existing NLI models. It consists of three rounds of increasing difficulty, with annotations for error analysis and verifier labels to ensure quality. The dataset is formatted in JSONL and includes premises, hypotheses, labels (entailment, neutral, contradiction), and reasons for the labels.
Quick Start & Requirements
ynie/roberta-large-snli_mnli_fever_anli_R1_R2_R3-nli
).Highlighted Details
Maintenance & Community
The project is maintained by Facebook AI Research (FAIR). The README encourages contributions via pull requests for leaderboard additions and model submissions.
Licensing & Compatibility
ANLI is licensed under Creative Commons-Non Commercial 4.0. This license restricts commercial use and linking with proprietary codebases.
Limitations & Caveats
The dataset's non-commercial license may limit its adoption in commercial products. The provided code and pre-trained models are based on older versions of PyTorch and Transformers, potentially requiring updates for compatibility with current libraries.
3 years ago
1 week