ProphetNet by microsoft

Research project for natural language generation models

created 5 years ago
733 stars

Top 48.2% on sourcepulse

Project Summary

ProphetNet is a research repository for natural language generation (NLG) that hosts the official implementations of several pre-trained models and benchmarks. It is aimed at NLP researchers and developers working on text generation tasks. The repository covers ProphetNet, GLGE, JGR, GENIE, AR-diffusion, and CRITIC, which together span several NLG paradigms.

How It Works

The project covers several distinct NLG approaches. ProphetNet is pre-trained with a "future n-gram prediction" objective: at each step the model predicts the next n tokens together rather than only the next token, with the aim of improving coherence and relevance in generated text. GLGE provides baseline models for NLG benchmarks, which makes comparative analysis easier. JGR explores joint learning of a generator and a ranker. GENIE and AR-diffusion apply diffusion models to text generation: both produce text by iterative denoising in a continuous space, and AR-diffusion additionally builds autoregressive structure into that process. CRITIC focuses on LLM self-correction through interaction with external tools.
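To make the "future n-gram prediction" idea concrete, the sketch below builds the prediction targets for each position in a token sequence. It is only an illustration of the target layout, not the repository's implementation: the function name `future_ngram_targets` and the use of `None` as padding are assumptions, and the model-side machinery (attention streams, loss weighting) is not covered.

```python
from typing import List, Optional


def future_ngram_targets(tokens: List[str], n: int) -> List[List[Optional[str]]]:
    """For each position t, list the n tokens that follow position t.

    Positions near the end of the sequence have fewer than n future
    tokens; the missing slots are filled with None so every row has length n.
    (Hypothetical helper for illustration only.)
    """
    if n < 1:
        raise ValueError("n must be at least 1")
    targets = []
    for t in range(len(tokens)):
        row = []
        for k in range(1, n + 1):
            idx = t + k
            row.append(tokens[idx] if idx < len(tokens) else None)
        targets.append(row)
    return targets


tokens = ["the", "cat", "sat", "down"]

# n=2: each position is asked to predict the next two tokens.
bigram_targets = future_ngram_targets(tokens, n=2)

# n=1 reduces to ordinary next-token prediction targets.
next_token_targets = future_ngram_targets(tokens, n=1)
```

With n=1 the targets match standard next-token training, which shows that the future n-gram objective extends the usual setup rather than replacing it.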

Quick Start & Requirements

Installation typically involves cloning the repository and installing dependencies via pip. Specific models may require PyTorch, Transformers, and potentially CUDA-enabled GPUs for efficient training and inference. Detailed setup instructions and model-specific requirements are available within the respective sub-directories.
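Before running a model from one of the sub-directories, it can help to check that the packages mentioned above are importable. The following is a minimal sketch under stated assumptions: the import names `torch` and `transformers` correspond to PyTorch and Transformers, the helper names `check_dependencies` and `cuda_available` are hypothetical, and the actual requirements of each model should be taken from its own sub-directory.

```python
import importlib.util
from typing import Dict, Iterable


def check_dependencies(names: Iterable[str]) -> Dict[str, bool]:
    """Map each top-level package name to whether it can be imported."""
    return {name: importlib.util.find_spec(name) is not None for name in names}


def cuda_available() -> bool:
    """Return True only if PyTorch is installed and reports a usable GPU."""
    if importlib.util.find_spec("torch") is None:
        return False
    import torch  # imported lazily so the script also runs without PyTorch

    return torch.cuda.is_available()


# Assumed package list; adjust it to the model you plan to run.
status = check_dependencies(["torch", "transformers"])
for name, installed in status.items():
    print(f"{name}: {'installed' if installed else 'missing'}")
```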

Highlighted Details

  • Official implementations of ProphetNet, a pre-trained NLG model with future n-gram prediction.
  • Includes GLGE, a benchmark for natural language generation.
  • Features JGR for joint generator-ranker learning.
  • Introduces GENIE and AR-diffusion, diffusion models for text generation.
  • CRITIC enables LLMs to validate and rectify outputs using external tools.
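The validate-and-rectify idea behind CRITIC can be pictured as a loop: produce an answer, ask an external tool to check it, and revise when the tool objects. The sketch below is a toy version of that loop, not the project's code. All names (`verify_then_correct`, `toy_generate`, `calculator_critique`, `toy_revise`) are hypothetical, and plain Python functions stand in for the LLM and the external tool.

```python
from typing import Callable, Optional


def verify_then_correct(
    question: str,
    generate: Callable[[str], str],
    critique: Callable[[str, str], Optional[str]],
    revise: Callable[[str, str, str], str],
    max_rounds: int = 3,
) -> str:
    """Generate an answer, then alternate tool-based critique and revision.

    `critique` returns None when the tool finds no problem, otherwise a
    feedback string that `revise` uses to produce a new answer.
    """
    answer = generate(question)
    for _ in range(max_rounds):
        feedback = critique(question, answer)
        if feedback is None:
            break  # the tool accepted the answer
        answer = revise(question, answer, feedback)
    return answer


def toy_generate(question: str) -> str:
    return "50"  # stand-in for an LLM that answers incorrectly


def calculator_critique(question: str, answer: str) -> Optional[str]:
    expected = str(17 * 3)  # stand-in for a call to an external calculator
    return None if answer == expected else f"calculator says {expected}"


def toy_revise(question: str, answer: str, feedback: str) -> str:
    return feedback.rsplit(" ", 1)[-1]  # adopt the value the tool reported


final_answer = verify_then_correct(
    "What is 17 * 3?", toy_generate, calculator_critique, toy_revise
)
```

Passing the generator, critic, and reviser as callables keeps the loop independent of any particular model or tool, so each piece can be swapped out separately.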

Maintenance & Community

This project originates from the Natural Language Computing (NLC) group at Microsoft Research Asia (MSRA). Updates and community discussion are handled through Microsoft's open-source channels.

Licensing & Compatibility

The repository is released under the MIT License, permitting commercial use and integration with closed-source projects.

Limitations & Caveats

As a research project, some components may be experimental or under active development, potentially leading to breaking changes or incomplete documentation for specific models. The primary focus is on research contributions rather than production-ready deployment.

Health Check

  • Last commit: 1 year ago
  • Responsiveness: Inactive
  • Pull requests (30d): 0
  • Issues (30d): 0
  • Star history: 12 stars in the last 90 days
