CodeXGLUE by microsoft

Benchmark for code intelligence tasks

Created 5 years ago

1,802 stars

Top 23.5% on SourcePulse

View on GitHub

7 Experts Love This Project

Shizhe Diao

Author of LMFlow; Research Scientist at NVIDIA

Omar Khattab

Coauthor of DSPy, ColBERT; Professor at MIT

and 3 more!

Project Summary

CodeXGLUE is a comprehensive benchmark dataset and open challenge designed to advance AI for code intelligence, targeting researchers and practitioners in software engineering and artificial intelligence. It provides a standardized platform for evaluating and comparing models across a wide array of code-related tasks, aiming to boost developer productivity.

How It Works

CodeXGLUE addresses the lack of standardized evaluation for code intelligence by curating 14 datasets across 10 diverse tasks, including code-code translation, defect detection, code completion, code search, and code summarization. It supports models inspired by NLP advancements, offering baseline implementations like CodeBERT (BERT-style) for understanding and CodeGPT (GPT-style) for generation, along with an Encoder-Decoder framework for sequence-to-sequence tasks.

Quick Start & Requirements

Access to datasets and baseline models is available via HuggingFace datasets.
Specific task repositories contain evaluation methodologies.
Training and inference time costs are provided for 2 P100 GPUs.

Highlighted Details

Covers 10 diversified code intelligence tasks: code-code, text-code, code-text, and text-text.
Includes 14 datasets, with several newly introduced for broader evaluation.
Provides three baseline pipelines: CodeBERT, CodeGPT, and Encoder-Decoder.
Facilitates model evaluation and comparison through an open challenge submission process.

Maintenance & Community

This project is a research initiative from Microsoft Research Asia, Developer Division, and Bing. Further details on participation and submission are available via email to codexglue@microsoft.com.

Licensing & Compatibility

The code is released under the MIT License, while the datasets are governed by the Computational Use of Data Agreement (C-UDA) License.

Limitations & Caveats

The README does not specify any explicit limitations or caveats regarding model performance, dataset biases, or ongoing development status.

Health Check

Last Commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)

Issues (30d)

Star History