CodeGen2 by salesforce

Program synthesis research release (ICLR 2023)

Created 2 years ago
271 stars

Top 95.0% on SourcePulse

View on GitHub
Project Summary

CodeGen2 provides official research releases of large language models (LLMs) for program synthesis, specifically addressing the challenges of training LLMs on both programming and natural languages. It targets researchers and developers working on code generation, autocompletion, and other program synthesis tasks, offering models ranging from 1 billion to 16 billion parameters.

How It Works

CodeGen2 generates programs by auto-regressive sampling: the model emits code one token at a time, conditioned on everything before it. The models are trained on a dataset spanning both natural language and programming languages, so they can generate code from natural-language descriptions or complete partial code snippets; they are also trained to infill missing spans inside existing code. This mixed natural/programming-language training is a key differentiator, allowing more versatile and context-aware code generation. A minimal causal-sampling sketch follows.
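The sketch below shows causal (left-to-right) sampling via Hugging Face Transformers. The model ID "Salesforce/codegen2-1B" is an assumption (checkpoint names on the Hub may carry a suffix), so verify it on the model card before running:

```python
# Minimal causal-sampling sketch with Hugging Face Transformers.
# The model ID below is an assumption -- check the exact checkpoint
# name on the Hugging Face Hub model card before relying on it.
from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "Salesforce/codegen2-1B"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)

# The model continues the prompt left-to-right, one token at a time.
prompt = "def fibonacci(n):"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```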

Quick Start & Requirements

  • Install/Run: Load the models with the Hugging Face Transformers library (see the loading sketch after this list).
  • Prerequisites: PyTorch, Hugging Face Transformers.
  • Resources: Significant GPU memory is required for the larger models; the 16B-parameter model needs roughly 32 GB for its weights alone in 16-bit precision.
  • Docs: Hugging Face Hub for model cards and usage examples.
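A hedged quick-start sketch for memory-conscious loading; the model ID and half-precision settings are assumptions to be adapted per the model card:

```python
# Quick-start sketch. Install dependencies first:
#   pip install torch transformers
# Loading in float16 halves weight memory vs. float32; the 16B model
# still needs a large GPU (or multi-GPU sharding, e.g., via accelerate).
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "Salesforce/codegen2-1B"  # assumed ID; verify on the Hub
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,   # halve the memory footprint of the weights
    trust_remote_code=True,      # CodeGen2 ships custom modeling code
)
model.to("cuda" if torch.cuda.is_available() else "cpu")
```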

Highlighted Details

  • Offers four model sizes: 1B, 3.7B, 7B, and 16B parameters.
  • Supports both causal (left-to-right) and infill sampling for program synthesis (see the infill sketch after this list).
  • Models are available on Hugging Face Hub for easy integration.
  • Research presented at ICLR 2023.
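Infill sampling marks a missing span with sentinel tokens and asks the model to generate its contents. The format below follows the CodeGen2 model card; the exact sentinel strings ("<mask_1>", "<sep>", "<eom>") should be confirmed there, and the model ID is again an assumption:

```python
# Infill-sampling sketch: the model fills the span marked <mask_1>.
# Sentinel format follows the CodeGen2 model card; confirm the exact
# special tokens on the Hub -- this is a sketch, not a guaranteed API.
from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "Salesforce/codegen2-1B"  # assumed ID; verify on the Hub
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)

def infill_prompt(prefix: str, suffix: str) -> str:
    # Layout: prefix <mask_1> suffix <|endoftext|> <sep> <mask_1>
    # The model then emits the masked span, terminated by <eom>.
    return prefix + "<mask_1>" + suffix + "<|endoftext|>" + "<sep>" + "<mask_1>"

prompt = infill_prompt("def count_words(text):\n    ", "\n    return n")
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
completion = tokenizer.decode(outputs[0], skip_special_tokens=False)
# Keep only the generated infill, which ends at the <eom> sentinel.
print(completion.split("<mask_1>")[-1].split("<eom>")[0])
```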

Maintenance & Community

  • Developed by Salesforce Research.
  • No explicit community links (Discord, Slack) or roadmap provided in the README.

Licensing & Compatibility

  • The README does not explicitly state a license. Models released by Salesforce Research on Hugging Face are typically under a permissive license (e.g., Apache 2.0 or MIT), but this should be verified on the repository and the specific model card.

Limitations & Caveats

The README does not detail specific limitations, performance benchmarks, or known issues. The significant hardware requirements for larger models may be a barrier to entry for some users.

Health Check

  • Last Commit: 2 years ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 0
  • Issues (30d): 0
  • Star History: 0 stars in the last 30 days

Starred by Yineng Zhang (Inference Lead at SGLang; Research Scientist at Together AI), Stas Bekman (Author of "Machine Learning Engineering Open Book"; Research Engineer at Snowflake), and 3 more.

Explore Similar Projects

prompt-lookup-decoding by apoorvumang

566 stars · Top 0.2% on SourcePulse
Decoding method for faster LLM generation
Created 1 year ago · Updated 1 year ago
Starred by Shizhe Diao (Author of LMFlow; Research Scientist at NVIDIA), Tri Dao (Chief Scientist at Together AI), and 1 more.

hnet by goombalab

722 stars · Top 1.5% on SourcePulse
Hierarchical sequence modeling with dynamic chunking
Created 2 months ago · Updated 1 month ago
Starred by Didier Lopes (Founder of OpenBB), Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), and 3 more.

DeepSeek-Coder-V2 by deepseek-ai

6k stars · Top 0.3% on SourcePulse
Open-source code language model comparable to GPT4-Turbo
Created 1 year ago · Updated 11 months ago