Protein models & API for generative tasks and representation learning
Top 22.4% on sourcepulse
This repository provides access to EvolutionaryScale's flagship protein language models, ESM3 (generative) and ESM C (representation learning). It's designed for researchers and developers in bioinformatics and computational biology seeking advanced tools for protein sequence, structure, and function prediction and generation. The library offers a unified interface for local execution and cloud-based inference via the EvolutionaryScale Forge API and AWS SageMaker.
How It Works
ESM3 is a multimodal, generative masked language model that reasons across protein sequence, structure, and function. It uses a scalable transformer backbone, allowing iterative generation by sampling masked tokens. ESM C is a parallel representation learning model, designed as a drop-in replacement for ESM2, offering significant performance and efficiency gains. Both models leverage discrete token representations for their respective tasks.
Quick Start & Requirements
pip install esm
Highlighted Details
pip install flash-attn
.Maintenance & Community
Licensing & Compatibility
LICENSE.md
for details.Limitations & Caveats
2 weeks ago
1 week