OpenBioMed  by PharMolix

AI toolkit for biomedicine

created 2 years ago
900 stars

Top 41.2% on sourcepulse

GitHubView on GitHub
1 Expert Loves This Project
Project Summary

OpenBioMed is a Python deep learning toolkit designed for AI-powered biomedicine research, offering flexible APIs for multi-modal data (molecules, proteins, text, single cells) and over 20 tools for applications like drug discovery and multi-modal challenges. It enables researchers to build custom workflows and LLM-driven agents for complex scientific tasks.

How It Works

OpenBioMed employs a unified data processing pipeline to standardize heterogeneous biomedical data. It provides access to over 20 pre-trained models, including specialized foundation models like PharmolixFM for molecular and protein interactions, BioMedGPT-R1 for multimodal biomedical reasoning, and LangCell for single-cell transcriptomics. The framework supports building collaborative workflows and LLM agents, facilitating complex research by connecting multiple tools and leveraging large language models for insights.

Quick Start & Requirements

  • Installation: Requires conda for environment setup. Key commands involve creating a conda environment (conda create -n OpenBioMed python=3.9), activating it (conda activate OpenBioMed), and installing PyTorch (v1.13.1 with specific CUDA version), PyTorch Geometric libraries, and project requirements (pip install -r requirements.txt).
  • Prerequisites: CUDA toolkit (v11.7 recommended), Python 3.9. Additional installations are needed for visualization (PyMOL, imageio) and AutoDockVina (meeko, pdb2pqr, vina).
  • Resources: Setup involves significant dependency installation. Docker images are available for easier deployment.
  • Documentation: Tutorials are available as Jupyter notebooks.

Highlighted Details

  • Features PharmolixFM, a multimodal foundation model achieving competitive performance with AlphaFold3 on protein-molecule docking.
  • Includes BioMedGPT-R1, a multimodal biomedical reasoning model performing on par with commercial models on biomedical QA benchmarks.
  • Offers LangCell for zero-shot cell identity understanding in single-cell transcriptomics.
  • Provides BioMedGPT-10B, a multimodal foundation model for biomedical QA and cross-modal molecule/protein tasks.

Maintenance & Community

The project is a collaboration between PharMolix Inc. and the Institute of AI Industry Research (AIR), Tsinghua University. For technical questions or suggestions, users are encouraged to open an issue. Commercial support or collaboration inquiries can be directed to opensource@pharmolix.com.

Licensing & Compatibility

The repository is licensed under the MIT License. However, the use of specific models like BioMedGPT-LM-7B and BioMedGPT-10B is subject to an Acceptable Use Policy, which prohibits generating content that violates laws or regulations.

Limitations & Caveats

The Acceptable Use Policy for certain models strictly prohibits their use for generating harmful or illegal content. The project emphasizes responsible and ethical use, and BioMedGPT models should not be used to provide services to the general public.

Health Check
Last commit

1 month ago

Responsiveness

1 week

Pull Requests (30d)
0
Issues (30d)
2
Star History
41 stars in the last 90 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems) and Jeff Hammerbacher Jeff Hammerbacher(Cofounder of Cloudera).

Biomni by snap-stanford

2.8%
2k
Biomedical AI agent for autonomous research tasks
created 4 months ago
updated 19 hours ago
Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), Omar Sanseviero Omar Sanseviero(DevRel at Google DeepMind), and
1 more.

BioGPT by microsoft

0.1%
4k
BioGPT is a generative pre-trained transformer for biomedical text
created 3 years ago
updated 1 year ago
Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), Didier Lopes Didier Lopes(Founder of OpenBB), and
1 more.

qlib by microsoft

0.7%
28k
AI platform for quantitative investment research and production
created 5 years ago
updated 4 days ago
Feedback? Help us improve.