BiomedGPT  by taokz

Vision-language foundation model for diverse biomedical tasks

created 2 years ago
672 stars

Top 51.2% on sourcepulse

GitHubView on GitHub
Project Summary

BiomedGPT is a generalist vision-language foundation model designed for diverse biomedical tasks, targeting researchers and developers in the medical AI domain. It aims to provide a unified framework for tasks like visual question answering, image captioning, and text summarization within the biomedical field.

How It Works

BiomedGPT is built upon the OFA (One-For-All) framework, leveraging a multi-modal and multi-task pre-training approach with extensive biomedical datasets. This strategy allows the model to learn transferable representations across various data modalities and tasks, enabling zero-shot or few-shot performance on downstream applications without task-specific architectures.

Quick Start & Requirements

  • Installation: Clone the repository and install dependencies using conda create --name biomedgpt python=3.7.4, python -m pip install pip==21.2.4, and pip install -r requirements.txt.
  • Prerequisites: Linux environment, Python 3.7.4.
  • Checkpoints: Pretrained checkpoints are available via Dropbox. Transformers-compatible weights are accessible through a Hugging Face collection.
  • Resources: A Colab notebook is provided for inference.
  • Documentation: Details on datasets and fine-tuned checkpoints are available in datasets.md and checkpoints.md respectively.

Highlighted Details

  • Generalist vision-language foundation model for biomedical tasks.
  • Pre-trained and fine-tuned on multi-modal & multi-task biomedical datasets.
  • Supports zero-shot inference for tasks like Visual Question Answering (VQA).
  • Includes scripts for pretraining, fine-tuning, and inference across VQA, image captioning, text summarization, natural language inference, and image classification.

Maintenance & Community

The project is associated with a Nature Medicine 2024 publication. Further questions can be directed to the authors or via GitHub issues.

Licensing & Compatibility

BiomedGPT is strictly for academic research purposes. Commercial and clinical uses are prohibited due to the inherited non-commercial license from the OFA framework, lack of healthcare setting licensing, and insufficient security measures for medical diagnoses.

Limitations & Caveats

The current implementation is not designed for chatbot or copilot applications, with ongoing work for improved conversational abilities. Extensive experiments with Huggingface's transformers have not been conducted, and full alignment with Fairseq results is not guaranteed.

Health Check
Last commit

3 weeks ago

Responsiveness

1 day

Pull Requests (30d)
1
Issues (30d)
1
Star History
37 stars in the last 90 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), Omar Sanseviero Omar Sanseviero(DevRel at Google DeepMind), and
1 more.

BioGPT by microsoft

0.1%
4k
BioGPT is a generative pre-trained transformer for biomedical text
created 3 years ago
updated 1 year ago
Feedback? Help us improve.