gemma by google-deepmind

JAX library for using and fine-tuning Gemma LLMs

created 1 year ago
3,564 stars

Top 13.8% on sourcepulse

Project Summary

Gemma is a family of open-weights Large Language Models (LLMs) from Google DeepMind, built on Gemini research. This repository provides a JAX-based library for running and fine-tuning Gemma models, aimed at researchers and developers working with advanced AI models.

How It Works

Gemma is implemented as a JAX library, allowing for efficient computation on CPUs, GPUs, and TPUs. The library provides pre-trained model checkpoints and tools for sampling, including multi-turn and multi-modal conversations. Its design leverages JAX's automatic differentiation and hardware acceleration capabilities for both inference and fine-tuning.
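
To make the role of automatic differentiation and hardware acceleration concrete, here is a minimal sketch in plain JAX. It does not use the Gemma library: the toy linear model, `loss_fn`, `sgd_step`, and `LEARNING_RATE` are hypothetical names chosen for illustration. It only shows the two JAX primitives the paragraph above refers to, `jax.grad` and `jax.jit`.

```python
import jax
import jax.numpy as jnp

LEARNING_RATE = 0.1  # illustrative value, not taken from the Gemma repository


def loss_fn(params, x, y):
    """Mean squared error of a toy linear model (a stand-in, not a Gemma model)."""
    pred = x @ params["w"] + params["b"]
    return jnp.mean((pred - y) ** 2)


@jax.jit  # compiled for whichever backend JAX finds: CPU, GPU, or TPU
def sgd_step(params, x, y):
    # jax.grad differentiates loss_fn with respect to its first argument.
    grads = jax.grad(loss_fn)(params, x, y)
    return jax.tree_util.tree_map(
        lambda p, g: p - LEARNING_RATE * g, params, grads
    )


key = jax.random.PRNGKey(0)
x = jax.random.normal(key, (32, 4))
y = x @ jnp.array([1.0, -2.0, 0.5, 3.0]) + 0.25

params = {"w": jnp.zeros(4), "b": jnp.zeros(())}
initial_loss = float(loss_fn(params, x, y))
for _ in range(200):
    params = sgd_step(params, x, y)
final_loss = float(loss_fn(params, x, y))

print("backend:", jax.devices()[0].platform)
print("loss before:", initial_loss, "loss after:", final_loss)
```

The same pattern (a pure loss function, `jax.grad` for gradients, `jax.jit` for compilation) is what lets one code path serve both inference and fine-tuning across devices.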

Quick Start & Requirements

Highlighted Details

  • Supports multi-turn and multi-modal conversations.
  • Includes examples for fine-tuning and LoRA.
  • Based on Google DeepMind's Gemini research.
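
As a rough illustration of the LoRA idea mentioned in the list above, the following sketch adds a trainable low-rank update to a frozen weight matrix in plain JAX. It is not the repository's LoRA API: `init_lora`, `lora_dense`, the `alpha` scaling, and all shapes are assumptions made for this example.

```python
import jax
import jax.numpy as jnp


def init_lora(key, d_in, d_out, rank):
    """A is small and random, B is zero, so the adapter starts as a no-op."""
    a = jax.random.normal(key, (d_in, rank)) * 0.01
    b = jnp.zeros((rank, d_out))
    return {"a": a, "b": b}


def lora_dense(x, w_frozen, lora, alpha=1.0):
    """Frozen projection plus a scaled low-rank update: x @ (W + alpha/r * A @ B)."""
    rank = lora["a"].shape[1]
    return x @ w_frozen + (alpha / rank) * (x @ lora["a"]) @ lora["b"]


def loss_fn(lora, w_frozen, x, y):
    return jnp.mean((lora_dense(x, w_frozen, lora) - y) ** 2)


key_w, key_a, key_x = jax.random.split(jax.random.PRNGKey(0), 3)
d_in, d_out, rank = 16, 8, 2
w_frozen = jax.random.normal(key_w, (d_in, d_out))  # stands in for a pre-trained weight
lora = init_lora(key_a, d_in, d_out, rank)
x = jax.random.normal(key_x, (4, d_in))
y = jnp.ones((4, d_out))

# Differentiate only with respect to the adapter; w_frozen receives no gradient.
grads = jax.grad(loss_fn)(lora, w_frozen, x, y)

n_lora = sum(p.size for p in jax.tree_util.tree_leaves(lora))
print("trainable adapter parameters:", n_lora)
print("frozen parameters:", w_frozen.size)
```

Only the two small matrices are trained, which is why LoRA fine-tuning needs far less optimizer state than updating the full weight matrix.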

Maintenance & Community

This project is not an official Google product. Contributions are welcome; see the repository's contributing guidelines.

Licensing & Compatibility

The repository does not explicitly state a license in the provided README. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

Because the README does not state a license, confirm the licensing terms in the repository before any commercial use. Model weights are not bundled with the library and must be downloaded separately, as described in the documentation.

Health Check

  • Last commit: 1 day ago
  • Responsiveness: 1 day
  • Pull requests (30d): 21
  • Issues (30d): 8

Star History

355 stars in the last 90 days

Explore Similar Projects

Starred by Jiayi Pan (Author of SWE-Gym; AI Researcher at UC Berkeley), Thomas Wolf (Cofounder of Hugging Face), and 3 more.

levanter by stanford-crfm

0.5%
628
Framework for training foundation models with JAX
created 3 years ago
updated 23 hours ago
Starred by Andrej Karpathy (Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), Jeff Hammerbacher (Cofounder of Cloudera), and 2 more.

gemma_pytorch by google

0.1%
6k
PyTorch implementation for Google's Gemma models
created 1 year ago
updated 2 months ago