ollama-helm  by otwld

Helm chart for deploying Ollama on Kubernetes

Created 1 year ago
496 stars

Top 62.5% on SourcePulse

Project Summary

This Helm chart provides a Kubernetes deployment for Ollama, enabling users to run large language models locally within a cluster. It targets Kubernetes users, particularly those needing GPU acceleration for LLM inference, and simplifies the setup and management of Ollama instances.

How It Works

The chart deploys Ollama as a Kubernetes Deployment, allowing for configurable resource allocation, GPU integration (NVIDIA and AMD), and persistent storage via PersistentVolumeClaims. It supports pre-loading models at startup and creating models from templates, offering flexibility in LLM deployment.
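As a minimal sketch of the configuration described above, the snippet below writes a values file that enables a GPU, pulls one model at startup, and turns on persistent storage. The key names (`ollama.gpu`, `ollama.models.pull`, `persistentVolume`) are recalled from the chart's README and should be treated as assumptions; the model name, file name, and volume size are hypothetical. Confirm the real keys with `helm show values otwld/ollama` before relying on them.

```shell
# Write an example values file. Key names are assumptions based on the
# chart's README; verify them with: helm show values otwld/ollama
cat > ollama-values.yaml <<'EOF'
ollama:
  gpu:
    enabled: true      # request GPU resources for the Ollama pod
    type: nvidia       # the chart also documents an AMD option
    number: 1          # number of GPUs to request
  models:
    pull:
      - llama3.2       # hypothetical model, pulled when the pod starts
persistentVolume:
  enabled: true        # keep downloaded models on a PersistentVolumeClaim
  size: 30Gi           # hypothetical size; model files can be large
EOF
echo "wrote ollama-values.yaml"
```

The file can then be passed to the install command from the Quick Start section by appending `--values ollama-values.yaml`.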

Quick Start & Requirements

  • Install:
    helm repo add otwld https://otwld.github.io/ollama-helm/
    helm repo update
    helm install ollama otwld/ollama --namespace ollama --create-namespace
    
  • Requirements: Kubernetes >= 1.16.0-0 for CPU-only deployments, >= 1.26.0-0 for GPU deployments. GPU support also requires compatible NVIDIA or AMD hardware with the matching drivers installed on the nodes.
  • Docs: Ollama Documentation, Ollama-Helm Chart
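The version requirements can be checked before installing. The sketch below is one way to do that in shell; `version_ge` and `check_k8s` are hypothetical helper names, the comparison assumes a `sort` that supports `-V` (GNU coreutils), and the `-0` pre-release suffix from the chart's constraint is dropped for simplicity.

```shell
# version_ge A B: succeeds when version A >= version B.
# Assumes `sort -V` (version sort) is available.
version_ge() {
  [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)" = "$2" ]
}

# check_k8s VERSION MODE: MODE is "cpu" or "gpu".
# Minimum versions are taken from the requirements listed above.
check_k8s() {
  case "$2" in
    gpu) min="1.26.0" ;;
    *)   min="1.16.0" ;;
  esac
  if version_ge "$1" "$min"; then
    echo "ok: $1 >= $min ($2)"
  else
    echo "too old: $1 < $min ($2)"
    return 1
  fi
}

# Example with a hard-coded version string (strip any leading "v" first).
check_k8s "1.27.3" gpu
```

In practice the version string would come from the cluster itself, for example from the `serverVersion` field reported by `kubectl version`.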

Highlighted Details

  • GPU support for NVIDIA and AMD, including MIG for NVIDIA.
  • Ability to pull and run specified models on startup.
  • Support for creating models from templates or ConfigMaps.
  • Optional Ingress configuration for external access.
  • Persistent storage for Ollama data.
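Two of the features listed above, creating a model from a template and exposing Ollama through an Ingress, could be configured roughly as follows. This is a sketch under assumptions: the `ollama.models.create` and `ingress` keys are recalled from the chart's README rather than confirmed here, and the model name, host name, and file name are hypothetical.

```shell
# Write an example values file for model creation and Ingress.
# Key names are assumptions; verify with: helm show values otwld/ollama
cat > ollama-extras.yaml <<'EOF'
ollama:
  models:
    create:
      - name: llama3.1-ctx32768      # hypothetical name for the derived model
        template: |
          FROM llama3.1
          PARAMETER num_ctx 32768
ingress:
  enabled: true
  hosts:
    - host: ollama.example.com       # hypothetical host name
      paths:
        - path: /
          pathType: Prefix
EOF
echo "wrote ollama-extras.yaml"
```

The `template` body uses Ollama's Modelfile syntax, so the same text could be tested locally with `ollama create` before being placed in the chart values.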

Maintenance & Community

  • Maintained by Jean Baptiste Detroyes and Nathan Tréhout.
  • Community support via OTWLD Discord and Ollama-Helm GitHub issues.

Licensing & Compatibility

  • The README does not state the chart's license explicitly, so check the repository's LICENSE file rather than assuming a permissive license such as Apache 2.0. Ollama is licensed separately; consult its license for usage terms.

Limitations & Caveats

  • GPU support depends on the specific hardware and Kubernetes version. Not every GPU is guaranteed to work, and AMD GPUs in particular may be unsupported.
  • Upgrading from older chart versions (0.X.X to 1.X.X) requires migration of model configuration.
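The migration mentioned above concerns how models are listed in the values file. As an illustration only, and assuming the change is the one recalled from the chart's upgrade notes (a flat `ollama.models` list in 0.X.X moving under `ollama.models.pull` in 1.X.X), the two layouts would look like this. The file names and model name are hypothetical; check the chart's own upgrade notes before upgrading.

```shell
# Assumed 0.X.X layout: models given as a flat list.
cat > values-0x.yaml <<'EOF'
ollama:
  models:
    - llama2
EOF

# Assumed 1.X.X layout: the same list moved under "pull".
cat > values-1x.yaml <<'EOF'
ollama:
  models:
    pull:
      - llama2
EOF

# Show the difference; diff exits non-zero when files differ, hence "|| true".
diff values-0x.yaml values-1x.yaml || true
```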

Health Check

  • Last commit: 1 day ago
  • Responsiveness: 1 day
  • Pull requests (30d): 3
  • Issues (30d): 0

Star History

  • 16 stars in the last 30 days
