kronk  by ardanlabs

Go-native engine for local AI model inference

Created 6 months ago
633 stars

Top 52.0% on SourcePulse

GitHubView on GitHub
Project Summary

This project addresses the need for efficient, hardware-accelerated local inference of open-source LLMs within Go applications. It targets Go developers seeking to integrate AI capabilities directly into their software without relying on external APIs. Kronk provides a high-level, OpenAI-compatible interface and a model server, simplifying the adoption of local AI models.

How It Works

Kronk embeds llama.cpp into Go applications using the yzma module for efficient, hardware-accelerated GGUF model inference. It exposes a familiar, OpenAI-compatible API for chat completions, embeddings, and reranking. A model server component further simplifies deployment and interaction with local models.

Quick Start & Requirements

Install the CLI via go install github.com/ardanlabs/kronk/cmd/kronk@latest. Run the model server with $ make kronk-server or $ kronk server start. Requires the Go toolchain and GGUF models. Extensive hardware acceleration is supported across Linux, macOS, and Windows. Documentation and examples are available via https://kronkai.com.

Highlighted Details

  • Comprehensive hardware acceleration: Linux (CUDA, Vulkan, HIP, ROCm, SYCL), macOS (Metal), Windows (CUDA, Vulkan, HIP, SYCL, OpenCL).
  • Supports text, audio, and vision multimodal models.
  • Model server compatible with OpebWebUI, Cline, and Claude Code.
  • yzma provides support for over 94% of llama.cpp functionality.

Maintenance & Community

Owned by Ardan Labs (Bill Kennedy). Contact: hello@ardanlabs.com. Community contributions are encouraged. Social links for the owner are provided.

Licensing & Compatibility

Copyright 2025-2026 Ardan Labs. No specific open-source license is stated in the README. This requires further investigation for commercial use or integration into proprietary applications.

Limitations & Caveats

yzma supports ~94% of llama.cpp features; consult yzma's ROADMAP.md for specifics. The project appears recent, with copyright dates 2025-2026.

Health Check
Last Commit

1 day ago

Responsiveness

Inactive

Pull Requests (30d)
81
Issues (30d)
13
Star History
190 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems").

JittorLLMs by Jittor

0%
2k
Low-resource LLM inference library
Created 3 years ago
Updated 1 year ago
Feedback? Help us improve.