kronk  by ardanlabs

Go-native engine for local AI model inference

Created 4 months ago
259 stars

Top 97.7% on SourcePulse

GitHubView on GitHub
Project Summary

This project addresses the need for efficient, hardware-accelerated local inference of open-source LLMs within Go applications. It targets Go developers seeking to integrate AI capabilities directly into their software without relying on external APIs. Kronk provides a high-level, OpenAI-compatible interface and a model server, simplifying the adoption of local AI models.

How It Works

Kronk embeds llama.cpp into Go applications using the yzma module for efficient, hardware-accelerated GGUF model inference. It exposes a familiar, OpenAI-compatible API for chat completions, embeddings, and reranking. A model server component further simplifies deployment and interaction with local models.

Quick Start & Requirements

Install the CLI via go install github.com/ardanlabs/kronk/cmd/kronk@latest. Run the model server with $ make kronk-server or $ kronk server start. Requires the Go toolchain and GGUF models. Extensive hardware acceleration is supported across Linux, macOS, and Windows. Documentation and examples are available via https://kronkai.com.

Highlighted Details

  • Comprehensive hardware acceleration: Linux (CUDA, Vulkan, HIP, ROCm, SYCL), macOS (Metal), Windows (CUDA, Vulkan, HIP, SYCL, OpenCL).
  • Supports text, audio, and vision multimodal models.
  • Model server compatible with OpebWebUI, Cline, and Claude Code.
  • yzma provides support for over 94% of llama.cpp functionality.

Maintenance & Community

Owned by Ardan Labs (Bill Kennedy). Contact: hello@ardanlabs.com. Community contributions are encouraged. Social links for the owner are provided.

Licensing & Compatibility

Copyright 2025-2026 Ardan Labs. No specific open-source license is stated in the README. This requires further investigation for commercial use or integration into proprietary applications.

Limitations & Caveats

yzma supports ~94% of llama.cpp features; consult yzma's ROADMAP.md for specifics. The project appears recent, with copyright dates 2025-2026.

Health Check
Last Commit

2 days ago

Responsiveness

Inactive

Pull Requests (30d)
43
Issues (30d)
2
Star History
43 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems").

JittorLLMs by Jittor

0.0%
2k
Low-resource LLM inference library
Created 3 years ago
Updated 1 year ago
Feedback? Help us improve.