kronk by ardanlabs

Go-native engine for local AI model inference

Created 7 months ago

682 stars

Top 49.0% on SourcePulse

Project Summary

This project addresses the need for efficient, hardware-accelerated local inference of open-source LLMs within Go applications. It targets Go developers seeking to integrate AI capabilities directly into their software without relying on external APIs. Kronk provides a high-level, OpenAI-compatible interface and a model server, simplifying the adoption of local AI models.

How It Works

Kronk embeds llama.cpp into Go applications using the yzma module for efficient, hardware-accelerated GGUF model inference. It exposes a familiar, OpenAI-compatible API for chat completions, embeddings, and reranking. A model server component further simplifies deployment and interaction with local models.

Quick Start & Requirements

Install the CLI via go install github.com/ardanlabs/kronk/cmd/kronk@latest. Run the model server with $ make kronk-server or $ kronk server start. Requires the Go toolchain and GGUF models. Extensive hardware acceleration is supported across Linux, macOS, and Windows. Documentation and examples are available via https://kronkai.com.

Highlighted Details

Comprehensive hardware acceleration: Linux (CUDA, Vulkan, HIP, ROCm, SYCL), macOS (Metal), Windows (CUDA, Vulkan, HIP, SYCL, OpenCL).
Supports text, audio, and vision multimodal models.
Model server compatible with OpebWebUI, Cline, and Claude Code.
yzma provides support for over 94% of llama.cpp functionality.

Maintenance & Community

Owned by Ardan Labs (Bill Kennedy). Contact: hello@ardanlabs.com. Community contributions are encouraged. Social links for the owner are provided.

Licensing & Compatibility

Copyright 2025-2026 Ardan Labs. No specific open-source license is stated in the README. This requires further investigation for commercial use or integration into proprietary applications.

Limitations & Caveats

yzma supports ~94% of llama.cpp features; consult yzma's ROADMAP.md for specifics. The project appears recent, with copyright dates 2025-2026.

Health Check

Last Commit

1 day ago

Responsiveness

Inactive

Pull Requests (30d)

43

Issues (30d)

11

Star History

31 stars in the last 30 days

Explore Similar Projects

Starred by

Jeffrey Morgan

Jeffrey Morgan(Cofounder of Ollama).

ollama-ai by gbaptista

Ruby SDK for local LLM interaction via Ollama API

Created 2 years ago

Updated 1 year ago

mlx-serve by ddalcu

Native LLM inference server for Apple Silicon

Created 4 months ago

Updated 7 hours ago

SwiftAI by mi12labs

Swift library for building LLM apps on iOS and macOS

Created 10 months ago

Updated 6 months ago

Lynkr by Fast-Editor

Universal LLM proxy for AI coding tools

Created 7 months ago

Updated 8 hours ago

qvac by tetherto

Local-first, P2P AI SDK for cross-platform applications

Created 6 months ago

Updated 5 hours ago

Atomic-Chat by AtomicBot-ai

Local AI chat and assistant platform

Created 3 months ago

Updated 1 day ago

proxy by routatic

LLM API proxy enabling Claude Code to use OpenCode Go models

Created 2 months ago

Updated 3 days ago

deepseek4j by pig-mesh

Java SDK for DeepSeek models

Created 1 year ago

Updated 2 months ago

picobot by louisho5

AI agent for universal deployment

Created 5 months ago

Updated 3 months ago

Starred by

Shawn Wang

Shawn Wang(Editor of Latent Space).

react-native-ai by dabit3

Full-stack framework for cross-platform mobile AI app development

Created 2 years ago

Updated 22 hours ago

shimmy by Michael-A-Kuykendall

A lightweight, local-first AI inference server

Created 10 months ago

Updated 1 week ago

Starred by

Tobi Lutke

Tobi Lutke(Cofounder of Shopify),

Simon Willison

Simon Willison(Coauthor of Django), and

12 more.

jan by janhq

Local AI assistant for offline LLM use

Created 2 years ago

Updated 10 hours ago

Feedback? Help us improve.