llama_cpp_dart by netdur

Dart bindings for llama.cpp

Created 1 year ago
253 stars

Top 99.4% on SourcePulse

View on GitHub
Project Summary

This project provides Dart bindings for the llama.cpp C++ library, enabling developers to integrate advanced text generation capabilities into Dart and Flutter applications. It offers multiple levels of abstraction, from low-level FFI bindings for maximum control to a high-level, object-oriented API and a managed isolate for non-blocking Flutter integration, catering to a wide range of use cases and developer preferences.

How It Works

The library uses Dart's Foreign Function Interface (FFI) to call functions in a compiled llama.cpp shared library directly, allowing efficient execution of large language models. On top of the raw bindings it layers a simplified, object-oriented Dart API, and a managed-isolate wrapper, well suited to Flutter applications, runs inference off the UI thread so the interface stays responsive during generation.
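
To make the isolate layer concrete, here is a minimal sketch of the non-blocking pattern: a load command is handed to a background isolate and generated tokens stream back asynchronously. The identifiers used (LlamaParent, LlamaLoad, sendPrompt, the stream getter, and the parameter classes) are assumptions based on the README's description of the managed-isolate API, not a verified copy of the library's interface.

```dart
import 'dart:io';
import 'package:llama_cpp_dart/llama_cpp_dart.dart';

void main() async {
  Llama.libraryPath = "bin/libllama.dylib"; // your compiled llama.cpp library

  // Assumed API: LlamaLoad bundles the model path and parameters,
  // LlamaParent spawns the isolate that owns the native model.
  final llama = LlamaParent(LlamaLoad(
    path: "models/model.gguf", // hypothetical model path
    modelParams: ModelParams(),
    contextParams: ContextParams(),
    samplingParams: SamplerParams(),
  ));
  await llama.init();

  // Tokens arrive on a stream instead of blocking the caller,
  // so a Flutter UI can render them as they are produced.
  llama.stream.listen(stdout.write, onDone: () => llama.dispose());
  llama.sendPrompt("Explain Dart isolates in one sentence.");
}
```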

Quick Start & Requirements

  • Installation: Add llama_cpp_dart to your pubspec.yaml.
  • Prerequisites: Dart SDK or Flutter SDK, and a compiled llama.cpp shared library (.dylib, .so, or .dll). The llama.cpp repository must be cloned and compiled, ensuring support for your target hardware (CPU, Metal, CUDA, ROCm).
  • Setup: Building llama.cpp may take some time depending on the hardware and compilation options.
  • Documentation: Examples for low-level bindings, high-level wrappers, and managed isolates are available within the repository; a minimal usage sketch follows this list.
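
As a starting point, this is a hedged sketch of the simplest synchronous flow once the shared library is built: point the bindings at the compiled library, load a GGUF model, and pull tokens in a loop. The identifiers (Llama.libraryPath, setPrompt, getNext, dispose) follow the repository's high-level examples as summarized above, but treat them as assumptions rather than a guaranteed API.

```dart
import 'dart:io';
import 'package:llama_cpp_dart/llama_cpp_dart.dart';

void main() {
  // Path to the llama.cpp shared library you compiled (.dylib, .so, or .dll).
  Llama.libraryPath = "build/libllama.so";

  // Load a quantized GGUF model (hypothetical path and file name).
  final llama = Llama("models/mistral-7b-instruct.Q4_K_M.gguf");

  llama.setPrompt("2 * 2 = ?");
  while (true) {
    final (token, done) = llama.getNext(); // one token per call
    stdout.write(token);
    if (done) break;
  }
  llama.dispose(); // release the native model and context
}
```

A blocking loop like this suits command-line tools; Flutter applications should prefer the managed isolate shown earlier so generation does not stall the UI thread.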

Highlighted Details

  • Supports multiple integration levels: low-level FFI, high-level wrapper, and managed isolate.
  • Provides flexibility in choosing prompt formats (e.g., Llama 2, ChatML, Gemma, Mistral) and quantization levels (e.g., F16, Q4_K_M); see the sketch after this list.
  • Offers guidance on model selection based on use cases like text generation, embeddings, code generation, and multilingual support.
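
To illustrate the format flexibility, the sketch below pairs a model with a ChatML-style prompt formatter so prompts are wrapped in the control tokens the model was tuned on. ChatMLFormat and the format parameter are assumptions about how the library names these pieces; check the repository's examples for the exact spelling.

```dart
import 'package:llama_cpp_dart/llama_cpp_dart.dart';

void main() async {
  Llama.libraryPath = "build/libllama.so"; // your compiled llama.cpp library

  final llama = LlamaParent(LlamaLoad(
    path: "models/gemma-2b-it.Q4_K_M.gguf", // hypothetical quantized model
    modelParams: ModelParams(),
    contextParams: ContextParams(),
    samplingParams: SamplerParams(),
    format: ChatMLFormat(), // assumed PromptFormat implementation
  ));
  await llama.init();

  llama.stream.listen(print);
  llama.sendPrompt("Write a haiku about Dart.");
}
```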

Maintenance & Community

The project is hosted on GitHub at netdur/llama_cpp_dart. Specific community links or contributor details are not explicitly mentioned in the README.

Licensing & Compatibility

The project is licensed under the MIT License, which permits commercial use and linking with closed-source applications.

Limitations & Caveats

Inference performance and output quality depend on the underlying llama.cpp build, the chosen model and quantization level, and the host hardware. The README does not publish performance benchmarks for the Dart bindings themselves.

Health Check

  • Last Commit: 2 weeks ago
  • Responsiveness: 1 day
  • Pull Requests (30d): 1
  • Issues (30d): 3
  • Star History: 7 stars in the last 30 days

Explore Similar Projects

Starred by Andrej Karpathy (Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), Gabriel Almeida (Cofounder of Langflow), and 2 more.

torchchat by pytorch

0.1% · 4k stars
PyTorch-native SDK for local LLM inference across diverse platforms
Created 1 year ago · Updated 1 week ago
Starred by Andrej Karpathy (Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), Anil Dash (Former CEO of Glitch), and 23 more.

llamafile by Mozilla-Ocho

0.1% · 23k stars
Single-file LLM distribution and runtime via `llama.cpp` and Cosmopolitan Libc
Created 2 years ago · Updated 2 months ago
Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), Vincent Weisser (Cofounder of Prime Intellect), and 7 more.

dalai by cocktailpeanut

0% · 13k stars
Local LLM inference via CLI tool and Node.js API
Created 2 years ago · Updated 1 year ago