Mobile app demo for local chatbot using llama.cpp
This project provides a mobile implementation of the llama.cpp inference engine, enabling offline chatbot functionality similar to ChatGPT on a variety of devices. It targets users who want to experiment with large language models directly on their mobile phones, offering a convenient way to run these models without relying on cloud services.
How It Works
The app leverages the llama.cpp library, recompiled for mobile platforms. It uses the GGML format for model files, allowing efficient execution of LLaMA and Vicuna models directly on the device's hardware. This approach avoids the need for constant internet connectivity and reduces latency compared to cloud-based solutions.
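To make the on-device flow concrete, here is a minimal sketch of loading a GGML .bin model and generating tokens with the GGML-era llama.cpp C API. The function names (llama_init_from_file, llama_sample_top_p_top_k, and so on) belong to that 2023-era API and have since been replaced in newer llama.cpp releases; this is an illustration of the generation loop, not the app's actual source.

```cpp
// Sketch only: GGML-era llama.cpp C API (circa 2023). Newer llama.cpp
// releases (post-GGUF) renamed or replaced most of these functions.
#include "llama.h"

#include <cstdio>
#include <string>
#include <vector>

int main() {
    llama_context_params params = llama_context_default_params();
    params.n_ctx = 512; // small context window to fit mobile memory

    // The fixed filename the demo expects after the user renames the model
    llama_context * ctx = llama_init_from_file("ggml-model.bin", params);
    if (!ctx) return 1;

    // Tokenize the prompt (final argument: prepend the BOS token)
    std::string prompt = "Hello, how are you?";
    std::vector<llama_token> tokens(params.n_ctx);
    int n = llama_tokenize(ctx, prompt.c_str(), tokens.data(), tokens.size(), true);
    tokens.resize(n > 0 ? n : 0);

    // Evaluate the prompt, then sample up to 64 new tokens one at a time
    int n_past = 0;
    llama_eval(ctx, tokens.data(), tokens.size(), n_past, /*n_threads=*/4);
    n_past += tokens.size();

    for (int i = 0; i < 64; ++i) {
        llama_token id = llama_sample_top_p_top_k(
            ctx, tokens.data(), tokens.size(),
            /*top_k=*/40, /*top_p=*/0.95f, /*temp=*/0.8f, /*repeat_penalty=*/1.1f);
        if (id == llama_token_eos()) break;
        std::printf("%s", llama_token_to_str(ctx, id));
        tokens.push_back(id);
        llama_eval(ctx, &id, 1, n_past++, /*n_threads=*/4);
    }

    llama_free(ctx);
    return 0;
}
```

On a phone, a loop like this runs behind the chat UI; the quantized GGML file is what keeps memory use low enough for device RAM.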
Quick Start & Requirements
Download a supported LLaMA or Vicuna model in GGML format, rename it to ggml-model.bin, and place it in the device's download folder.
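The README does not spell out the exact directory, so the snippet below is a small sketch that checks whether the renamed model sits where an Android device would typically hold downloads; the path /sdcard/Download/ is an assumption, not something the README confirms.

```cpp
// Sketch: verify the renamed model file is in the expected place.
// The Android download-folder path below is an assumption.
#include <cstdio>

int main() {
    const char * model_path = "/sdcard/Download/ggml-model.bin"; // assumed location
    if (std::FILE * f = std::fopen(model_path, "rb")) {
        std::fclose(f);
        std::printf("found model at %s\n", model_path);
        return 0;
    }
    std::fprintf(stderr, "no model at %s; check the download folder\n", model_path);
    return 1;
}
```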
Highlighted Details
Runs LLaMA and Vicuna models distributed as GGML .bin files.
Maintenance & Community
The project is associated with ggerganov/llama.cpp. Further community and development details are not explicitly provided in the README.
Licensing & Compatibility
The project itself appears to be open-source, but it relies on LLaMA models officially distributed by Meta for research purposes. Builds target Android along with Windows and macOS desktops.
Limitations & Caveats
The app developers do not provide the LLaMA models; users must obtain them separately from Meta. The README mentions a demo running on a OnePlus 7 with no speed-up applied, which points to potential performance limits on less powerful hardware.