catai by withcatai

CLI tool for local AI assistant using GGUF models

Created 2 years ago

479 stars

Top 63.8% on SourcePulse

1 Expert Loves This Project

tobi

Cofounder of Shopify

Project Summary

CatAI provides a local AI assistant experience, enabling users to run GGUF models on their own computers with a chat UI and a simple Node.js API. It targets developers and power users seeking to leverage large language models offline, offering features like real-time streaming and fast model downloads.

How It Works

CatAI utilizes node-llama-cpp, a Node.js binding for llama.cpp, to run GGUF models efficiently. This approach allows for cross-platform compatibility (Windows, Linux, macOS) and leverages the performance optimizations of llama.cpp for local inference. The project also includes a CLI for model management and a web API for programmatic interaction.

Quick Start & Requirements

Install globally: npm install -g catai
Install a model: catai install meta-llama-3-8b-q4_k_m
Start the server: catai up
Requires Node.js.
Supports multiple platforms including darwin-x64, linux-x64, win32-x64-msvc.
Model downloads default to ~/catai.

Highlighted Details

Auto-detects programming language.
Real-time text streaming.
Fast, multi-threaded model downloads.
Offers a development API for programmatic interaction with models, including JSON schema grammar support.

Maintenance & Community

Project is actively maintained.
Contributions are welcome via a contributing guide.

Licensing & Compatibility

MIT License for the CatAI package itself.
Subject to the llama.cpp license, which is typically MIT.
Compatible with commercial and closed-source applications.

Limitations & Caveats

The project relies on node-llama-cpp which is in beta. Specific platform support depends on the underlying llama.cpp build for node-llama-cpp.

Health Check

Last Commit

1 month ago

Responsiveness

Inactive

Pull Requests (30d)

0

Issues (30d)

0

Star History

0 stars in the last 30 days

Explore Similar Projects

Starred by

Jeffrey Morgan

Jeffrey Morgan(Cofounder of Ollama).

ollama-ai by gbaptista

Ruby SDK for local LLM interaction via Ollama API

Created 2 years ago

Updated 1 year ago

Starred by

Victor Taelin

Victor Taelin(Author of Bend, Kind, HVM).

IntelliNode by intelligentnode

JS module for unified AI model access

Created 2 years ago

Updated 2 months ago

lmrouter by LMRouter

AI API router for unified access to diverse model providers

Created 5 months ago

Updated 3 months ago

allchat by msveshnikov

Multimodal AI chat client connecting diverse models and tools

Created 1 year ago

Updated 11 months ago

omnitool by omnitool-ai

Open-source "AI Lab in a box" desktop app for generative AI model interaction

Created 2 years ago

Updated 1 year ago

LlamaPen by ImDarkTom

Web-based GUI for local LLMs

Created 11 months ago

Updated 2 months ago

Starred by

Abhishek Thakur

Abhishek Thakur(World's First 4x Kaggle GrandMaster) and

Lewis Tunstall

Lewis Tunstall(Research Engineer at Hugging Face).

aiaio by abhishekkrthakur

Web UI for interacting with AI models

Created 11 months ago

Updated 1 month ago

Alpaca by Jeffser

Ollama client for local AI model management and chat

Created 1 year ago

Updated 3 days ago

Starred by

Vincent Weisser

Vincent Weisser(Cofounder of Prime Intellect),

Tomas Valenta

Tomas Valenta(Cofounder of E2B), and

6 more.

GodMode by smol-ai

AI chat browser for accessing multiple LLMs' web apps

Created 2 years ago

Updated 1 year ago

Starred by

Yaowei Zheng

Yaowei Zheng(Author of LLaMA-Factory),

Junyang Lin

Junyang Lin(Core Maintainer at Alibaba Qwen), and

2 more.

inference by xorbitsai

Model serving library for language, speech, and multimodal models

Created 2 years ago

Updated 1 day ago

Starred by

Chip Huyen

Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems") and

Yaowei Zheng

Yaowei Zheng(Author of LLaMA-Factory).

AstrBot by AstrBotDevs

LLM chatbot/framework for multiple platforms

Created 3 years ago

Updated 15 hours ago

Starred by

Ettore Di Giacinto

Ettore Di Giacinto(Author of LocalAI),

Justin Torre

Justin Torre(Cofounder of Helicone), and

2 more.

big-AGI by enricoros

AI suite for advanced AI/AGI functions, deployable on-prem or cloud

Created 2 years ago

Updated 23 hours ago

Feedback? Help us improve.