sebastian by celaraze

LLM assistant with permanent memory

created 4 years ago
909 stars

Top 40.8% on sourcepulse

Project Summary

Sebastian is an LLM assistant designed to overcome the context limitations of traditional chatbots by providing permanent memory. It targets users who need a conversational AI that can recall past interactions and evolving user preferences across extended periods, offering a more personalized and context-aware experience akin to a digital Jarvis.

How It Works

Sebastian employs Retrieval Augmented Generation (RAG) and embeddings to manage memory, rather than relying on context window tokens. This approach allows it to store and retrieve information indefinitely, enabling features like automatic memory discovery, iteration (updating stored information), association (linking related facts), and consolidation (periodically reviewing and optimizing stored knowledge).
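The mechanism above can be sketched in miniature. The code below is an illustrative toy, not Sebastian's actual implementation: it uses a bag-of-words "embedding" and cosine similarity where a real system would call a learned embedding model, but it shows the core idea of storing facts outside the context window ("memory discovery"), updating them in place ("iteration"), and retrieving only the most relevant ones per query.

```python
import math
from collections import Counter

def embed(text: str) -> Counter:
    """Toy embedding: a bag-of-words term-count vector. A real system
    would use a learned embedding model from an LLM provider."""
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

class MemoryStore:
    """Keeps facts outside the context window, so total memory is not
    bounded by the model's token limit; only top-k matches are retrieved."""

    def __init__(self) -> None:
        self.memories: list[tuple[str, Counter]] = []

    def remember(self, fact: str) -> None:
        # "Memory discovery": persist a new fact with its embedding.
        self.memories.append((fact, embed(fact)))

    def update(self, old: str, new: str) -> None:
        # "Memory iteration": replace an outdated fact with a fresh one.
        self.memories = [(f, v) for f, v in self.memories if f != old]
        self.remember(new)

    def recall(self, query: str, k: int = 2) -> list[str]:
        # Retrieval step of RAG: rank stored facts against the query.
        qv = embed(query)
        ranked = sorted(self.memories, key=lambda m: cosine(qv, m[1]),
                        reverse=True)
        return [fact for fact, _ in ranked[:k]]

store = MemoryStore()
store.remember("the user prefers coffee in the morning")
store.remember("the user's cat is named Miso")
store.remember("the user works in Shanghai")
print(store.recall("what drink does the user like in the morning", k=1))
# → ["the user prefers coffee in the morning"]
```

Only the retrieved facts are injected into the prompt, which is why the store can grow indefinitely while prompt size stays constant.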

Quick Start & Requirements

  • Deployment: Docker Compose is recommended.
  • Prerequisites: Docker and Docker Compose installed.
  • LLM Inference: Requires API keys for either Qwen2-Max (via AliCloud) or GPT-4o (via OpenAI). Local Ollama support is planned.
  • Setup: Edit docker-compose.yml to set LANGUAGE, DASHSCOPE_API_KEY (for Chinese) or OPENAI_API_KEY (for English), and a TOKEN. Run docker compose up -d.
  • API Interaction: Use curl commands to interact via text or audio endpoints.
  • Documentation: See the official Docker website for installation instructions.
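The setup steps above might look like the following. The variable names (LANGUAGE, OPENAI_API_KEY, DASHSCOPE_API_KEY, TOKEN) come from the summary; the port, endpoint path, and request payload are illustrative guesses, so check the project README for the real API shape.

```shell
# 1. Set the required variables in docker-compose.yml (values are placeholders):
#      LANGUAGE: en             # use zh with DASHSCOPE_API_KEY for Qwen2-Max
#      OPENAI_API_KEY: sk-...   # for English / GPT-4o inference
#      TOKEN: my-secret-token   # shared secret for authenticating API calls

# 2. Start the stack in the background.
docker compose up -d

# 3. Talk to the assistant over HTTP. The /chat path and JSON payload below
#    are hypothetical -- consult the repository for the actual endpoints.
curl -X POST http://localhost:8080/chat \
  -H "Authorization: Bearer my-secret-token" \
  -H "Content-Type: application/json" \
  -d '{"message": "Remember that my cat is named Miso."}'
```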

Highlighted Details

  • Permanent memory breaks traditional LLM context limitations.
  • Automatic memory discovery, iteration, association, and consolidation.
  • Supports private deployment for data security.
  • Handles both text and voice conversations.

Maintenance & Community

The project welcomes contributions via issues and pull requests. Sponsorship is available via Afdian.net. Acknowledgments are given to Gitee AI, Alibaba Cloud, and JetBrains.

Licensing & Compatibility

The README does not explicitly state a license. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

Currently relies on cloud-based LLM inference services (Qwen2-Max, GPT-4o), requiring API keys and incurring potential costs. Local Ollama deployment is planned but not yet available. The specific license is not mentioned, which may impact commercial adoption.

Health Check

  • Last commit: 5 months ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 0
  • Issues (30d): 0
  • Star History: 4 stars in the last 90 days

Explore Similar Projects

Starred by Chip Huyen (author of AI Engineering and Designing Machine Learning Systems).

LangBot by langbot-app

Top 0.8% on sourcepulse · 13k stars
IM bot platform for the LLM era
created 2 years ago
updated 6 hours ago
Starred by Andrej Karpathy (founder of Eureka Labs; formerly at Tesla and OpenAI; author of CS 231n), Alex Cheema (cofounder of EXO Labs), and 3 more.

Perplexica by ItzCrazyKns

Top 0.3% on sourcepulse · 23k stars
AI-powered search engine alternative
created 1 year ago
updated 1 day ago