Z80-μLM is a novel micro language model designed for extreme resource constraints, enabling conversational AI on 8-bit Z80 processors with only 64KB of RAM. It targets retrocomputing enthusiasts and researchers interested in the minimal requirements for AI personality. The project delivers a functional chatbot and game playable on vintage hardware, demonstrating that even minimal systems can exhibit engaging, albeit simple, AI interactions.
How It Works
The core innovation lies in aggressive quantization and a unique input encoding. Models use 2-bit weights ({-2, -1, 0, +1}), packed four per byte, with all inference using Z80-native 16-bit integer arithmetic, eschewing floating-point. Input text is transformed into an abstract "tag cloud" via trigram hashing into 128 buckets, providing tolerance for typos and word order variations. This enables a ~40KB .COM binary to run inference character-by-character on a 4MHz Z80.
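The two ideas above can be illustrated with a short Python sketch. It is not the project's code: the hash function, the mapping of 2-bit codes to weights, the bit order within a byte, and all function names (`trigram_buckets`, `pack_weights`, `unpack_weights`, `dot_int16`) are assumptions chosen for illustration. Only the 128 buckets, the weight set {-2, -1, 0, +1}, the four-per-byte packing, and the 16-bit integer arithmetic come from the description.

```python
# Illustrative sketch only. The hash, code-to-weight mapping, and bit order
# are assumptions, not the project's actual implementation.

NUM_BUCKETS = 128                  # bucket count stated in the description
CODE_TO_WEIGHT = (-2, -1, 0, 1)    # assumed mapping: 2-bit code -> weight


def trigram_buckets(text):
    """Count hashed character trigrams of `text` into NUM_BUCKETS buckets."""
    counts = [0] * NUM_BUCKETS
    cleaned = " " + text.lower() + " "      # pad so edge characters count
    for i in range(len(cleaned) - 2):
        h = 0
        for ch in cleaned[i:i + 3]:
            h = (h * 31 + ord(ch)) & 0xFFFF  # hypothetical 16-bit hash
        counts[h % NUM_BUCKETS] += 1
    return counts


def pack_weights(weights):
    """Pack weights from {-2, -1, 0, +1} four per byte, lowest bits first."""
    if len(weights) % 4 != 0:
        raise ValueError("weight count must be a multiple of 4")
    out = bytearray()
    for i in range(0, len(weights), 4):
        byte = 0
        for j, w in enumerate(weights[i:i + 4]):
            byte |= CODE_TO_WEIGHT.index(w) << (2 * j)
        out.append(byte)
    return bytes(out)


def unpack_weights(packed):
    """Inverse of pack_weights: expand each byte into four weights."""
    return [CODE_TO_WEIGHT[(b >> (2 * j)) & 3] for b in packed for j in range(4)]


def dot_int16(packed, activations):
    """Integer-only dot product, wrapped to a signed 16-bit accumulator."""
    acc = 0
    for w, x in zip(unpack_weights(packed), activations):
        acc = (acc + w * x) & 0xFFFF         # emulate 16-bit wraparound
    return acc - 0x10000 if acc & 0x8000 else acc


if __name__ == "__main__":
    features = trigram_buckets("hello there")
    packed = pack_weights([-2, -1, 0, 1])
    print(sum(features), packed.hex(), dot_int16(packed, [3, 1, 5, 2]))
```

Because each character takes part in at most three trigrams, a single-character typo changes at most three trigram entries and leaves the rest of the bucket vector intact, which is where the typo tolerance comes from. Packing four weights per byte cuts weight storage to a quarter of one byte per weight, which matters when the whole program has to fit in a 64KB address space.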
Quick Start & Requirements
Training is done in Python, and trained models are exported as CP/M .COM binaries. To interact with a model, run the compiled binary directly on a CP/M system. Prerequisites are a Z80 CPU, 64KB of RAM, and a CP/M environment. Tools for generating training data with Ollama or the Claude API are included. Training is covered in more detail in TRAINING.md.
Highlighted Details
tinychat conversational bot and guess 20 Questions game.
Maintenance & Community
The README gives no details on maintainers, community channels (such as Discord or Slack), or a project roadmap.
Licensing & Compatibility
The project is offered under either the MIT or Apache-2.0 license, allowing for flexible use and integration. It is specifically designed for compatibility with CP/M operating systems and vintage Z80-based hardware.
Limitations & Caveats
Input processing relies on abstract hashing rather than semantic understanding, which limits the model's grasp of grammar, its ability to track context across many turns, and its capacity to generate novel sentences. It does not approach general intelligence; it is a system for personality and interaction on highly constrained hardware.