Discover and explore top open-source AI tools and projects—updated daily.
PratiyushLLM-powered knowledge base from coding sessions
Top 97.7% on SourcePulse
Summary
llm-wiki transforms dormant LLM session transcripts from tools like Claude Code, Codex CLI, Copilot, Cursor, and Gemini into a searchable, interlinked knowledge base. It provides both human-readable static websites and machine-readable exports for AI agents, enabling deeper utilization of past AI interactions.
How It Works
This project implements Andrej Karpathy's LLM Wiki pattern, converting raw session .jsonl files into a multi-layered wiki structure. It first ingests sessions into immutable markdown (raw/), then generates LLM-enhanced wiki pages (wiki/) for sources, entities, and concepts, interlinked via [[wikilinks]]. Finally, it compiles a static HTML site (site/) with global search and AI-consumable exports, leveraging Python's standard library for core functionality.
Quick Start & Requirements
./setup.sh (macOS/Linux) or setup.bat (Windows) for a one-time setup, or pip install -e . for basic installation../build.sh && ./serve.sh to build the static site and start a local development server at http://127.0.0.1:8765.[graph]) or end-to-end testing ([e2e]). Syntax highlighting is loaded from a CDN.pratiyush.github.io/llm-wiki. Full documentation: docs/index.md.Highlighted Details
llms.txt, llms-full.txt, graph.jsonld, per-page .txt/.json siblings, sitemap.xml, rss.xml, and robots.txt.localhost.Maintenance & Community
The project has a clear release history with detailed milestones and versioning, indicating active development. Contribution guidelines are available in CONTRIBUTING.md. No explicit community channels (e.g., Discord, Slack) are listed.
Licensing & Compatibility
The project is released under the permissive MIT license, allowing for commercial use and integration into closed-source projects without significant restrictions.
Limitations & Caveats
Several LLM adapters, including Cursor, Gemini CLI, Copilot Chat, and Copilot CLI, are marked as Beta and require verification against current session formats. The core functionality relies on specific .jsonl transcript formats from supported agents.
1 day ago
Inactive
johnbean393
firecrawl