karakeep  by karakeep-app

Self-hostable app for bookmarking links, notes, images, and PDFs

Created 1 year ago
19,785 stars

Top 2.2% on SourcePulse

GitHubView on GitHub
Project Summary

Karakeep is a self-hostable application designed for users who want to manage a diverse collection of digital content, including links, notes, and images. It aims to solve the problem of disorganized digital hoarding by providing features like automatic content fetching, full-text search, and AI-powered tagging, making it easier to organize and retrieve saved items.

How It Works

Karakeep utilizes a modern web stack including NextJS with the App Router, Drizzle for database management, and tRPC for client-server communication. It employs Puppeteer for crawling bookmark content and Meilisearch for efficient full-text search capabilities. A key differentiator is its AI-based automatic tagging, which can leverage OpenAI or local models via Ollama, enhancing content discoverability. It also includes OCR for text extraction from images and Monolith for full-page archival to combat link rot.

Quick Start & Requirements

  • Installation and setup instructions are available in the Documentation.
  • A public demo is available at https://try.karakeep.app with credentials: demo@karakeep.app / demodemo.
  • The project is built with NextJS, Drizzle, NextAuth, tRPC, Puppeteer, OpenAI, and Meilisearch. Specific dependency versions are not explicitly listed in the README.

Highlighted Details

  • Supports bookmarking links, notes, and images/PDFs.
  • Features automatic fetching of link titles, descriptions, and images.
  • Includes AI-based automatic tagging (OpenAI or local Ollama models) and OCR for image text extraction.
  • Offers browser extensions (Chrome, Firefox), mobile apps (iOS, Android), and full-page archival.

Maintenance & Community

The project is under heavy development and is described as "far from stable." There are no specific mentions of notable contributors, sponsorships, or partnerships in the README. Community support channels like Discord or Slack are not explicitly linked.

Licensing & Compatibility

The README does not specify a license. This is a critical omission for evaluating commercial use or closed-source integration.

Limitations & Caveats

The application is explicitly stated to be under "heavy development" and "far from stable," indicating potential for bugs and breaking changes. The lack of a specified license is a significant caveat for adoption.

Health Check
Last Commit

4 days ago

Responsiveness

1 day

Pull Requests (30d)
40
Issues (30d)
93
Star History
585 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems") and Simon Willison Simon Willison(Coauthor of Django).

semantra by freedmand

0.1%
3k
CLI tool for semantic document search
Created 2 years ago
Updated 1 year ago
Feedback? Help us improve.