Bookmarks search engine with knowledge graph
Top 51.2% on sourcepulse
This project provides an open-source personal bookmark search engine that automatically extracts and indexes content from social media platforms like GitHub, HackerNews, and Twitter, along with Zotero documents. It aims to create a navigable knowledge graph of your saved content, enhancing searchability and discovery for users managing extensive digital collections.
How It Works
The system utilizes a GitHub Actions workflow that runs twice daily to fetch starred repositories, liked tweets, upvoted HackerNews posts, and Zotero records. Extracted data is stored in JSON files (database.json
for records, triples.json
for the knowledge graph) and a search index (retriever.pkl
). The application is deployed via Fly.io, with an updated version automatically pushed after data extraction. OpenAI's API can be optionally used for re-ranking search results.
Quick Start & Requirements
flyctl auth login
and fly deploy
. Local development with Docker: export OPENAI_API_KEY="..."
then make launch
.flyctl
client, OpenAI API key (optional), Twitter API token, Zotero API key and library ID. GitHub and Twitter user handles for data sources.Highlighted Details
Maintenance & Community
The project is maintained by Raphaël Sourty. No specific community channels or roadmap links are provided in the README.
Licensing & Compatibility
Limitations & Caveats
The README indicates that the GitHub Pages URLs need manual updating after API deployment. The project is inspired by Semanlink, suggesting potential overlap in functionality or design.
10 hours ago
1 day