Discover and explore top open-source AI tools and projects—updated daily.
xr843Encyclopedic Buddhist digital text platform
Top 88.0% on SourcePulse
Summary
FoJin is a comprehensive Buddhist digital text platform addressing the fragmentation of Buddhist heritage across global databases. It offers researchers and practitioners a unified interface for accessing over 23,500 full-text volumes from 503 sources in 30 languages, enhanced by AI-powered Q&A, a knowledge graph, and advanced search. The platform significantly reduces text discovery time, enabling deeper engagement with Buddhist studies.
How It Works
FoJin aggregates diverse Buddhist digital heritage from 503 sources using PostgreSQL with pgvector for semantic search (HNSW index) and Elasticsearch for keyword search. Its backend is built with FastAPI. Key features include AI Q&A ("XiaoJin") via RAG with multiple LLM providers and tradition-scoped "Master Personas," a knowledge graph visualizing 31K+ entities and 28K+ relations, and a geo-map of 50K+ entities. Data is imported via Python scripts; text content is fetched from original sources, not bundled.
Quick Start & Requirements
Installation uses Docker Compose (docker compose up -d). Prerequisites include Docker and a configured .env file. Initial setup requires importing text content via provided Python scripts (e.g., import_content.py). Dependencies include PostgreSQL, Elasticsearch, and Redis.
Highlighted Details
Maintenance & Community
Active maintenance is indicated by CI/CD badges. Community engagement is facilitated via GitHub Discussions and Discord. Bug reporting and contributions are managed through GitHub Issues and a CONTRIBUTING.md file.
Licensing & Compatibility
The FoJin source code is Apache 2.0 licensed. However, integrated third-party data sources retain their own licenses (e.g., CC BY-NC-SA, CC0). Users must consult the NOTICE file for specific data license details. Commercial use compatibility depends on these varied data licenses.
Limitations & Caveats
Initial setup requires a separate, potentially time-consuming, data import process for text content. Cross-lingual search is a planned future feature. Commercial usability is contingent on the licenses of the numerous third-party data sources.
2 days ago
Inactive
jzbjyb
meilisearch