Discover and explore top open-source AI tools and projects—updated daily.
deusyuAI-powered book translation pipeline
New!
Top 54.1% on SourcePulse
This project provides a Claude Code skill for translating entire books (PDF, DOCX, EPUB) into various languages using a distributed, parallel subagent architecture. It targets users needing efficient, robust, and accurate book translation, offering a significant improvement over single-session translation methods by preventing context truncation and enabling resumable workflows.
How It Works
The core innovation lies in restructuring book translation as a Claude Code Skill. Input documents are converted to Markdown, then split into manageable chunks. Each chunk is processed by an independent subagent with a fresh context window, enabling parallel translation (defaulting to 8 concurrent agents) and preventing context accumulation or output truncation. A manifest tracks chunk hashes, allowing for manifest-driven integrity checks and validation before merging translated chunks. The pipeline then unifies these chunks, converting them back into multiple output formats (HTML, DOCX, EPUB, PDF) via Pandoc and Calibre.
Quick Start & Requirements
npx skills add deusyu/translate-book -a claude-code -g (recommended). Alternatives include clawhub install translate-book or Git cloning. Translation is initiated within Claude Code via translate /path/to/book.pdf to [language].ebook-convert in PATH), Pandoc, Python 3 with pypandoc (pip install pypandoc), and optionally beautifulsoup4 (pip install beautifulsoup4).Highlighted Details
Maintenance & Community
No specific details regarding contributors, sponsorships, or community channels (e.g., Discord, Slack) are provided in the README.
Licensing & Compatibility
The project is released under the MIT License. This license generally permits commercial use and integration into closed-source projects without significant restrictions.
Limitations & Caveats
Successful operation requires the installation and correct configuration of external tools: Claude Code CLI, Calibre, and Pandoc. Re-running the translation for minor changes (e.g., metadata, templates) necessitates either a fresh run or manual deletion of existing output artifacts. Manifest validation failures indicate that source chunks have been altered since the initial conversion. PDF generation is dependent on Calibre's PDF output capabilities.
1 week ago
Inactive
datalab-to
opendatalab