Paper2Any by OpenDCAI

AI transforms research papers into editable figures and slides

Created 4 months ago
1,529 stars

Top 27.0% on SourcePulse

Project Summary

Summary

Paper2Any converts research papers, text, or topics into editable visual assets such as figures, technical diagrams, and presentation slides. It targets researchers and technical professionals who need to produce academic and presentation materials quickly.

How It Works

This project employs AI-powered multimodal workflows to process inputs such as PDF documents, screenshots, or plain text. It leverages specialized agents and toolkits to generate editable outputs including scientific figures (model architectures, roadmaps, plots), presentation slides, and video scripts. The approach emphasizes one-click generation and editable outputs in formats like PPTX and SVG, aiming for speed and user-friendliness.

Quick Start & Requirements

Installation via Conda is recommended, with Python 3.11 on Linux or 3.12 on Windows. The main steps are cloning the repository, installing the base dependencies (requirements-base.txt), and then installing the Paper2Any-specific dependencies (requirements-paper.txt). External requirements include a LaTeX engine (tectonic, installed via conda) and, on Ubuntu, the system packages inkscape, libreoffice, poppler-utils, and wkhtmltopdf. API keys are supplied through environment variables, GPU configuration is optional, and Supabase credentials are required for both the frontend and backend services. Native Windows installation is less recommended than Linux or WSL. An online demo is available at http://dcai-paper2any.nas.cpolar.cn/.
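
Because the setup depends on several external tools, a quick pre-flight check can save a failed run later. The sketch below is not part of Paper2Any. It is a minimal, hypothetical helper that looks for the executables the packages above are assumed to provide and compares the interpreter version with the recommended one. The executable names (pdftoppm for poppler-utils, soffice as an alternative name for libreoffice) and the function names are assumptions made for illustration.

```python
import shutil
import sys

# Package name -> executables assumed to be installed by that package.
# Assumption: poppler-utils is detected through "pdftoppm", and LibreOffice
# may be exposed as either "libreoffice" or "soffice".
REQUIRED_TOOLS = {
    "tectonic": ("tectonic",),
    "inkscape": ("inkscape",),
    "libreoffice": ("libreoffice", "soffice"),
    "poppler-utils": ("pdftoppm",),
    "wkhtmltopdf": ("wkhtmltopdf",),
}


def missing_tools(required=REQUIRED_TOOLS, which=shutil.which):
    """Return the package names for which no candidate executable is on PATH."""
    return [
        package
        for package, candidates in required.items()
        if not any(which(name) for name in candidates)
    ]


def python_version_ok(platform=sys.platform, version=sys.version_info):
    """Compare major.minor with the recommendation: 3.12 on Windows, else 3.11."""
    expected = (3, 12) if platform.startswith("win") else (3, 11)
    return tuple(version[:2]) == expected


if __name__ == "__main__":
    for package in missing_tools():
        print(f"missing: {package}")
    if not python_version_ok():
        print("note: Python version differs from the recommended one")
```

Passing `which` and `platform` as parameters keeps the check easy to exercise without touching the real system. The script does not verify API keys or Supabase credentials, since the summary does not name the variables involved.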

Highlighted Details

  • Paper2Figure: Generates editable scientific figures (model diagrams, technical roadmaps, experimental plots) from paper content.
  • Paper2PPT: Creates editable slide decks from papers or text, supporting long documents and intelligent content extraction.
  • PDF2PPT: Offers layout-preserving conversion of PDFs into editable PPTX files.
  • PPT Smart Beautification: Applies AI-based optimization and style transfer to presentation slides.

Maintenance & Community

The project is undergoing a significant architectural split, separating multimodal paper workflows into this repository (Paper2Any) and a general-purpose multi-agent dataflow framework into a new repository (DataFlow-Agent). Recent updates (Jan 2026) include feature enhancements like Image2PPT, API standardization, and dynamic model selection. Community engagement is facilitated via a WeChat group.

Licensing & Compatibility

The project is licensed under the Apache License 2.0. This permissive license allows commercial use and integration into closed-source projects, provided the license text and attribution notices are retained.

Limitations & Caveats

Native Windows installation is less recommended than Linux or WSL, suggesting potential setup complexities or performance issues. The ongoing architectural split into two distinct repositories may impact the long-term development focus and integration of features within this specific repo. Setup requires a substantial number of external dependencies and configuration steps, including Supabase integration.

Health Check

  • Last Commit: 1 day ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 25
  • Issues (30d): 5
  • Star History: 647 stars in the last 30 days
