Paper2Any by OpenDCAI

AI transforms research papers into editable figures and slides

Created 4 months ago

1,529 stars

Top 27.0% on SourcePulse

Summary

Paper2Any turns research papers, plain text, or topics into editable visual assets such as figures, technical diagrams, and presentation slides. It targets researchers and technical professionals who need to produce academic and presentation materials quickly.

How It Works

The project runs AI-powered multimodal workflows over inputs such as PDF documents, screenshots, or plain text. Specialized agents and toolkits generate editable outputs, including scientific figures (model architectures, roadmaps, plots), presentation slides, and video scripts. The design emphasizes one-click generation and editable output formats such as PPTX and SVG.

Quick Start & Requirements

Installation is recommended via Conda with Python 3.11 (Linux) or 3.12 (Windows). The key steps are cloning the repository, installing the base dependencies (requirements-base.txt), and then installing the Paper2Any-specific dependencies (requirements-paper.txt). External requirements include a LaTeX engine (tectonic, installed via conda) and, on Ubuntu, the system packages inkscape, libreoffice, poppler-utils, and wkhtmltopdf. API keys must be supplied through environment variables, and GPU configuration is optional. Supabase credentials are required for both the frontend and backend services. Linux or WSL is recommended over native Windows. An online demo is available at http://dcai-paper2any.nas.cpolar.cn/.
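Because setup depends on several external tools, a quick preflight check can save time before installing the Python dependencies. The following is a minimal sketch, not part of Paper2Any: the `missing_tools` helper is hypothetical, and the executable names mapped to each package (for example `soffice` for libreoffice and `pdftoppm` for poppler-utils) are assumptions about what those packages typically install, not names taken from the project's documentation.

```python
import shutil
from typing import Callable, Dict, List, Optional

# Package name -> an executable that package normally provides.
# These executable names are assumptions, not taken from the Paper2Any docs.
REQUIRED_TOOLS: Dict[str, str] = {
    "tectonic": "tectonic",
    "inkscape": "inkscape",
    "libreoffice": "soffice",
    "poppler-utils": "pdftoppm",
    "wkhtmltopdf": "wkhtmltopdf",
}


def missing_tools(
    which: Callable[[str], Optional[str]] = shutil.which,
) -> List[str]:
    """Return the package names whose executable is not found on PATH."""
    return [pkg for pkg, exe in REQUIRED_TOOLS.items() if which(exe) is None]


if __name__ == "__main__":
    missing = missing_tools()
    if missing:
        print("Missing external tools: " + ", ".join(missing))
    else:
        print("All external tools found on PATH.")
```

Running the script inside the activated Conda environment lists any packages that still need to be installed. The `which` parameter exists only so the lookup can be replaced in tests.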

Highlighted Details

Paper2Figure: Generates editable scientific figures (model diagrams, technical roadmaps, experimental plots) from paper content.
Paper2PPT: Creates editable slide decks from papers or text, supporting long documents and intelligent content extraction.
PDF2PPT: Offers layout-preserving conversion of PDFs into editable PPTX files.
PPT Smart Beautification: Applies AI-based optimization and style transfer to presentation slides.

Maintenance & Community

The project is undergoing a significant architectural split: the multimodal paper workflows stay in this repository (Paper2Any), while the general-purpose multi-agent dataflow framework moves to a new repository (DataFlow-Agent). Recent updates (Jan 2026) include Image2PPT, API standardization, and dynamic model selection. Community discussion takes place in a WeChat group.

Licensing & Compatibility

The project is licensed under the Apache License 2.0. This permissive license allows commercial use and integration into closed-source projects, provided its attribution and notice requirements are met.

Limitations & Caveats

Linux or WSL is recommended over native Windows, which suggests that native Windows setups may be harder to configure or may perform worse. The ongoing split into two repositories may affect the long-term development focus and feature integration of this repo. Setup involves a substantial number of external dependencies and configuration steps, including Supabase integration.

Explore Similar Projects

AIA-Academic-Illustrator- by qwwzdyj

AutoFigure-Edit by ResearAI

peacasso by victordibia

Auto-Slides by Westlake-AGI-Lab

NextCreator by MoonWeSif

long_stable_diffusion by sharonzhou

Nano-PDF by gavrielc

TrainPPTAgent by johnson7788

DeepSeek-OCR-Web by fufankeji

NanoBananaEditor by markfulton

ppt-master by hugohe3

Paper2Slides by HKUDS