Paper reviewer for auto-generating blog posts
Top 44.5% on SourcePulse
This project provides tools that automatically generate comprehensive reviews of arXiv and OpenReview papers and convert them into blog posts. It is aimed at researchers, academics, and content creators who want to streamline summarizing and disseminating scientific literature. It powers Hugging Face's Daily Papers and NeurIPS 2024 web pages.
How It Works
The system uses two Python scripts: collect.py gathers paper data and generates reviews, and convert.py transforms those reviews into blog posts. collect.py can use different backends for visual information extraction, either Upstage (paid) or Gemini (best-effort), and supports GPU acceleration via MinerU when configured for it. convert.py then applies a fixed template to structure the review as a blog post, with an option to upload images to Cloudflare R2.
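The two-step flow can be sketched as a small driver that builds the command lines for both scripts. This is a minimal sketch, not part of the project: the helper names (build_collect_cmd, build_convert_cmd, run_pipeline) are hypothetical, and only the script names and flags come from the Quick Start commands.

```python
import subprocess


def build_collect_cmd(arxiv_id, stop_at_no_html=False, use_upstage=False):
    """Build the argument list for collect.py (hypothetical helper)."""
    cmd = ["python", "collect.py", "--arxiv-id", arxiv_id]
    if stop_at_no_html:
        cmd.append("--stop-at-no-html")
    if use_upstage:
        cmd.append("--use-upstage")
    return cmd


def build_convert_cmd(arxiv_id, upload_images_r2=False):
    """Build the argument list for convert.py (hypothetical helper)."""
    cmd = ["python", "convert.py", "--arxiv-id", arxiv_id]
    if upload_images_r2:
        cmd.append("--upload-images-r2")
    return cmd


def run_pipeline(arxiv_id, use_upstage=False, upload_images_r2=False):
    """Run collect.py, then convert.py, stopping if either step fails."""
    # check=True raises CalledProcessError on a non-zero exit code,
    # so convert.py never runs on an incomplete review.
    subprocess.run(build_collect_cmd(arxiv_id, use_upstage=use_upstage), check=True)
    subprocess.run(build_convert_cmd(arxiv_id, upload_images_r2=upload_images_r2), check=True)
```

Calling run_pipeline with an arXiv ID from the repository root would run both steps in order. Passing the arguments as a list avoids shell quoting problems.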
Quick Start & Requirements
Install dependencies: pip install -r requirements.txt
Set the GEMINI_API_KEY environment variable (mandatory for Gemini).
Set UPSTAGE_API_KEY (optional, for Upstage document parsing).
Set the Cloudflare R2 variables (R2_ACCESS_KEY_ID, R2_SECRET_ACCESS_KEY, R2_S3_ENDPOINT_URL, R2_DOMAIN_NAME) for image uploads.
Install poppler-utils (Ubuntu: apt install poppler-utils, macOS: brew install poppler).
For GPU acceleration with MinerU, edit ~/magic-pdf.json to set "device-mode": "cuda".
Generate a review: python collect.py --arxiv-id "..." [--stop-at-no-html] [--use-upstage]
Convert it to a blog post: python convert.py --arxiv-id "..." [--upload-images-r2]
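Before running either script, it can help to confirm that the environment variables listed above are set. The following is a minimal sketch under assumptions: the function missing_vars is hypothetical, and the mapping of variables to flags (Upstage key only with --use-upstage, R2 variables only with --upload-images-r2) is a reading of the requirements above, not behavior confirmed by the project.

```python
import os

# Variable names as listed in the requirements above.
GEMINI_VARS = ("GEMINI_API_KEY",)
UPSTAGE_VARS = ("UPSTAGE_API_KEY",)
R2_VARS = (
    "R2_ACCESS_KEY_ID",
    "R2_SECRET_ACCESS_KEY",
    "R2_S3_ENDPOINT_URL",
    "R2_DOMAIN_NAME",
)


def missing_vars(env, use_upstage=False, upload_images_r2=False):
    """Return the names of needed variables that are unset or empty in env."""
    needed = list(GEMINI_VARS)
    if use_upstage:
        # Assumption: the Upstage key matters only when Upstage parsing is requested.
        needed.extend(UPSTAGE_VARS)
    if upload_images_r2:
        # Assumption: the R2 variables matter only when image upload is requested.
        needed.extend(R2_VARS)
    return [name for name in needed if not env.get(name)]
```

A typical call is missing_vars(os.environ, upload_images_r2=True); an empty list means every variable needed for that configuration is present.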
Highlighted Details
Maintenance & Community
The project is actively maintained and powers Hugging Face's Daily Papers and NeurIPS 2024 web pages. The README gives no further details on community engagement.
Licensing & Compatibility
The README does not specify a license. Compatibility for commercial use or closed-source linking is not detailed.
Limitations & Caveats
The accuracy of visual information extraction without Upstage is noted as best-effort. Customizing the blog post design requires manual modification of the template files. MinerU usage requires specific Python versions and configuration adjustments for GPU support.
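The GPU configuration adjustment for MinerU amounts to changing one key in a JSON file. Here is a minimal sketch, assuming the file already exists and is valid JSON; the function set_device_mode is hypothetical, and only the path ~/magic-pdf.json, the key "device-mode", and the value "cuda" come from the setup notes.

```python
import json
from pathlib import Path


def set_device_mode(config_path, mode="cuda"):
    """Set "device-mode" in an existing JSON config, keeping all other keys."""
    path = Path(config_path).expanduser()
    # read_text raises FileNotFoundError if the config is missing; this sketch
    # deliberately does not create one, since other keys may be required.
    config = json.loads(path.read_text(encoding="utf-8"))
    config["device-mode"] = mode
    path.write_text(json.dumps(config, indent=2) + "\n", encoding="utf-8")
    return config
```

Calling set_device_mode("~/magic-pdf.json") would switch the existing configuration to CUDA and leave every other setting in the file unchanged.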