AcademicAgent by Jennyee1

Multimodal academic agent for scholarly research and knowledge synthesis

Created 2 months ago

341 stars

Top 80.7% on SourcePulse

Project Summary

ScholarMind is a multimodal academic research agent designed for the LLM Agent domain. It assists users with tasks such as paper retrieval, PDF and chart comprehension, knowledge graph construction, learning path planning, and code reproduction. The agent integrates with MCP hosts, offering a streamlined workflow for researchers and developers seeking to manage and leverage academic knowledge efficiently.

How It Works

ScholarMind operates as a plugin within MCP (Model Context Protocol) hosts, leveraging a modular architecture. It employs dual-source search (Semantic Scholar, arXiv) and a PDF parser using PyMuPDF in generator mode to prevent Out-of-Memory errors. Core functionalities include multimodal understanding of figures and formulas, automatic knowledge graph construction constrained by Pydantic schemas and stored in NetworkX with a time dimension, and learning path planning informed by PageRank-based knowledge gap analysis. Code reproduction is handled via a secure subprocess sandbox.

Quick Start & Requirements

Installation involves cloning the repository, installing dependencies via pip install -r requirements.txt, and running the python install.py script for automated setup and configuration generation. Users must configure API keys (e.g., MINIMAX_API_KEY) in a .env file and merge the generated mcp_config.json into their MCP host's configuration (e.g., ~/.gemini/antigravity/mcp_config.json or ~/.claude/mcp_config.json).

Highlighted Details

Dual-source academic paper search (Semantic Scholar + arXiv) with automatic rate-limiting fallback.
Multimodal comprehension of PDF content, including charts, formulas, and system diagrams, with token usage control.
Automated personal academic knowledge graph construction using Pydantic schemas, NetworkX, and a time dimension (Zep).
Intelligent learning path recommendation driven by PageRank-based knowledge gap detection.
Secure code reproduction experiments executed within a subprocess sandbox.
Lightweight, file-system-based memory system inspired by Hermes, ReMe, MemU, and Zep.

Maintenance & Community

The provided README does not contain specific details regarding notable contributors, sponsorships, partnerships, or community channels (e.g., Discord, Slack).

Licensing & Compatibility

The project is released under the MIT license, which permits broad usage, including commercial applications and integration with closed-source projects.

Limitations & Caveats

The README does not explicitly detail known limitations, alpha status, or specific performance benchmarks. While multimodal understanding includes token control, specific performance trade-offs or unsupported modalities are not elaborated upon.

Health Check

Last Commit

1 month ago

Responsiveness

Inactive

Pull Requests (30d)

Issues (30d)

Star History

4 stars in the last 30 days