LLM-scientific-feedback  by Weixin-Liang

Research paper assessing LLM feedback on scientific papers

Created 2 years ago
526 stars

Top 60.0% on SourcePulse

GitHubView on GitHub
Project Summary

This repository provides the Python code for an empirical analysis investigating the utility of Large Language Models (LLMs) in providing feedback on research papers. It targets researchers seeking to understand LLM capabilities in scientific review, offering insights into how LLM feedback compares to human feedback and user perceptions.

How It Works

The project utilizes an automated pipeline powered by GPT-4 to generate comments on full research paper PDFs. It evaluates feedback quality through two large-scale studies: quantitative comparison with human peer reviewer feedback across 15 Nature family journals and the ICLR conference, and a prospective user study with 308 researchers. The approach aims to quantify the overlap between LLM and human feedback points and gauge user satisfaction.

Quick Start & Requirements

  • Install/Run: Requires setting up a PDF parsing server and an LLM feedback server.
    • PDF Parsing Server: python -m sciencebeam_parser.service.server --port=8080 (Requires conda activate ScienceBeam after conda env create -f conda_environment.yml).
    • LLM Feedback Server: conda create -n llm python=3.10, conda activate llm, pip install -r requirements.txt, echo "YOUR_OPENAI_API_KEY" > key.txt, then run python main.py or python main_from_text.py.
  • Prerequisites:
    • PDF Parsing Server: x86 Linux operating system.
    • LLM Feedback Server: Python 3.10, OpenAI API key.
  • Setup: The README indicates feedback generation takes approximately 120 seconds.
  • Links: PDF, Twitter, GPT Store

Highlighted Details

  • Overlap with human feedback: 30.85% (Nature journals), 39.23% (ICLR).
  • Higher overlap with weaker papers (43.80% for rejected ICLR papers).
  • User study: 57.4% found feedback helpful/very helpful; 82.4% found it more beneficial than some human reviewers.
  • LLM limitations identified: focus on specific aspects, struggles with in-depth method critique.

Maintenance & Community

The project is associated with authors from multiple institutions, including the University of Michigan. Further community or maintenance details are not explicitly provided in the README.

Licensing & Compatibility

The repository's license is not explicitly stated in the README. Code is provided for research purposes.

Limitations & Caveats

The ScienceBeam PDF parser is explicitly stated to support only x86 Linux operating systems. The LLM feedback generation may focus on certain aspects and struggle with in-depth methodological critique.

Health Check
Last Commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
2 stars in the last 30 days

Explore Similar Projects

Feedback? Help us improve.