LLM-scientific-feedback by Weixin-Liang

Research paper assessing LLM feedback on scientific papers

Created 2 years ago

526 stars

Top 60.0% on SourcePulse

Project Summary

This repository provides the Python code for an empirical analysis investigating the utility of Large Language Models (LLMs) in providing feedback on research papers. It targets researchers seeking to understand LLM capabilities in scientific review, offering insights into how LLM feedback compares to human feedback and user perceptions.

How It Works

The project utilizes an automated pipeline powered by GPT-4 to generate comments on full research paper PDFs. It evaluates feedback quality through two large-scale studies: quantitative comparison with human peer reviewer feedback across 15 Nature family journals and the ICLR conference, and a prospective user study with 308 researchers. The approach aims to quantify the overlap between LLM and human feedback points and gauge user satisfaction.

Quick Start & Requirements

Install/Run: Requires setting up a PDF parsing server and an LLM feedback server.
- PDF Parsing Server: python -m sciencebeam_parser.service.server --port=8080 (Requires conda activate ScienceBeam after conda env create -f conda_environment.yml).
- LLM Feedback Server: conda create -n llm python=3.10, conda activate llm, pip install -r requirements.txt, echo "YOUR_OPENAI_API_KEY" > key.txt, then run python main.py or python main_from_text.py.
Prerequisites:
- PDF Parsing Server: x86 Linux operating system.
- LLM Feedback Server: Python 3.10, OpenAI API key.
Setup: The README indicates feedback generation takes approximately 120 seconds.
Links: PDF, Twitter, GPT Store

Highlighted Details

Overlap with human feedback: 30.85% (Nature journals), 39.23% (ICLR).
Higher overlap with weaker papers (43.80% for rejected ICLR papers).
User study: 57.4% found feedback helpful/very helpful; 82.4% found it more beneficial than some human reviewers.
LLM limitations identified: focus on specific aspects, struggles with in-depth method critique.

Maintenance & Community

The project is associated with authors from multiple institutions, including the University of Michigan. Further community or maintenance details are not explicitly provided in the README.

Licensing & Compatibility

The repository's license is not explicitly stated in the README. Code is provided for research purposes.

Limitations & Caveats

The ScienceBeam PDF parser is explicitly stated to support only x86 Linux operating systems. The LLM feedback generation may focus on certain aspects and struggle with in-depth methodological critique.

LLM-scientific-feedback by Weixin-Liang

Explore Similar Projects

gpt3-list by swyxio

Awesome-LLM-as-a-judge by llm-as-a-judge

GPT-Fathom by GPT-Fathom

LLMsStudy by XingYu-Zhong

CodeLLMPaper by PurCL

llm-paper-daily by xianshang33

PandaLM by WeOpenML

Awesome-LLM-for-RecSys by CHIANGEL

evaluation-guidebook by huggingface

ChatReviewer by nishiwen1214

cs-paper-checklist by yzhao062

ai-pr-reviewer by coderabbitai