GPT as a Reviewer: Automatic Evaluation of Academic Papers

Berfin Tas, Meltem Aksoy*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

—This study investigates the potential of using GPT models, specifically GPT-3.5 and GPT-4 variants, as automated reviewers in academic peer review processes. Experiments were conducted using the ACL-2017 dataset, employing both zero-shot learning and in-context learning techniques across various settings, including baseline, importance assignment, and persona assignment, with different prompt designs. These settings tested the models’ effectiveness in scoring based on predefined evaluation criteria, both with and without scoring thresholds. The results highlight how various prompt strategies, settings, and threshold applications influenced model performance. Among the models, GPT-4o and GPT-4o-mini showed particularly promising results. While GPT models performed well in certain areas, they still have limitations in fully capturing the complexities of peer review. Nevertheless, the findings suggest that GPT models can serve as a helpful tool to support human reviewers in the peer review process.

Original languageEnglish
Title of host publicationProceedings - 2025 11th International Conference on Computing and Artificial Intelligence, ICCAI 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages487-496
Number of pages10
ISBN (Electronic)9798331524913
DOIs
Publication statusPublished - 2025
Externally publishedYes
Event11th International Conference on Computing and Artificial Intelligence, ICCAI 2025 - Kyoto, Japan
Duration: 28 Mar 202531 Mar 2025

Publication series

NameProceedings - 2025 11th International Conference on Computing and Artificial Intelligence, ICCAI 2025

Conference

Conference11th International Conference on Computing and Artificial Intelligence, ICCAI 2025
Country/TerritoryJapan
CityKyoto
Period28/03/2531/03/25

Bibliographical note

Publisher Copyright:
©2025 IEEE.

Keywords

  • GPT
  • LLMs
  • Peer review
  • automatic scoring
  • prompt engineering

Fingerprint

Dive into the research topics of 'GPT as a Reviewer: Automatic Evaluation of Academic Papers'. Together they form a unique fingerprint.

Cite this