Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Can large language models fully automate or partially assist paper selection in systematic reviews?
17
Zitationen
13
Autoren
2025
Jahr
Abstract
BACKGROUND/AIMS: Large language models (LLMs) have substantial potential to enhance the efficiency of academic research. The accuracy and performance of LLMs in a systematic review, a core part of evidence building, has yet to be studied in detail. METHODS: We introduced two LLM-based approaches of systematic review: an LLM-enabled fully automated approach (LLM-FA) utilising three different GPT-4 plugins (Consensus GPT, Scholar GPT and GPT web browsing modes) and an LLM-facilitated semi-automated approach (LLM-SA) using GPT4's Application Programming Interface (API). We benchmarked these approaches using three published systematic reviews that reported the prevalence of diabetic retinopathy across different populations (general population, pregnant women and children). RESULTS: The three published reviews consisted of 98 papers in total. Across these three reviews, in the LLM-FA approach, Consensus GPT correctly identified 32.7% (32 out of 98) of papers, while Scholar GPT and GPT4's web browsing modes only identified 19.4% (19 out of 98) and 6.1% (6 out of 98), respectively. On the other hand, the LLM-SA approach not only successfully included 82.7% (81 out of 98) of these papers but also correctly excluded 92.2% of 4497 irrelevant papers. CONCLUSIONS: Our findings suggest LLMs are not yet capable of autonomously identifying and selecting relevant papers in systematic reviews. However, they hold promise as an assistive tool to improve the efficiency of the paper selection process in systematic reviews.
Ähnliche Arbeiten
The PRISMA 2020 statement: an updated guideline for reporting systematic reviews
2021 · 90.426 Zit.
Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement
2009 · 83.073 Zit.
The Measurement of Observer Agreement for Categorical Data
1977 · 77.972 Zit.
Preferred Reporting Items for Systematic Reviews and Meta-Analyses: The PRISMA Statement
2009 · 63.550 Zit.
Measuring inconsistency in meta-analyses
2003 · 62.181 Zit.
Autoren
Institutionen
- Tsinghua University(CN)
- Peking University(CN)
- Singapore National Eye Center(SG)
- Singapore Eye Research Institute(SG)
- National University of Singapore(SG)
- Shanghai Jiao Tong University(CN)
- Sun Yat-sen University(CN)
- Beihang University(CN)
- Johns Hopkins University(US)
- Moorfields Eye Hospital(GB)
- University College London(GB)
- Beijing Tsinghua Chang Gung Hospital(CN)
- McMaster University(CA)
- Duke-NUS Medical School(SG)