Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
An Empirical Study Evaluating <scp>ChatGPT</scp>'s Performance in Generating Search Strategies for Systematic Reviews
4
Zitationen
3
Autoren
2024
Jahr
Abstract
ABSTRACT This study evaluated the performance of ChatGPT‐3.5 and ChatGPT‐4 in developing search strategies for systematic reviews. Using the Peer Review of Electronic Search Strategies (PRESS) framework, we employed a two‐round testing format for each version. In the first round, both versions displayed comparable competencies when assessed quantitatively by the PRESS measures. However, qualitative feedback from two professional health sciences librarians indicated that ChatGPT‐4 outperformed ChatGPT‐3.5, particularly in suggesting MeSH term inclusion and refining search strategy formulations. In the second round, prompts were refined based on the feedback from the previous round of testing. Both qualitative and quantitative evaluation results confirmed ChatGPT‐4's superiority. This study provides empirical evidence of advancements in language model capabilities, highlighting ChatGPT‐4's enhanced efficiency and accuracy in developing search strategies for systematic reviews.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.402 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.270 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.702 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.
Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
2018 · 5.507 Zit.