Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Evaluating GPT-4’s role in critical patient management in emergency departments
0
Zitationen
4
Autoren
2025
Jahr
Abstract
INTRODUCTION: Recent advancements in artificial intelligence (AI) have introduced tools like ChatGPT-4, capable of interpreting visual data, including ECGs. In our study,we aimed to investigate the effectiveness of GPT-4 in interpreting ECGs and managing patient care in emergency settings. METHODS: Conducted from April to May 2024, this study evaluated GPT-4 using twenty case scenarios sourced from PubMed Central and the OSCE sample question book. These cases, categorized into common and rare scenarios, were analyzed by GPT-4, and its interpretations were reviewed by five experienced emergency medicine specialists. The accuracy of ECG interpretations and subsequent patient management plans were assessed using a structured evaluation framework and critical error identification. RESULTS: GPT-4 made critical errors in 46% of ECG interpretations in the OSCE group and 50% in the PubMed group. For patient management, critical errors were found in 32% of the OSCE group and 14% of the PubMed group. When ECG evaluations were included in patient management, error rates approached 50%. The inter-rater reliability among evaluators indicated good agreement (ICC = 0.725, F = 3.72, p < 0.001). CONCLUSION: While GPT-4 shows promise in specific applications, its current limitations in accurately interpreting ECGs and managing critical patient scenarios render it inappropriate for emergency department use. Future improvements and extensive validations are essential before such AI tools can be reliably deployed in critical healthcare settings.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.652 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.567 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 8.083 Zit.
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
2019 · 6.856 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.