This is an overview page with metadata for this scientific paper. The full article is available from the publisher.
Performances of five large language models in clinical decision-making for internal medicine: A comparative study
Citations: 0
Authors: 3
Year: 2026
Abstract
Among the LLMs evaluated, GPT, O1, and Gemini demonstrated superior performance in clinical decision-making for internal medicine, whereas Claude showed the poorest performance. All LLMs demonstrated deficiencies in differential diagnosis and poor management of respiratory diseases. Subspecialty complexity might be a performance differentiator for LLMs, and O1 might be suitable for complex subspecialties such as cardiology.
Similar works
The Strengths and Difficulties Questionnaire: A Research Note
1997 · 14,611 citations
Making sense of Cronbach's alpha
2011 · 13,863 citations
QUADAS-2: A Revised Tool for the Quality Assessment of Diagnostic Accuracy Studies
2011 · 13,657 citations
A method for estimating the probability of adverse drug reactions
1981 · 11,485 citations
Evidence-Based Medicine
1992 · 4,153 citations