Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Evaluation of large language models in clinical neuroanatomy: a comparative scoring analysis based on accuracy, concordance, insight, and anatomical terminology accuracy
0
Zitationen
1
Autoren
2026
Jahr
Abstract
ChatGPT-4 demonstrates strong and stable diagnostic performance in neuroanatomical cases, with high accuracy and precise anatomical language. Gemini 2.5 shows potential, but is more sensitive to prompt variations and performs inconsistently in complex scenarios. Structured scoring frameworks like ACI and ATA offer valuable tools for evaluating LLMs in both clinical and educational settings.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.521 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.412 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.891 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.
Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
2018 · 5.575 Zit.