This is an overview page with metadata for this scientific paper. The full article is available from the publisher.
Evaluating the Performance of LLMs in ICF Classification: Insights from Medical and General Models
Citations: 0
Authors: 4
Year: 2025
Abstract
In the medical field, text data comprising personal anecdotes and detailed patient insights are often underutilized due to their unstructured nature and variability among clinicians. However, recent advances in Large Language Models (LLMs) present an opportunity to harness this data effectively. This paper explores the use of the International Classification of Functioning, Disability, and Health (ICF) framework recommended by the World Health Organization (WHO), which offers a holistic approach considering personal and environmental factors along with impairments, to structure textual descriptions systematically. The study investigates the application of medically fine-tuned LLMs, such as MedAlpaca and Meditron, for automated ICF creation, comparing their efficiency in processing real medical cases from two distinct contexts: rehabilitation and intensive care units. Additionally, we benchmark medical LLMs against general-purpose LLMs, including ChatGPT and Claude, to assess whether specialized models truly offer an advantage in medical classification tasks. Preliminary findings indicate that while medical LLMs show potential for ICF classification tasks, they may not necessarily outperform general-purpose models, as the complexity of ICF requires a deeper level of contextual understanding.
Keywords: International classification of functioning, disability and health (ICF), Intensive care, large language models, AI, medical, LLM, Rehabilitation.
Similar works
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8,402 citations
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8,270 citations
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7,702 citations
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5,781 citations
Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
2018 · 5,507 citations