This is an overview page with metadata for this scientific paper. The full article is available from the publisher.
Atlas-Assisted Bone Age Estimation from Hand–Wrist Radiographs Using Multimodal Large Language Models: A Comparative Study
Citations: 1
Authors: 2
Year: 2026
Abstract
Background/Objectives: Bone age assessment is critical in pediatric endocrinology and forensic medicine. Although recently developed multimodal large language models (LLMs) show potential in medical imaging, their diagnostic performance in bone age determination has not been sufficiently evaluated. This study evaluates the performance of four multimodal LLMs (ChatGPT-5, Gemini 2.5 Pro, Grok-3, and Claude 4 Sonnet) in bone age determination using the Gilsanz-Ratib (GR) atlas.

Methods: This retrospective study included 245 pediatric patients (109 male, 136 female) under the age of 18 who underwent left wrist radiography. Each model estimated bone age using the patient's radiograph and GR atlas as reference (atlas-assisted prompting). Bone age assessments made by an experienced radiologist using the GR atlas were evaluated as the reference standard. Performance was assessed using mean absolute error (MAE), intraclass correlation coefficient (ICC), and Bland-Altman analysis.

Results: ChatGPT-5 demonstrated statistically superior performance, with an MAE of 1.46 years and ICC of 0.849, showing the highest alignment with the reference standard. Gemini 2.5 Pro showed moderate performance, with an MAE of 2.24 years; Grok-3 (MAE: 3.14 years) and Claude 4 Sonnet (MAE: 4.29 years) had error rates that were too high for clinical use.

Conclusions: Significant performance differences exist among multimodal LLMs, despite atlas-supported prompting. Only ChatGPT-5 qualified as "clinically useful," demonstrating potential as an auxiliary tool or educational support under expert supervision. Other models' reliability remains insufficient.
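The abstract names MAE and Bland-Altman analysis as performance measures but does not describe how they were computed. The following is a minimal sketch of the standard definitions, not the authors' code: the function names are hypothetical, the sample values are invented for illustration and are not study data, and the 1.96 multiplier assumes the conventional 95% limits of agreement. ICC is omitted because the abstract does not state which ICC form was used.

```python
from statistics import mean, stdev


def mean_absolute_error(estimates, reference):
    """Mean of |estimate - reference| over paired bone ages (years)."""
    if len(estimates) != len(reference):
        raise ValueError("estimates and reference must have the same length")
    return mean(abs(e - r) for e, r in zip(estimates, reference))


def bland_altman_limits(estimates, reference):
    """Return (bias, lower limit, upper limit) of agreement.

    bias is the mean of the paired differences; the limits are
    bias +/- 1.96 * sample standard deviation of the differences.
    """
    if len(estimates) != len(reference):
        raise ValueError("estimates and reference must have the same length")
    diffs = [e - r for e, r in zip(estimates, reference)]
    bias = mean(diffs)
    spread = 1.96 * stdev(diffs)
    return bias, bias - spread, bias + spread


# Invented illustrative values in years (NOT data from the study).
reference_years = [5.0, 8.0, 10.0, 12.0, 15.0]  # radiologist reading
model_years = [6.0, 7.0, 12.0, 12.0, 14.0]      # model estimate

mae = mean_absolute_error(model_years, reference_years)
bias, lower, upper = bland_altman_limits(model_years, reference_years)
print(f"MAE: {mae:.2f} years")
print(f"Bias: {bias:.2f} years, limits of agreement: [{lower:.2f}, {upper:.2f}]")
```

MAE summarizes the typical size of the error regardless of direction, while the Bland-Altman bias shows whether a model systematically over- or underestimates relative to the reference reader.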
Similar works
A Draft Sequence of the Neandertal Genome
2010 · 4,495 citations
Standards for Data Collection from Human Skeletal Remains
1994 · 3,909 citations
Mitochondrial DNA and human evolution
1987 · 3,053 citations
Identification of Pathological Conditions in Human Skeletal Remains
2003 · 2,525 citations
The complete genome sequence of a Neanderthal from the Altai Mountains
2013 · 2,373 citations