Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Diagnostic Performance of ChatGPT-4o in Analyzing Oral Mucosal Lesions: A Comparative Study with Experts
6
Zitationen
20
Autoren
2025
Jahr
Abstract
<i>Background and Objectives:</i> this pilot study aimed to evaluate the diagnostic accuracy of ChatGPT-4o in analyzing oral mucosal lesions from clinical images. <i>Materials and Methods</i>: a total of 110 clinical images, including 100 pathological lesions and 10 healthy mucosal images, were retrieved from Google Images and analyzed by ChatGPT-4o using a standardized prompt. An expert panel of five clinicians established a reference diagnosis, categorizing lesions as benign or malignant. The AI-generated diagnoses were classified as correct or incorrect and further categorized as plausible or not plausible. The accuracy, sensitivity, specificity, and agreement with the expert panel were analyzed. The Artificial Intelligence Performance Instrument (AIPI) was used to assess the quality of AI-generated recommendations. <i>Results:</i> ChatGPT-4o correctly diagnosed 85% of cases. Among the 15 incorrect diagnoses, 10 were deemed plausible by the expert panel. The AI misclassified three malignant lesions as benign but did not categorize any benign lesions as malignant. Sensitivity and specificity were 91.7% and 100%, respectively. The AIPI score averaged 17.6 ± 1.73, indicating strong diagnostic reasoning. The McNemar test showed no significant differences between AI and expert diagnoses (<i>p</i> = 0.084). <i>Conclusions:</i> In this proof-of-concept pilot study, ChatGPT-4o demonstrated high diagnostic accuracy and strong descriptive capabilities in oral mucosal lesion analysis. A residual 8.3% false-negative rate for malignant lesions underscores the need for specialist oversight; however, the model shows promise as an AI-powered triage aid in settings with limited access to specialized care.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.400 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.261 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.695 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.
Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
2018 · 5.506 Zit.
Autoren
- Luigi Angelo Vaira
- Jérôme R. Lechien
- Antonino Maniaci
- Andrea De Vito
- Miguel Mayo‐Yáñez
- Stefania Troise
- Giuseppe Consorti
- Carlos M. Chiesa‐Estomba
- Giovanni Cammaroto
- Thomas Radulesco
- Arianna Di Stadio
- Alessandro Tel
- Andrea Frosolini
- Guido Gabriele
- Giannicola Iannella
- Alberto Maria Saibene
- Paolo Boscolo‐Rizzo
- Giovanna Soro
- Giovanni Salzano
- Giacomo De Riu
Institutionen
- University of Sassari(IT)
- University of Mons(BE)
- Université de Poitiers(FR)
- Università degli Studi di Enna Kore(IT)
- Complexo Hospitalario Universitario A Coruña(ES)
- University of Naples Federico II(IT)
- Marche Polytechnic University(IT)
- Ospedali Riuniti Umberto I(IT)
- Biogipuzkoa Health Research Institute(ES)
- Ospedale G.B. Morgagni - L.Pierantoni(IT)
- Centre National de la Recherche Scientifique(FR)
- Aix-Marseille Université(FR)
- Hôpital de la Conception(FR)
- University of Catania(IT)
- University of Udine(IT)
- University of Siena(IT)
- Sapienza University of Rome(IT)
- AOL (United States)(US)
- Ospedale San Paolo(IT)
- University of Trieste(IT)