Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Responses of ChatGPT-4 on intraocular lenses: an evolving artificial intelligence assessment
1
Zitationen
2
Autoren
2025
Jahr
Abstract
CLINICAL RELEVANCE: Recently, studies have been carried out on the accuracy of large language models, which are increasingly used, but the reproducibility of the answers given by these language models is as important as their accuracy. BACKGROUND: To evaluate the accuracy and reproducibility of the responses of ChatGPT-4.0 to queries about intraocular lenses (IOLs) over a six-month interval. METHODS: This observational, cross-sectional study was conducted using 45 open-ended questions related to IOLs, categorised into three difficulty levels: easy, moderate, and difficult. Two evaluators independently posed these questions to ChatGPT-4.0 at two separate time points (April and October 2024) using the 'new conversation' feature. Responses were analysed for word count, character count, and plagiarism rates, and evaluated on a 5-point Likert scale for accuracy, relevance, and consistency. RESULTS: = 0.046). ICC analysis showed moderate consistency (0.639) overall, with excellent reproducibility (1.00) for easy questions, acceptable (0.788) for moderate, and moderate (0.550) for difficult questions. CONCLUSIONS: ChatGPT-4.0 demonstrated significant improvements in accuracy and response length over time. While the model exhibited high reproducibility for easy questions, its performance declined with increased question difficulty. Despite its limitations, ChatGPT-4.0 shows promise for use in medical information dissemination, highlighting the need for further studies to enhance its reliability and accuracy.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.774 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.685 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 8.244 Zit.
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
2019 · 6.898 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.