This is an overview page with metadata for this scientific paper. The full article is available from the publisher.
P309 Reliability concerns: can AI interpret nuanced medical and ethical scenarios in the field of gastroenterology
Citations: 0
Authors: 12
Year: 2024
Abstract
<h3>Introduction</h3> AI tools such as ChatGPT and Google Bard are gaining traction in healthcare, notably in gastroenterology, where they offer broad knowledge, fast responses, and easy access. Their reliability in medical and ethical decision-making, however, is uncertain. They deliver information effectively but cannot fully emulate the nuanced understanding and empathy of human medical professionals. Ethical decisions in particular demand an understanding of personal and contextual factors, so these tools should serve as adjuncts to expert human judgment, not replacements for it.
<h3>Methods</h3> The study evaluated the medical and ethical dependability of two widely used chatbots, ChatGPT and Google Bard, in gastroenterology. A questionnaire was administered to both chatbots, and their responses were rated on a 1–10 Likert scale on which 1 indicated exceptional accuracy. Two independent assessors rated each chatbot’s answers in order to reduce potential bias and strengthen the validity of the findings.
<h3>Results</h3> The study compared the dependability of ChatGPT and Google Bard in medical management scenarios. Measured against standardized practice, ChatGPT scored 21% (p < 0.01) and Google Bard scored 19% (p = 0.022) for reliability. Compared directly, ChatGPT scored higher than Google Bard (67% vs. 41%, p = 0.034). Both chatbots’ reliability scores were nevertheless inferior to standard practice. This underscores the importance of reliability in developing gastroenterology-focused chatbots and the need for ongoing research and improvement in this field.
<h3>Discussion</h3> Despite their potential benefits, AI tools such as ChatGPT and Google Bard currently fall short in supporting medical and ethical decisions in gastroenterology, as shown by their lower reliability scores against standardized guidelines. Although ChatGPT marginally outperformed Google Bard, neither matches the nuanced understanding and empathy of human healthcare professionals. This underlines the need for dependable AI and for ongoing research to improve these technologies, so that they support, not supplant, human judgment in decision-making.