OpenAlex · Aktualisierung stündlich · Letzte Aktualisierung: 10.04.2026, 02:08

Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.

P309 Reliability concerns: can AI interpret nuanced medical and ethical scenarios in the field of gastroenterology

2024·0 Zitationen·Poster presentations
Volltext beim Verlag öffnen

0

Zitationen

12

Autoren

2024

Jahr

Abstract

<h3>Introduction</h3> AI tools like ChatGPT and Google Bard are gaining traction in healthcare, notably gastroenterology, offering benefits such as vast data knowledge, swift responses, and easy access. However, their reliability in medical and ethical decision-making is uncertain. They provide information effectively but cannot fully emulate the nuanced understanding and empathy of human medical professionals. Especially in ethical decisions, which demand comprehension of personal and contextual factors, these AI tools should serve as adjuncts, not replacements, to expert human judgment <h3>Methods</h3> The study evaluated the medical and ethical dependability of two widely-used chatbots, ChatGPT, and Google BARD, within the gastroenterology sphere. A questionnaire was administered to both bots, with their responses being rated using a 1–10 Likert scale where 1 indicated exceptional accuracy. To ensure unbiased evaluation, two independent assessors analyzed each bot’s answers. The goal was to systematically evaluate the chatbots’ competencies and trustworthiness using this performance review. The involvement of dual evaluators and the application of the Likert scale aimed to mitigate any potential bias, therefore strengthening the validity of the findings <h3>Results</h3> Our study compared the dependability of ChatGPT and Google BARD in medical management scenarios. ChatGPT scored 21% (p &lt; 0.01), and Google BARD scored 19% (p=0.022) in terms of reliability when juxtaposed with standardized practices. Among the chatbots, ChatGPT had a higher score relative to Google BARD (67% vs. 41%, p=0.034). However, both chatbots’ reliability scores were inferior compared to standard practice. This underscores the importance of reliability in developing gastroenterology-focused chatbots and the need for ongoing research and improvements in this field <h3>Discussion</h3> Despite potential benefits, AI tools like ChatGPT and Google Bard currently fall short in assisting medical and ethical decisions in gastroenterology, as shown by lower reliability scores against standardized guidelines. Although ChatGPT marginally outperformed Google Bard, both fail to match the nuanced understanding and empathy of human healthcare professionals. This underlines the crucial need for AI dependability and the importance of ongoing research to enhance these technologies, ensuring they support, not supplant, human judgment in decision-making

Ähnliche Arbeiten