Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Towards Conversational Medical AI with Eyes, Ears and a Voice
0
Zitationen
53
Autoren
2026
Jahr
Abstract
The practice of medicine relies not only upon skillful dialogue but also on the nuanced exchange and interpretation of rich auditory and visual cues between doctors and patients. Building on the low-latency voice and video processing capabilities of Gemini, we introduce AI co-clinician, a first-of-its-kind conversational AI system utilizing continuous streams of audio-visual data from live patient conversations to inform real-time clinical decisions. Its dual-agent architecture balances deep clinical reasoning with the low latency required for natural dialogue. To assess this system, we implemented a video-based interface emulating telemedicine consultations. We crafted 20 standardized outpatient scenarios requiring proactive real-time auditory and visual reasoning and designed "TelePACES" evaluation criteria alongside case-specific rubrics. In a randomized, interface-blinded, crossover simulation study (n = 120 encounters) with 10 internal medicine residents as patient actors, we compared AI co-clinician with primary care physicians (PCPs), GPT-Realtime, and a baseline agent. AI co-clinician approached PCPs in key TelePACES dimensions, including management plans and differential diagnosis, while significantly outperforming GPT-Realtime across all general criteria. While our agent demonstrated parity with PCPs in case-specific triage measures, physicians maintained superior overall performance in case-specific assessments. Although AI co-clinician marks a significant advance in real-time telemedical AI, gaps remain in physical examination and disease-specific reasoning. Our work shows that text-only approaches fail to capture the true challenges of medical consultation and suggests that high-stakes real-time diagnostic AI is most safely advanced in collaborative, triadic models where AI can be a supportive co-clinician for doctors and patients.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.778 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.690 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 8.259 Zit.
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
2019 · 6.901 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.
Autoren
- Meet Shah
- Jason Gusdorf
- Anil Palepu
- Chunjong Park
- Jack W. O'Sullivan
- Vishnu Ravi
- Tim Strother
- Pavel Dubov
- Aliya Rysbek
- Toshiyuki Fukuzawa
- Yana Lunts
- Jan Freyberg
- Michael B. Chang
- Aniruddh Raghu
- David Stutz
- Devora Berlowitz
- Eliseo Papa
- Taylan Cemgil
- JD Velasquez
- Jack Chen
- Arthur Chen
- Doug Fritz
- Charlie Taylor
- Katya Tregubova
- Jing Rong Lim
- Richard Green
- Sara Mahdavi
- Mahvish Nagda
- Jihyeon Lee
- Craig Schiff
- Liviu Panait
- Sukhdeep Singh
- Valentin Liévin
- David G. T. Barrett
- Hannah Gladman
- Anna Cupani
- Francesca Pietra
- Uchechi Okereke
- Katherine Tong
- Clemens Meyer
- Erwan Rolland
- Mili Sanwalka
- Michael D. Howell
- Shixiang Gu
- Bibo Xu
- Euan A. Ashley
- S. M. Ali Eslami
- Gregory Wayne
- Pushmeet Kohli
- Vivek Natarajan
- Adam Rodman
- Alan Karthikesalingam
- Ryutaro Tanno