Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
GastroGPT: Development and controlled testing of a proof-of-concept customized clinical language model
3
Zitationen
14
Autoren
2025
Jahr
Abstract
Background and study aims: Current general-purpose artificial intelligence (AI) large language models (LLMs) demonstrate limited efficacy in clinical medicine, often constrained to question-answering, documentation, and literature summarization roles. We developed GastroGPT, a proof-of-concept specialty-specific, multi-task, clinical LLM, and evaluated its performance against leading general-purpose LLMs across key gastroenterology tasks and diverse case scenarios. Methods: In this structured analysis, GastroGPT was compared with three state-of-the-art general-purpose LLMs (LLM-A: GPT-4, LLM-B: Bard, LLM-C: Claude). Models were assessed on seven clinical tasks and overall performance across 10 simulated gastroenterology cases varying in complexity, frequency, and patient demographics. Standardized prompts facilitated structured comparisons. A blinded expert panel rated model outputs per task on a 10-point Likert scale, judging clinical utility. Comprehensive statistical analyses were conducted. Results: < 0.001). Conclusions: This study pioneered development and comparison of a specialty-specific, clinically-oriented AI model to general-purpose LLMs. GastroGPT demonstrated superior utility overall and on key gastroenterology tasks, highlighting the potential for tailored, task-focused AI models in medicine.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.644 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.550 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 8.061 Zit.
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
2019 · 6.850 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.
Autoren
Institutionen
- Johns Hopkins University(US)
- Hacettepe University(TR)
- Hospital General Universitario de Alicante Doctor Balmis(ES)
- University Hospital Augsburg(DE)
- Palacký University Olomouc(CZ)
- University of Liverpool(GB)
- Aintree University Hospitals NHS Foundation Trust(GB)
- Spitalul Clinic Colentina(RO)
- Azienda Ospedaliera Sant'Andrea(IT)
- Sapienza University of Rome(IT)
- Hospital Quirón Teknon(ES)
- Hospital Universitario de La Princesa(ES)
- IRCCS Humanitas Research Hospital(IT)
- University College Hospital(GB)
- University College London(GB)