Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Large Language Model Agent for Managing Patients With Suspected Hypertension
7
Zitationen
11
Autoren
2025
Jahr
Abstract
BACKGROUND: The effectiveness of Large Language Model agent frameworks for hypertension screening and personalized health management has not been fully studied. This study aimed to develop and evaluate a Large Language Model–based Agent, called the Cascade Framework, and assess its effectiveness in hypertension education and clinical decision support. METHODS: The Cascade Framework was developed utilizing the Dify platform, and its performance was tested via a robust 2-phase evaluation protocol from August 2024 to June 2025. The first phase involved systematic performance benchmarking of 6 configurations: 3 foundational Large Language Models (Chat Generative Pretrained Transformer [ChatGPT]-4o, ChatGPT-4oMini, and DeepSeek-V3) and their respective Cascade-enhanced versions. The second phase included an external validation in a cohort of patients with suspected hypertension. RESULTS: Cascade integration yielded significant performance improvements across all models. For ChatGPT-4o, educational outcomes improved (Accuracy: 3.87→4.10, P =0.02; Comprehensiveness: 4.07→4.32, P =0.16; Credibility: 3.79→4.03, P <0.001; Understandability: 3.90→3.96, P =0.005; Emotional Support: 3.87→4.01, P <0.001). Blood pressure classification accuracy rose from 62.5% to 87.0% ( P <0.001) and risk factor stratification from 60.4% to 98.6% ( P <0.001). Clinical decision-making improved, with accuracy of 72.0% to 92.5%. A similar trend of performance improvement was observed in the external validation cohort, where the 4o-Cascade model achieved increases in blood pressure classification accuracy (58.9%→95.3%), risk stratification accuracy (71.0%→90.7%), and clinical decision appropriateness (66.4%→92.5%), all with P <0.001 and surpassing the performance of the 3 physicians. CONCLUSIONS: Cascade Framework can improve the management of hypertension. Its extensible architecture allows integration with existing clinical workflows while providing transparent reasoning pathways.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.774 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.685 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 8.244 Zit.
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
2019 · 6.898 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.
Autoren
Institutionen
- Sichuan University(CN)
- West China Hospital of Sichuan University(CN)
- First Affiliated Hospital of Bengbu Medical College(CN)
- Sun Yat-sen University(CN)
- Sun Yat-sen Memorial Hospital(CN)
- Wuhan University(CN)
- Renmin Hospital of Wuhan University(CN)
- Wannan Medical College(CN)
- Beijing Xuanwu Traditional Chinese Medicine Hospital(CN)
- Hubei University of Arts and Science(CN)
- Xiangyang Central Hospital(CN)
- Hubei University of Medicine(CN)
- Bengbu Medical College(CN)