Ali Soroush
Relevant Works
Most-cited publications in Health & MedTech
Evaluating large language models on medical evidence summarization
2023 · 349 citations · npj Digital Medicine
Comparing ChatGPT and GPT-4 performance in USMLE soft skill assessments
2023 · 316 citations · Scientific Reports
Large Language Models Are Poor Medical Coders — Benchmarking of Medical Code Querying
2024 · 116 citations · NEJM AI
Evaluating and addressing demographic disparities in medical large language models: a systematic review
2025 · 56 citations · International Journal for Equity in Health
Large language model uncertainty proxies: discrimination and calibration for medical diagnosis and treatment
2024 · 46 citations · Journal of the American Medical Informatics Association
Large language models: a primer and gastroenterology applications
2024 · 43 citations · Therapeutic Advances in Gastroenterology
Extracting International Classification of Diseases Codes from Clinical Documentation Using Large Language Models
2024 · 14 citations · Applied Clinical Informatics
Generative Artificial Intelligence in Clinical Medicine and Impact on Gastroenterology
2025 · 12 citations · Gastroenterology
Assessing GPT-3.5 and GPT-4 in Generating International Classification of Diseases Billing Codes
2023 · 11 citations · medRxiv
Large Language Model Uncertainty Measurement and Calibration for Medical Diagnosis and Treatment
2024 · 9 citations · medRxiv
Evaluating and Addressing Demographic Disparities in Medical Large Language Models: A Systematic Review
2024 · 8 citations · medRxiv
General purpose large language models match human performance on gastroenterology board exam self-assessments
2023 · 7 citations · medRxiv
Natural Language Programming in Medicine: Administering Evidence Based Clinical Workflows with Autonomous Agents Powered by Generative Large Language Models
2024 · 7 citations · arXiv (Cornell University)
Vision-Language and Large Language Model Performance in Gastroenterology: GPT, Claude, Llama, Phi, Mistral, Gemma, and Quantized Models
2024 · 5 citations · arXiv (Cornell University)
Multimodal Large Language Model Passes Specialty Board Examination and Surpasses Human Test-Taker Scores: A Comparative Analysis Examining the Stepwise Impact of Model Prompting Strategies on Performance
2024 · 4 citations · medRxiv