Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
From ChatGPT to DeepSeek: assessing the accuracy and readability of AI-generated erectile dysfunction responses
0
Zitationen
5
Autoren
2026
Jahr
Abstract
This study aims to compare responses generated by DeepSeek, a new large language model, and ChatGPT to erectile dysfunction (ED)–related questions. The study was conducted by posing online queries to both ChatGPT-4o (OpenAI, United States) and DeepSeek V3 (Hangzhou DeepSeek Artificial Intelligence, China). The most frequently asked questions about ED were identified using Google Trends. The responses from both artificial intelligence (AI) were evaluated by three board-certified urologists, who rated their accuracy on a scale of 1 to 4. Readability was assessed using the Flesch Reading Ease Score (FRES), Flesch-Kincaid Grade Level (FKGL), and Gunning Fog Score (GFS). DeepSeek V3 received a significantly higher total reviewer score compared to ChatGPT-4o (4 (IQR 1) vs. 3 (IQR 1); p = 0.016), and its responses contained more words on average (233 (IQR 113) vs. 139 (IQR 99), p = 0.004). While no significant difference was observed in FRES (-5.7 (IQR 17.5) vs. -10.1 (IQR 17.7), p = 0.140), both FKGL ( 16.5 (IQR 1.9) vs. 17.9 (IQR 2.7), p = 0.034) and GFS (18.8 (IQR 4.9) vs.20.6 (IQR 5.8), p = 0.016) scores were significantly lower for DeepSeek-V3, indicating superior readability. While both ChatGPT-4o and DeepSeek-V3 generated fluent and readable responses, DeepSeek-V3 consistently provided longer, more comprehensive, and highly readable answers,accompanied by higher expert-rated accuracy to ED-related questions. These findings highlight the potential of newer AI models—driven by rapid competitive advancements—to effectively address patient inquiries in sensitive medical domains like ED.
Ähnliche Arbeiten
The Female Sexual Function Index (FSFI): A Multidimensional Self-Report Instrument for the Assessment of Female Sexual Function
2000 · 6.428 Zit.
The international index of erectile function (IIEF): a multidimensional scale for assessment of erectile dysfunction
1997 · 5.801 Zit.
Sexual Dysfunction in the United States
1999 · 5.098 Zit.
Impotence and Its Medical and Psychosocial Correlates: Results of the Massachusetts Male Aging Study
1994 · 5.078 Zit.
Efficacy and Safety of Tadalafil Monotherapy for Lower Urinary Tract Symptoms Secondary to Benign Prostatic Hyperplasia: A Meta-Analysis
2013 · 4.542 Zit.