Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Dual perspectives on large language models in rheumatology: physician-rated quality and patient-centered usability of GPT-4o versus DeepSeek-V3
0
Zitationen
4
Autoren
2026
Jahr
Abstract
OBJECTIVES: This study conducted an informatics system evaluation of two LLMs (GPT-4o and DeepSeek-V3) for patient education, combining clinician-rated quality with patient-perceived usability across thematically stratified queries. MATERIALS AND METHODS: In a blinded, within-subject design, 16 frequently asked questions about biologic therapies were categorized into three domains: treatment/drug selection, safety/adverse effects, and special conditions/daily life. Responses were standardized, generated without external retrieval, anonymized as A/B pairs. Thirty physicians assessed clinical appropriateness, scientific accuracy, comprehensiveness, while 60 patients rated readability, understandability, actionability, perceived adequacy, decision support, and trust on 5-point Likert scales. Analyses included paired t-tests, Holm/FDR corrections and two one-sided tests (TOST) to distinguish statistical non-difference from practical equivalence. RESULTS: < .001), while readability, adequacy, trust, and reading time were statistically and clinically equivalent. CONCLUSION: Findings highlight the need for topic-aware governance: guideline-dense queries suited to retrieval-augmented generation and checklist compliance, and context-sensitive queries requiring uncertainty signaling and human oversight. This layered approach advances health informatics by defining where LLMs may substitute versus where they require verification, supporting safe and auditable integration into patient education.
Ähnliche Arbeiten
The american rheumatism association 1987 revised criteria for the classification of rheumatoid arthritis
1988 · 19.887 Zit.
2010 Rheumatoid arthritis classification criteria: An American College of Rheumatology/European League Against Rheumatism collaborative initiative
2010 · 9.482 Zit.
Validation study of WOMAC: a health status instrument for measuring clinically important patient relevant outcomes to antirheumatic drug therapy in patients with osteoarthritis of the hip or knee.
1988 · 7.827 Zit.
Revised Criteria for the Classification of Rheumatoid Arthritis
1990 · 7.735 Zit.
Development of criteria for the classification and reporting of osteoarthritis: Classification of osteoarthritis of the knee
1986 · 6.750 Zit.