Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Peer Review of “The Performance of DeepSeek R1 and Gemini 3 in Complex Medical Scenarios: Comparative Study”
1
Zitationen
1
Autoren
2026
Jahr
Abstract
This paper [1] seeks to evaluate the accuracy of DeepSeek R1 in correctly identifying the primary medical diagnosis in the medical scenarios dataset portion of Massive Multitask Language Understanding Pro (MMLU-Pro) using an open-ended format.Some clarifications on the methods and results (especially around the roles of subject matter experts vs core team members in the publication), would be helpful in understanding how these results were derived.
Ähnliche Arbeiten
2022 · 19.590 Zit.
MIMIC-III, a freely accessible critical care database
2016 · 8.050 Zit.
Clarifying Confusion: The Confusion Assessment Method
1990 · 5.259 Zit.
The impact of the MIT-BIH Arrhythmia Database
2001 · 4.586 Zit.
A model for types and levels of human interaction with automation
2000 · 3.751 Zit.