Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Do segmentation metrics reflect clinical reality? A surgeon-centered evaluation in robot-assisted minimally invasive esophagectomy
1
Zitationen
9
Autoren
2025
Jahr
Abstract
Abstract Background Deep learning-based anatomy segmentation holds promise for improving real-time guidance in complex surgeries such as robot-assisted minimally invasive esophagectomy (RAMIE). However, the clinical relevance of commonly used metrics for evaluating segmentation quality remains unclear, as previous assessments have lacked direct input from surgeons. This study aims to assess how well quantitative segmentation metrics reflect surgeons’ assessments of anatomical overlay accuracy and clinical usefulness during RAMIE. Methods We conducted a survey involving 26 upper gastrointestinal surgeons, including both trainee and attending surgeons, who assessed video clips of RAMIE procedures featuring deep learning-generated anatomical overlays. We correlated the surgeons’ qualitative evaluations of annotation accuracy and clinical usefulness with a comprehensive set of quantitative metrics, including overlap, distance, temporal, and error-specific measures. The analysis encompassed over 8000 manually annotated frames from 12 video clips, with overlays generated by two state-of-the-art deep learning models. Results Overlap and temporal consistency metrics show the strongest correlation with surgeon assessments. Distance-based and error-specific metrics correlate moderately. Novices show weaker correlations and tend to rate overlays more leniently. Qualitative feedback reveals issues like hallucinations and instability, often missed by current metrics. Conclusion Standard quantitative metrics partially reflect surgeon perceptions but should be complemented by surgeon-informed evaluations and task-specific metrics to better capture clinically relevant errors. Aligning metric design with surgical expertise is essential for the safe and effective integration of AI-guided anatomical segmentation in the operating room. Graphical abstract
Ähnliche Arbeiten
Global cancer statistics, 2012
2015 · 27.340 Zit.
Human Papillomavirus and Survival of Patients with Oropharyngeal Cancer
2010 · 6.484 Zit.
Perioperative Chemotherapy versus Surgery Alone for Resectable Gastroesophageal Cancer
2006 · 6.339 Zit.
Preoperative Chemoradiotherapy for Esophageal or Junctional Cancer
2012 · 5.430 Zit.
A Predictive Model for Aggressive Non-Hodgkin's Lymphoma
1993 · 4.956 Zit.