Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Deep learning prediction of error and skill in robotic prostatectomy suturing
8
Zitationen
10
Autoren
2024
Jahr
Abstract
BACKGROUND: Manual objective assessment of skill and errors in minimally invasive surgery have been validated with correlation to surgical expertise and patient outcomes. However, assessment and error annotation can be subjective and are time-consuming processes, often precluding their use. Recent years have seen the development of artificial intelligence models to work towards automating the process to allow reduction of errors and truly objective assessment. This study aimed to validate surgical skill rating and error annotations in suturing gestures to inform the development and evaluation of AI models. METHODS: SAR-RARP50 open data set was blindly, independently annotated at the gesture level in Robotic-Assisted Radical Prostatectomy (RARP) suturing. Manual objective assessment tools and error annotation methodology, Objective Clinical Human Reliability Analysis (OCHRA), were used as ground truth to train and test vision-based deep learning methods to estimate skill and errors. Analysis included descriptive statistics plus tool validity and reliability. RESULTS: Fifty-four RARP videos (266 min) were analysed. Strong/excellent inter-rater reliability (range r = 0.70-0.89, p < 0.001) and very strong correlation (r = 0.92, p < 0.001) between objective assessment tools was demonstrated. Skill estimation of OSATS and M-GEARS had a Spearman's Correlation Coefficient 0.37 and 0.36, respectively, with normalised mean absolute error representing a prediction error of 17.92% (inverted "accuracy" 82.08%) and 20.6% (inverted "accuracy" 79.4%) respectively. The best performing models in error prediction achieved mean absolute precision of 37.14%, area under the curve 65.10% and Macro-F1 58.97%. CONCLUSIONS: This is the first study to employ detailed error detection methodology and deep learning models within real robotic surgical video. This benchmark evaluation of AI models sets a foundation and promising approach for future advancements in automated technical skill assessment.
Ähnliche Arbeiten
The SCARE 2020 Guideline: Updating Consensus Surgical CAse REport (SCARE) Guidelines
2020 · 5.581 Zit.
The SCARE 2023 guideline: updating consensus Surgical CAse REport (SCARE) guidelines
2023 · 3.007 Zit.
Virtual Reality Training Improves Operating Room Performance
2002 · 2.810 Zit.
Objective structured assessment of technical skill (OSATS) for surgical residents
1997 · 2.263 Zit.
Does Simulation-Based Medical Education With Deliberate Practice Yield Better Results Than Traditional Clinical Education? A Meta-Analytic Comparative Review of the Evidence
2011 · 1.754 Zit.
Autoren
Institutionen
- Wellcome / EPSRC Centre for Interventional and Surgical Sciences(GB)
- University College London(GB)
- St Mark's Hospital(GB)
- London Vision Clinic(GB)
- University College London Hospitals NHS Foundation Trust(GB)
- Yeovil District Hospital NHS Foundation Trust(GB)
- The Francis Crick Institute(GB)
- Griffin Hospital(US)