Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
ConTEXTual Net 3D: Vision-Language Modeling in PET/CT for Visual Grounding of Positive Findings
0
Zitationen
10
Autoren
2026
Jahr
Abstract
F-fluciclovine (F1 = 0.66) exams. In conclusion, our novel weak labeling pipeline accurately produced an annotated dataset of PET/CT image-text pairs. ConTEXTual Net 3D significantly outperformed other models but fell short of the performance of nuclear medicine physicians. Our study suggests that even larger datasets may be needed to close this performance gap.
Ähnliche Arbeiten
MizAR 60 for Mizar 50
2023 · 75.280 Zit.
ImageNet: A large-scale hierarchical image database
2009 · 61.042 Zit.
Microsoft COCO: Common Objects in Context
2014 · 41.569 Zit.
Fully convolutional networks for semantic segmentation
2015 · 36.589 Zit.
Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization
2017 · 20.811 Zit.