OpenAlex · Updated hourly · Last updated: 12 May 2026, 16:17

This is an overview page with metadata for this scholarly work. The full article is available from the publisher.

Towards Automatic Generation of Shareable Synthetic Clinical Notes Using Neural Language Models

2019 · 0 citations · arXiv (Cornell University) · Open Access

Citations: 0
Authors: 2
Year: 2019

Abstract

Large-scale clinical data is invaluable to driving many computational scientific advances today. However, understandable concerns regarding patient privacy hinder the open dissemination of such data and give rise to suboptimal siloed research. De-identification methods attempt to address these concerns but were shown to be susceptible to adversarial attacks. In this work, we focus on the vast amounts of unstructured natural language data stored in clinical notes and propose to automatically generate synthetic clinical notes that are more amenable to sharing using generative models trained on real de-identified records. To evaluate the merit of such notes, we measure both their privacy preservation properties as well as utility in training clinical NLP models. Experiments using neural language models yield notes whose utility is close to that of the real ones in some clinical NLP tasks, yet leave ample room for future improvements.

Topics

Machine Learning in Healthcare · Topic Modeling · Artificial Intelligence in Healthcare and Education