Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Leveraging LLMs for Efficient Data Structure Standardization in Chinese Medical Examination Reports
1
Zitationen
5
Autoren
2025
Jahr
Abstract
This study investigates the potential of large language models (LLMs) in standardizing Chinese medical examination reports, which are crucial for health assessment but often suffer from non-standardized terminologies across different hospitals. We utilized a dataset of over 900,000 Chinese reports and explored the effectiveness of prompt engineering and LoRA fine-tuning techniques on various LLMs. Our results demonstrate that fine-tuned LLMs, particularly Qwen2.5-14B, achieved high accuracy in predicting standardized department and item detail names, significantly outperforming traditional methods like BERT. Notably, the model exhibits strong generalization capabilities, achieving high accuracy on unseen hospital data with an average accuracy of 98.34% in cross-validation experiments. This research highlights the potential of LLMs in medical data standardization, laying the foundation for automated processing and analysis of large-scale medical data.
Ähnliche Arbeiten
"Why Should I Trust You?"
2016 · 14.588 Zit.
A Comprehensive Survey on Graph Neural Networks
2020 · 8.861 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.423 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.917 Zit.
Artificial intelligence in healthcare: past, present and future
2017 · 4.494 Zit.