Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
A Novel Approach for Classifying Diabetes’ Patients Based on Imputation and Machine Learning
23
Zitationen
4
Autoren
2020
Jahr
Abstract
Since the last decade, many research studies has been conducted on machine learning-based diabetes disease prediction using diagnostic measurement. However, the main challenge in machine learning-based diabetes disease prediction is the preprocessing of data, which contains, in most cases missing values and outliers. For data analytics and accurate prediction, data cleansing is highly desired and recommended. The goal of this study is to predict diabetic patients using realworld datasets. The proposed approach is based on three main steps: cleansing, modelling, and storytelling. In the first step, an imputation process is conducted to remove missing values. Then, k-nearest neighbor's algorithm is applied to classify patients. To evaluate the performance of the proposed approach, two criteria, namely the F1 score and the Receiver Operating Characteristic (ROC) has been used. F1 score and ROC curve show a clear distinction between diabetic and nondiabetic patients.
Ähnliche Arbeiten
Biostatistical Analysis
1996 · 35.450 Zit.
UCI Machine Learning Repository
2007 · 24.319 Zit.
An introduction to ROC analysis
2005 · 20.990 Zit.
Prediction of Coronary Heart Disease Using Risk Factor Categories
1998 · 9.605 Zit.
The use of the area under the ROC curve in the evaluation of machine learning algorithms
1997 · 7.190 Zit.