TY - JOUR
T1 - Prediction for the Risk of Multiple Chronic Conditions among Working Population in the United States with Machine Learning Models
AU - Yang, Jingmei
AU - Ju, Xinglong
AU - Liu, Feng
AU - Asan, Onur
AU - Church, Timothy
AU - Smith, Jeff
N1 - Publisher Copyright:
© 2020 IEEE.
PY - 2021
Y1 - 2021
N2 - Objective: Chronic diseases have become the most prevalent and costly health conditions in the healthcare industry, deteriorating the quality of life, adversely affecting the work productivity, and costing astounding medical resources. However, few studies have been conducted on the predictive analysis of multiple chronic conditions (MCC) based on the working population. Results: Seven machine learning algorithms are used to support the decision making of healthcare practitioner on the risk of MCC. The models were developed and validated using checkup data from 451,425 working population collected by the healthcare providers. Our result shows that all proposed models achieved satisfactory performance, with the AUC values ranging from 0.826 to 0.850. Among the seven predictive models, the gradient boosting tree model outperformed other models, achieving an AUC of 0.850. Conclusions: Our risk prediction model shows great promise in automating real-time diagnosis, supporting healthcare practitioners to target high-risk individuals efficiently, and helping healthcare practitioners tailor proactive strategies to prevent the onset or delay the progression of the chronic diseases.
AB - Objective: Chronic diseases have become the most prevalent and costly health conditions in the healthcare industry, deteriorating the quality of life, adversely affecting the work productivity, and costing astounding medical resources. However, few studies have been conducted on the predictive analysis of multiple chronic conditions (MCC) based on the working population. Results: Seven machine learning algorithms are used to support the decision making of healthcare practitioner on the risk of MCC. The models were developed and validated using checkup data from 451,425 working population collected by the healthcare providers. Our result shows that all proposed models achieved satisfactory performance, with the AUC values ranging from 0.826 to 0.850. Among the seven predictive models, the gradient boosting tree model outperformed other models, achieving an AUC of 0.850. Conclusions: Our risk prediction model shows great promise in automating real-time diagnosis, supporting healthcare practitioners to target high-risk individuals efficiently, and helping healthcare practitioners tailor proactive strategies to prevent the onset or delay the progression of the chronic diseases.
KW - Multiple chronic conditions
KW - health informatics
KW - machine learning
KW - predictive analysis
UR - http://www.scopus.com/inward/record.url?scp=85121057968&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85121057968&partnerID=8YFLogxK
U2 - 10.1109/OJEMB.2021.3117872
DO - 10.1109/OJEMB.2021.3117872
M3 - Article
AN - SCOPUS:85121057968
VL - 2
SP - 291
EP - 298
JO - IEEE Open Journal of Engineering in Medicine and Biology
JF - IEEE Open Journal of Engineering in Medicine and Biology
ER -