TY - JOUR
T1 - Large language multimodal models for new-onset type 2 diabetes prediction using five-year cohort electronic health records
AU - Ding, Jun En
AU - Thao, Phan Nguyen Minh
AU - Peng, Wen Chih
AU - Wang, Jian Zhe
AU - Chug, Chun Cheng
AU - Hsieh, Min Chen
AU - Tseng, Yun Chien
AU - Chen, Ling
AU - Luo, Dongsheng
AU - Wu, Chenwei
AU - Wang, Chi Te
AU - Hsu, Chih Ho
AU - Chen, Yi Tui
AU - Chen, Pei Fu
AU - Liu, Feng
AU - Hung, Fang Ming
N1 - Publisher Copyright:
© The Author(s) 2024.
PY - 2024/12
Y1 - 2024/12
N2 - Type 2 diabetes mellitus (T2DM) is a prevalent health challenge faced by countries worldwide. In this study, we propose a novel large language multimodal models (LLMMs) framework incorporating multimodal data from clinical notes and laboratory results for diabetes risk prediction. We collected five years of electronic health records (EHRs) dating from 2017 to 2021 from a Taiwan hospital database. This dataset included 1,420,596 clinical notes, 387,392 laboratory results, and more than 1505 laboratory test items. Our method combined a text embedding encoder and multi-head attention layer to learn laboratory values, and utilized a deep neural network (DNN) module to merge blood features with chronic disease semantics into a latent space. In our experiments, we observed that integrating clinical notes with predictions based on textual laboratory values significantly enhanced the predictive capability of the unimodal model in the early detection of T2DM. Moreover, we achieved an area greater than 0.70 under the receiver operating characteristic curve (AUC) for new-onset T2DM prediction, demonstrating the effectiveness of leveraging textual laboratory data for training and inference in LLMs and improving the accuracy of new-onset diabetes prediction.
AB - Type 2 diabetes mellitus (T2DM) is a prevalent health challenge faced by countries worldwide. In this study, we propose a novel large language multimodal models (LLMMs) framework incorporating multimodal data from clinical notes and laboratory results for diabetes risk prediction. We collected five years of electronic health records (EHRs) dating from 2017 to 2021 from a Taiwan hospital database. This dataset included 1,420,596 clinical notes, 387,392 laboratory results, and more than 1505 laboratory test items. Our method combined a text embedding encoder and multi-head attention layer to learn laboratory values, and utilized a deep neural network (DNN) module to merge blood features with chronic disease semantics into a latent space. In our experiments, we observed that integrating clinical notes with predictions based on textual laboratory values significantly enhanced the predictive capability of the unimodal model in the early detection of T2DM. Moreover, we achieved an area greater than 0.70 under the receiver operating characteristic curve (AUC) for new-onset T2DM prediction, demonstrating the effectiveness of leveraging textual laboratory data for training and inference in LLMs and improving the accuracy of new-onset diabetes prediction.
UR - http://www.scopus.com/inward/record.url?scp=85203289369&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85203289369&partnerID=8YFLogxK
U2 - 10.1038/s41598-024-71020-2
DO - 10.1038/s41598-024-71020-2
M3 - Article
C2 - 39237580
AN - SCOPUS:85203289369
VL - 14
JO - Scientific Reports
JF - Scientific Reports
IS - 1
M1 - 20774
ER -