Predicting the risk of cardiovascular disease in adults exposed to heavy metals: Interpretable machine learning

Machine learning exhibits excellent performance in terms of predictive power. We aimed to construct an interpretable machine learning model utilizing National Health and Nutrition Examination Survey data to investigate the relationship between heavy metal exposure and cardiovascular disease (CVD). A...

Full description

Saved in:
Bibliographic Details
Main Authors: Meiyue Shen, Yine Zhang, Runqing Zhan, Tingwei Du, Peixuan Shen, Xiaochuan Lu, Shengnan Liu, Rongrong Guo, Xiaoli Shen
Format: Article
Language:English
Published: Elsevier 2025-01-01
Series:Ecotoxicology and Environmental Safety
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S0147651324016464
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1823856906428481536
author Meiyue Shen
Yine Zhang
Runqing Zhan
Tingwei Du
Peixuan Shen
Xiaochuan Lu
Shengnan Liu
Rongrong Guo
Xiaoli Shen
author_facet Meiyue Shen
Yine Zhang
Runqing Zhan
Tingwei Du
Peixuan Shen
Xiaochuan Lu
Shengnan Liu
Rongrong Guo
Xiaoli Shen
author_sort Meiyue Shen
collection DOAJ
description Machine learning exhibits excellent performance in terms of predictive power. We aimed to construct an interpretable machine learning model utilizing National Health and Nutrition Examination Survey data to investigate the relationship between heavy metal exposure and cardiovascular disease (CVD). A total of 4600 adults were included in the analysis. The Least Absolute Shrinkage and Selection Operator regression method was employed to select relevant feature variables. Subsequently, six machine learning models were constructed, including random forest, decision tree, gradient boosting decision tree, k-nearest neighbor, support vector machine, and AdaBoost algorithms. Feature importance analysis, partial dependence plot, and shapley additive explanations were integrated to enhance the interpretability of the CVD prediction model. Among all models, the random forest exhibited the best performance, with an accuracy of 90 %, an area under the curve of 0.85, and an F1 score of 0.86. Urine cadmium (Cd), blood lead (Pb), urine thallium (Tl), and urine tungsten (W) were identified as the most significant predictors of CVD, with importance scores of 0.062, 0.057, 0.051, and 0.050, respectively. At the overall level, higher levels of urine Cd, blood Pb, and urine W were associated with an increased risk of CVD, whereas a lower level of urine Tl was linked to a reduced CVD risk. Additionally, the analysis of synergistic effects revealed that Cd was the predominant determinant of CVD risk. The random forest-based CVD prediction model demonstrated excellent predictive power and provided valuable insights for personalized patient care and optimal resource allocation in populations exposed to heavy metals.
format Article
id doaj-art-6a6db19fb882458584ba878354fe2294
institution Kabale University
issn 0147-6513
language English
publishDate 2025-01-01
publisher Elsevier
record_format Article
series Ecotoxicology and Environmental Safety
spelling doaj-art-6a6db19fb882458584ba878354fe22942025-02-12T05:29:50ZengElsevierEcotoxicology and Environmental Safety0147-65132025-01-01290117570Predicting the risk of cardiovascular disease in adults exposed to heavy metals: Interpretable machine learningMeiyue Shen0Yine Zhang1Runqing Zhan2Tingwei Du3Peixuan Shen4Xiaochuan Lu5Shengnan Liu6Rongrong Guo7Xiaoli Shen8Department of Epidemiology and Health Statistics, School of Public Health, Qingdao University, Qingdao 266071, ChinaNingxia Center for Disease Control and Prevention, Yinchuan, ChinaQingdao Haici Hospital, Qingdao 266033, ChinaDepartment of Epidemiology and Health Statistics, School of Public Health, Qingdao University, Qingdao 266071, ChinaDepartment of Epidemiology and Health Statistics, School of Public Health, Qingdao University, Qingdao 266071, ChinaDepartment of Epidemiology and Health Statistics, School of Public Health, Qingdao University, Qingdao 266071, ChinaDepartment of Epidemiology and Health Statistics, School of Public Health, Qingdao University, Qingdao 266071, China; Ningxia Center for Disease Control and Prevention, Yinchuan, China; Qingdao Haici Hospital, Qingdao 266033, ChinaDepartment of Epidemiology and Health Statistics, School of Public Health, Qingdao University, Qingdao 266071, ChinaDepartment of Epidemiology and Health Statistics, School of Public Health, Qingdao University, Qingdao 266071, China; Correspondence to: Department of Epidemiology and Health Statistics, School of Public Health, Qingdao University, No. 308 Ningxia Rd, Qingdao 266071, China.Machine learning exhibits excellent performance in terms of predictive power. We aimed to construct an interpretable machine learning model utilizing National Health and Nutrition Examination Survey data to investigate the relationship between heavy metal exposure and cardiovascular disease (CVD). A total of 4600 adults were included in the analysis. The Least Absolute Shrinkage and Selection Operator regression method was employed to select relevant feature variables. Subsequently, six machine learning models were constructed, including random forest, decision tree, gradient boosting decision tree, k-nearest neighbor, support vector machine, and AdaBoost algorithms. Feature importance analysis, partial dependence plot, and shapley additive explanations were integrated to enhance the interpretability of the CVD prediction model. Among all models, the random forest exhibited the best performance, with an accuracy of 90 %, an area under the curve of 0.85, and an F1 score of 0.86. Urine cadmium (Cd), blood lead (Pb), urine thallium (Tl), and urine tungsten (W) were identified as the most significant predictors of CVD, with importance scores of 0.062, 0.057, 0.051, and 0.050, respectively. At the overall level, higher levels of urine Cd, blood Pb, and urine W were associated with an increased risk of CVD, whereas a lower level of urine Tl was linked to a reduced CVD risk. Additionally, the analysis of synergistic effects revealed that Cd was the predominant determinant of CVD risk. The random forest-based CVD prediction model demonstrated excellent predictive power and provided valuable insights for personalized patient care and optimal resource allocation in populations exposed to heavy metals.http://www.sciencedirect.com/science/article/pii/S0147651324016464Cardiovascular diseaseHeavy metalsMachine learningRandom forestAdaBoostPartial dependence plot
spellingShingle Meiyue Shen
Yine Zhang
Runqing Zhan
Tingwei Du
Peixuan Shen
Xiaochuan Lu
Shengnan Liu
Rongrong Guo
Xiaoli Shen
Predicting the risk of cardiovascular disease in adults exposed to heavy metals: Interpretable machine learning
Ecotoxicology and Environmental Safety
Cardiovascular disease
Heavy metals
Machine learning
Random forest
AdaBoost
Partial dependence plot
title Predicting the risk of cardiovascular disease in adults exposed to heavy metals: Interpretable machine learning
title_full Predicting the risk of cardiovascular disease in adults exposed to heavy metals: Interpretable machine learning
title_fullStr Predicting the risk of cardiovascular disease in adults exposed to heavy metals: Interpretable machine learning
title_full_unstemmed Predicting the risk of cardiovascular disease in adults exposed to heavy metals: Interpretable machine learning
title_short Predicting the risk of cardiovascular disease in adults exposed to heavy metals: Interpretable machine learning
title_sort predicting the risk of cardiovascular disease in adults exposed to heavy metals interpretable machine learning
topic Cardiovascular disease
Heavy metals
Machine learning
Random forest
AdaBoost
Partial dependence plot
url http://www.sciencedirect.com/science/article/pii/S0147651324016464
work_keys_str_mv AT meiyueshen predictingtheriskofcardiovasculardiseaseinadultsexposedtoheavymetalsinterpretablemachinelearning
AT yinezhang predictingtheriskofcardiovasculardiseaseinadultsexposedtoheavymetalsinterpretablemachinelearning
AT runqingzhan predictingtheriskofcardiovasculardiseaseinadultsexposedtoheavymetalsinterpretablemachinelearning
AT tingweidu predictingtheriskofcardiovasculardiseaseinadultsexposedtoheavymetalsinterpretablemachinelearning
AT peixuanshen predictingtheriskofcardiovasculardiseaseinadultsexposedtoheavymetalsinterpretablemachinelearning
AT xiaochuanlu predictingtheriskofcardiovasculardiseaseinadultsexposedtoheavymetalsinterpretablemachinelearning
AT shengnanliu predictingtheriskofcardiovasculardiseaseinadultsexposedtoheavymetalsinterpretablemachinelearning
AT rongrongguo predictingtheriskofcardiovasculardiseaseinadultsexposedtoheavymetalsinterpretablemachinelearning
AT xiaolishen predictingtheriskofcardiovasculardiseaseinadultsexposedtoheavymetalsinterpretablemachinelearning