Identification of key risk factors for venous thromboembolism in urological inpatients based on the Caprini scale and interpretable machine learning methods.
Authors: Chao Liu, Wei-Ying Yang, Fengmin Cheng, Ching-Wen Chien, Yen-Ching Chuang, Yanjun Jin
Journal: Thrombosis Journal (impact factor 2.6; JCR Q2, Hematology)
Published: 2024-08-16
DOI: 10.1186/s12959-024-00645-0 (https://doi.org/10.1186/s12959-024-00645-0)
Full text (PDF): https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11328390/pdf/
Citations: 0
Abstract
Purpose: To identify the key risk factors for venous thromboembolism (VTE) in urological inpatients based on the Caprini scale using an interpretable machine learning method.
Methods: VTE risk data of urological inpatients were obtained based on the Caprini scale in the case hospital. The Boruta method was then used to select the key variables from the 37 variables in the Caprini scale. Next, decision rules corresponding to each risk level were generated using the rough set (RS) method. Finally, random forest (RF), support vector machine (SVM), and backpropagation artificial neural network (BPANN) models were used to verify the classification accuracy and were compared with the RS method.
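To illustrate the kind of decision rule the RS step produces, here is a minimal Python sketch of rough-set lower-approximation rules. It is not the authors' implementation: the function name `certain_rules`, the attribute names, and the five toy records are all hypothetical and are not study data. Records that share the same condition values form one indiscernibility class, and a class yields a certain rule only when all of its members carry the same risk level.

```python
from collections import defaultdict


def certain_rules(records, condition_keys, decision_key):
    """Return (conditions, decision, support) for every consistent class."""
    # Group records that are indiscernible on the chosen condition attributes.
    classes = defaultdict(list)
    for rec in records:
        key = tuple(rec[k] for k in condition_keys)
        classes[key].append(rec[decision_key])

    rules = []
    for key, decisions in classes.items():
        # A class lies in the lower approximation of a decision value
        # only if every record in it carries that same decision.
        if len(set(decisions)) == 1:
            conditions = dict(zip(condition_keys, key))
            rules.append((conditions, decisions[0], len(decisions)))
    return rules


# Hypothetical toy records for illustration only (not data from the study).
records = [
    {"age_band": "61-74", "major_surgery": 1, "malignancy": 1, "risk": "high"},
    {"age_band": "61-74", "major_surgery": 1, "malignancy": 1, "risk": "high"},
    {"age_band": "41-60", "major_surgery": 1, "malignancy": 0, "risk": "mid"},
    {"age_band": "41-60", "major_surgery": 0, "malignancy": 0, "risk": "low"},
    {"age_band": "41-60", "major_surgery": 0, "malignancy": 0, "risk": "mid"},
]
condition_keys = ["age_band", "major_surgery", "malignancy"]
rules = certain_rules(records, condition_keys, "risk")

for conditions, decision, support in rules:
    clause = " AND ".join(f"{k} = {v}" for k, v in conditions.items())
    print(f"IF {clause} THEN risk = {decision} (support {support})")
```

The last two toy records share identical condition values but differ in risk level, so they fall in a boundary region and produce no certain rule. A full rough-set workflow would also compute attribute reducts, which this sketch omits.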
Results: Following the screening, the key risk factors for VTE in urology were "(C1) Age," "(C2) Minor surgery planned," "(C3) Obesity (BMI > 25)," "(C8) Varicose veins," "(C9) Sepsis (< 1 month)," "(C10) Serious lung disease incl. pneumonia (< 1 month)," "(C11) COPD," "(C16) Other risk," "(C18) Major surgery (> 45 min)," "(C19) Laparoscopic surgery (> 45 min)," "(C20) Patient confined to bed (> 72 h)," "(C21) Malignancy (present or previous)," "(C23) Central venous access," "(C31) History of DVT/PE," "(C32) Other congenital or acquired thrombophilia," and "(C34) Stroke (< 1 month)." According to the decision rules for the different risk levels obtained using the RS method, "(C1) Age," "(C18) Major surgery (> 45 min)," and "(C21) Malignancy (present or previous)" were the main factors influencing mid- and high-risk levels, and suggestions for VTE prevention were derived from these three factors. The average accuracies of the RS, RF, SVM, and BPANN models were 79.5%, 87.9%, 92.6%, and 97.2%, respectively. BPANN had the highest accuracy, recall, F1-score, and precision.
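For readers unfamiliar with the four reported metrics, the sketch below shows one common way to compute accuracy, precision, recall, and F1-score for a multi-class risk label. The abstract does not state how the per-class values were averaged, so macro-averaging is an assumption here; the function name `macro_metrics` and the toy labels are hypothetical and unrelated to the study's data.

```python
def macro_metrics(y_true, y_pred):
    """Accuracy plus macro-averaged precision, recall, and F1 over all labels."""
    labels = sorted(set(y_true) | set(y_pred))
    pairs = list(zip(y_true, y_pred))
    precisions, recalls, f1s = [], [], []
    for label in labels:
        # One-vs-rest counts for the current label.
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        precisions.append(precision)
        recalls.append(recall)
        f1s.append(f1)
    n = len(labels)
    return {
        "accuracy": sum(t == p for t, p in pairs) / len(pairs),
        "precision": sum(precisions) / n,
        "recall": sum(recalls) / n,
        "f1": sum(f1s) / n,
    }


# Hypothetical toy labels for illustration only (not data from the study).
y_true = ["low", "low", "mid", "mid", "high", "high"]
y_pred = ["low", "mid", "mid", "mid", "high", "high"]
metrics = macro_metrics(y_true, y_pred)
print({name: round(value, 3) for name, value in metrics.items()})
```

Macro-averaging weights each risk level equally regardless of how many patients fall into it; a weighted or micro average would give different numbers when the classes are imbalanced.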
Conclusions: The RS model achieved poorer accuracy than the other three common machine learning models. However, the RS model provides strong interpretability and allows for the identification of high-risk factors and decision rules influencing high-risk assessments of VTE in urology. This transparency is very important for clinicians in the risk assessment process.
About the journal:
Thrombosis Journal is an open-access journal that publishes original articles on aspects of clinical and basic research, new methodology, case reports and reviews in the areas of thrombosis.
Topics of particular interest include the diagnosis of arterial and venous thrombosis, new antithrombotic treatments, new developments in the understanding, diagnosis and treatments of atherosclerotic vessel disease, relations between haemostasis and vascular disease, hypertension, diabetes, immunology and obesity.