Yan Liu , Yangyang Geng , Liuqing Yang , Shate Xiang , Qiaotong Wang , Lanyawen Hu , Ping Ye
{"title":"Traditional Chinese Medicine Constitution and Clinical Data Association with Machine Learning for Prediction of Spontaneous Abortion","authors":"Yan Liu , Yangyang Geng , Liuqing Yang , Shate Xiang , Qiaotong Wang , Lanyawen Hu , Ping Ye","doi":"10.1016/j.ccmp.2021.100016","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><p>Early prevention of Spontaneous Abortion (SA) is essential for the treatment of recurrent spontaneous abortion.</p></div><div><h3>Objective</h3><p>In this retrospective study, a variety of machine learning methods were used to develop predictive models and diagnose the potential risk of SA.</p></div><div><h3>Methods</h3><p>A total of 663 pregnant women participated in the case-control study, 586 of which were SA patients and 77 were normal parturition women. The research data included 25 features of Traditional Chinese Medicine (TCM) constitution and clinical data related to SA. This work utilized 8 machine learning techniques including logistic regression, gradient boosting decision tree, k-nearest neighbor, classification and r-egression tree, multilayer perceptron, support vector machine, random forest and XG-Boost to predict SA. The performances of the applied models were evaluated by using the method of 10-fold cross-validation and by computing the diagnostic test characteristics, including accuracy, precision, recall, <span><math><msub><mi>F</mi><mn>1</mn></msub></math></span> score, and the AUC of ROC curve.</p></div><div><h3>Results</h3><p>The <span><math><msub><mi>F</mi><mn>1</mn></msub></math></span> scores of these eight machine learning techniques were all above 97.5%. Among them, gradient boosting decision tree had the best prediction result on SA. The accuracy, precision, recall, <span><math><msub><mi>F</mi><mn>1</mn></msub></math></span> score, and the AUC of ROC curve of gradient boosting decision tree were 97.9%, 99%, 98.6%, 98.8%, and 97.3%, respectively.</p></div><div><h3>Conclusion</h3><p>The paper has accurately predicted the risk of SA combined with TCM constitution and clinical data.</p></div>","PeriodicalId":72608,"journal":{"name":"Clinical complementary medicine and pharmacology","volume":"2 2","pages":"Article 100016"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772371221000164/pdfft?md5=d4ab712a1e67bf47d86b9212ea8cd90c&pid=1-s2.0-S2772371221000164-main.pdf","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical complementary medicine and pharmacology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772371221000164","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Background
Early prevention of Spontaneous Abortion (SA) is essential for the treatment of recurrent spontaneous abortion.
Objective
In this retrospective study, a variety of machine learning methods were used to develop predictive models and diagnose the potential risk of SA.
Methods
A total of 663 pregnant women participated in the case-control study, 586 of which were SA patients and 77 were normal parturition women. The research data included 25 features of Traditional Chinese Medicine (TCM) constitution and clinical data related to SA. This work utilized 8 machine learning techniques including logistic regression, gradient boosting decision tree, k-nearest neighbor, classification and r-egression tree, multilayer perceptron, support vector machine, random forest and XG-Boost to predict SA. The performances of the applied models were evaluated by using the method of 10-fold cross-validation and by computing the diagnostic test characteristics, including accuracy, precision, recall, score, and the AUC of ROC curve.
Results
The scores of these eight machine learning techniques were all above 97.5%. Among them, gradient boosting decision tree had the best prediction result on SA. The accuracy, precision, recall, score, and the AUC of ROC curve of gradient boosting decision tree were 97.9%, 99%, 98.6%, 98.8%, and 97.3%, respectively.
Conclusion
The paper has accurately predicted the risk of SA combined with TCM constitution and clinical data.