机器学习在体外受精中的应用:比较双胞胎预测分类算法的准确性

Surgery, gynecology & obstetrics Pub Date : 2019-01-01 DOI:10.4172/2161-0932.1000497

R. John

{"title":"机器学习在体外受精中的应用:比较双胞胎预测分类算法的准确性","authors":"R. John","doi":"10.4172/2161-0932.1000497","DOIUrl":null,"url":null,"abstract":"Background: Clinical decision-making dilemmas are particularly notable in IVF practice, given that large datasets are often generated which enable clinicians to make predictions that inform treatment choices. This study applied machine learning by using IVF data to determine the risk of twins when two or more embryos are available for transfer. While most classifiers are able to provide estimates of accuracy, this study went further by comparing classifiers both by accuracy and Area Under the Curve (AUC).Methods: Study data were derived from a large electronic medical record system that is utilized by over 140 IVF clinics and contained 135,000 IVF cycles. The dataset was reduced from 88 variables to 40 and included only those cycles of IVF where two or more blastocyst embryos were created. The following classifiers were compared in terms of accuracy and AUC: a generalized linear model, linear discriminant analysis, quadratic discriminant analysis, K-nearest neighbors, support vector machine, random forests, and boosting. A stacking ensemble learning algorithm was also applied in order to use predictions from classifiers to create a new model.Results: While the ensemble classifier was the most accurate, none of the classifiers predominated as being significantly superior to other classifiers. Findings indicated that boosting methods for classifiers performed poorly; logistic and linear discriminant analysis classifiers performed better than the quadratic discriminant analysis classifier, and the support vector machine performed almost as well as the tree classifier. AUC results were consistent with the comparisons for accuracy. External validation was also performed using a different dataset containing 588 observations. All models performed better using the external validation dataset, with the random forest classifier performing markedly better than any other classifier.Conclusions: These results support the impression that big data can be of value in the clinical decision-making process; but that no single statistical algorithm provides maximum accuracy for all databases. Therefore, different datasets will require investigation in order to determine which algorithms are the most accurate for a particular set of data. These findings underscore the premise that clinicians with access to large amounts of data can use advanced predictive analytic models to create robust clinical information of vital importance for patient care.","PeriodicalId":22164,"journal":{"name":"Surgery, gynecology & obstetrics","volume":"75 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2019-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"An Application of Machine Learning in IVF: Comparing the Accuracy of Classification Alogithims for the Prediction of Twins\",\"authors\":\"R. John\",\"doi\":\"10.4172/2161-0932.1000497\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background: Clinical decision-making dilemmas are particularly notable in IVF practice, given that large datasets are often generated which enable clinicians to make predictions that inform treatment choices. This study applied machine learning by using IVF data to determine the risk of twins when two or more embryos are available for transfer. While most classifiers are able to provide estimates of accuracy, this study went further by comparing classifiers both by accuracy and Area Under the Curve (AUC).Methods: Study data were derived from a large electronic medical record system that is utilized by over 140 IVF clinics and contained 135,000 IVF cycles. The dataset was reduced from 88 variables to 40 and included only those cycles of IVF where two or more blastocyst embryos were created. The following classifiers were compared in terms of accuracy and AUC: a generalized linear model, linear discriminant analysis, quadratic discriminant analysis, K-nearest neighbors, support vector machine, random forests, and boosting. A stacking ensemble learning algorithm was also applied in order to use predictions from classifiers to create a new model.Results: While the ensemble classifier was the most accurate, none of the classifiers predominated as being significantly superior to other classifiers. Findings indicated that boosting methods for classifiers performed poorly; logistic and linear discriminant analysis classifiers performed better than the quadratic discriminant analysis classifier, and the support vector machine performed almost as well as the tree classifier. AUC results were consistent with the comparisons for accuracy. External validation was also performed using a different dataset containing 588 observations. All models performed better using the external validation dataset, with the random forest classifier performing markedly better than any other classifier.Conclusions: These results support the impression that big data can be of value in the clinical decision-making process; but that no single statistical algorithm provides maximum accuracy for all databases. Therefore, different datasets will require investigation in order to determine which algorithms are the most accurate for a particular set of data. These findings underscore the premise that clinicians with access to large amounts of data can use advanced predictive analytic models to create robust clinical information of vital importance for patient care.\",\"PeriodicalId\":22164,\"journal\":{\"name\":\"Surgery, gynecology & obstetrics\",\"volume\":\"75 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Surgery, gynecology & obstetrics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.4172/2161-0932.1000497\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Surgery, gynecology & obstetrics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4172/2161-0932.1000497","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

背景:临床决策困境在试管婴儿实践中尤为显著，因为通常会产生大量数据集，使临床医生能够做出预测，为治疗选择提供信息。本研究通过使用试管婴儿数据应用机器学习来确定当两个或更多胚胎可供移植时双胞胎的风险。虽然大多数分类器能够提供准确性的估计，但本研究通过比较准确率和曲线下面积(AUC)来进一步研究分类器。方法:研究数据来自一个大型电子病历系统，该系统由140多个试管婴儿诊所使用，包含135,000个试管婴儿周期。数据集从88个变量减少到40个，并且只包括那些产生两个或更多囊胚的试管受精周期。从准确率和AUC方面比较了以下分类器:广义线性模型、线性判别分析、二次判别分析、k近邻、支持向量机、随机森林和boosting。为了使用分类器的预测来创建新模型，还应用了堆叠集成学习算法。结果:虽然集成分类器是最准确的，但没有一个分类器明显优于其他分类器。研究结果表明，分类器的增强方法性能较差;逻辑和线性判别分析分类器的性能优于二次判别分析分类器，支持向量机的性能几乎与树分类器相当。AUC结果与准确度比较一致。外部验证也使用包含588个观测值的不同数据集进行。使用外部验证数据集，所有模型都表现得更好，随机森林分类器的表现明显好于任何其他分类器。结论:这些结果支持了大数据在临床决策过程中具有价值的印象;但是，没有一种统计算法能为所有数据库提供最高的准确性。因此，需要对不同的数据集进行调查，以确定哪种算法对特定数据集最准确。这些发现强调了一个前提，即拥有大量数据的临床医生可以使用先进的预测分析模型来创建对患者护理至关重要的可靠临床信息。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

An Application of Machine Learning in IVF: Comparing the Accuracy of Classification Alogithims for the Prediction of Twins

Background: Clinical decision-making dilemmas are particularly notable in IVF practice, given that large datasets are often generated which enable clinicians to make predictions that inform treatment choices. This study applied machine learning by using IVF data to determine the risk of twins when two or more embryos are available for transfer. While most classifiers are able to provide estimates of accuracy, this study went further by comparing classifiers both by accuracy and Area Under the Curve (AUC).Methods: Study data were derived from a large electronic medical record system that is utilized by over 140 IVF clinics and contained 135,000 IVF cycles. The dataset was reduced from 88 variables to 40 and included only those cycles of IVF where two or more blastocyst embryos were created. The following classifiers were compared in terms of accuracy and AUC: a generalized linear model, linear discriminant analysis, quadratic discriminant analysis, K-nearest neighbors, support vector machine, random forests, and boosting. A stacking ensemble learning algorithm was also applied in order to use predictions from classifiers to create a new model.Results: While the ensemble classifier was the most accurate, none of the classifiers predominated as being significantly superior to other classifiers. Findings indicated that boosting methods for classifiers performed poorly; logistic and linear discriminant analysis classifiers performed better than the quadratic discriminant analysis classifier, and the support vector machine performed almost as well as the tree classifier. AUC results were consistent with the comparisons for accuracy. External validation was also performed using a different dataset containing 588 observations. All models performed better using the external validation dataset, with the random forest classifier performing markedly better than any other classifier.Conclusions: These results support the impression that big data can be of value in the clinical decision-making process; but that no single statistical algorithm provides maximum accuracy for all databases. Therefore, different datasets will require investigation in order to determine which algorithms are the most accurate for a particular set of data. These findings underscore the premise that clinicians with access to large amounts of data can use advanced predictive analytic models to create robust clinical information of vital importance for patient care.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Surgery, gynecology & obstetrics

自引率

0.00%

发文量