基于多组学的机器学习模型预测头颈部鳞状细胞癌生存结果的比较

IF 1.8 4区 生物学 Q4 BIOCHEMISTRY & MOLECULAR BIOLOGY
Liying Mo, Yuangang Su, Jianhui Yuan, Zhiwei Xiao, Ziyan Zhang, Xiuwan Lan, Daizheng Huang
{"title":"基于多组学的机器学习模型预测头颈部鳞状细胞癌生存结果的比较","authors":"Liying Mo,&nbsp;Yuangang Su,&nbsp;Jianhui Yuan,&nbsp;Zhiwei Xiao,&nbsp;Ziyan Zhang,&nbsp;Xiuwan Lan,&nbsp;Daizheng Huang","doi":"10.2174/1389202923666220204153744","DOIUrl":null,"url":null,"abstract":"<p><p><b><i>Background</i>:</b> Machine learning methods showed excellent predictive ability in a wide range of fields. For the survival of head and neck squamous cell carcinoma (HNSC), its multi-omics influence is crucial. This study attempts to establish a variety of machine learning multi-omics models to predict the survival of HNSC and find the most suitable machine learning prediction method. <b><i>Methods</i>:</b> The HNSC clinical data and multi-omics data were downloaded from the TCGA database. The important variables were screened by the LASSO algorithm. We used a total of 12 supervised machine learning models to predict the outcome of HNSC survival and compared the results. <i>In vitro</i> qPCR was performed to verify core genes predicted by the random forest algorithm. <b><i>Results</i>:</b> For omics of HNSC, the results of the twelve models showed that the performance of multi-omics was better than each single-omic alone. Results were presented, which showed that the Bayesian network(BN) model (area under the curve [AUC] 0.8250, F1 score=0.7917) and random forest(RF) model (area under the curve [AUC] 0.8002,F1 score=0.7839) played good prediction performance in HNSC multi-omics data. The results of <i>in vitro</i> qPCR were consistent with the RF algorithm. <b><i>Conclusion</i>:</b> Machine learning methods could better forecast the survival outcome of HNSC. Meanwhile, this study found that the BN model and the RF model were the most superior. Moreover, the forecast result of multi-omics was better than single-omic alone in HNSC.</p>","PeriodicalId":10803,"journal":{"name":"Current Genomics","volume":"23 2","pages":"94-108"},"PeriodicalIF":1.8000,"publicationDate":"2022-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/a7/b3/CG-23-94.PMC9878835.pdf","citationCount":"1","resultStr":"{\"title\":\"Comparisons of Forecasting for Survival Outcome for Head and Neck Squamous Cell Carcinoma by using Machine Learning Models based on Multi-omics.\",\"authors\":\"Liying Mo,&nbsp;Yuangang Su,&nbsp;Jianhui Yuan,&nbsp;Zhiwei Xiao,&nbsp;Ziyan Zhang,&nbsp;Xiuwan Lan,&nbsp;Daizheng Huang\",\"doi\":\"10.2174/1389202923666220204153744\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p><b><i>Background</i>:</b> Machine learning methods showed excellent predictive ability in a wide range of fields. For the survival of head and neck squamous cell carcinoma (HNSC), its multi-omics influence is crucial. This study attempts to establish a variety of machine learning multi-omics models to predict the survival of HNSC and find the most suitable machine learning prediction method. <b><i>Methods</i>:</b> The HNSC clinical data and multi-omics data were downloaded from the TCGA database. The important variables were screened by the LASSO algorithm. We used a total of 12 supervised machine learning models to predict the outcome of HNSC survival and compared the results. <i>In vitro</i> qPCR was performed to verify core genes predicted by the random forest algorithm. <b><i>Results</i>:</b> For omics of HNSC, the results of the twelve models showed that the performance of multi-omics was better than each single-omic alone. Results were presented, which showed that the Bayesian network(BN) model (area under the curve [AUC] 0.8250, F1 score=0.7917) and random forest(RF) model (area under the curve [AUC] 0.8002,F1 score=0.7839) played good prediction performance in HNSC multi-omics data. The results of <i>in vitro</i> qPCR were consistent with the RF algorithm. <b><i>Conclusion</i>:</b> Machine learning methods could better forecast the survival outcome of HNSC. Meanwhile, this study found that the BN model and the RF model were the most superior. Moreover, the forecast result of multi-omics was better than single-omic alone in HNSC.</p>\",\"PeriodicalId\":10803,\"journal\":{\"name\":\"Current Genomics\",\"volume\":\"23 2\",\"pages\":\"94-108\"},\"PeriodicalIF\":1.8000,\"publicationDate\":\"2022-06-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/a7/b3/CG-23-94.PMC9878835.pdf\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Current Genomics\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.2174/1389202923666220204153744\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current Genomics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.2174/1389202923666220204153744","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 1

摘要

背景:机器学习方法在广泛的领域表现出出色的预测能力。对于头颈部鳞状细胞癌(HNSC)的生存,其多组学影响至关重要。本研究试图建立多种机器学习多组学模型来预测HNSC的生存,寻找最适合的机器学习预测方法。方法:从TCGA数据库下载HNSC临床资料和多组学资料。采用LASSO算法筛选重要变量。我们总共使用了12个监督机器学习模型来预测HNSC的生存结果,并对结果进行了比较。体外qPCR验证随机森林算法预测的核心基因。结果:对于HNSC组学,12个模型的结果表明,多组学的性能优于单个组学。结果表明,贝叶斯网络(BN)模型(曲线下面积[AUC] 0.8250, F1得分=0.7917)和随机森林(RF)模型(曲线下面积[AUC] 0.8002,F1得分=0.7839)对HNSC多组学数据具有较好的预测效果。体外qPCR结果与RF算法一致。结论:机器学习方法能较好地预测HNSC患者的生存结局。同时,本研究发现BN模型和RF模型最优。多组学对HNSC的预测结果优于单组学。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Comparisons of Forecasting for Survival Outcome for Head and Neck Squamous Cell Carcinoma by using Machine Learning Models based on Multi-omics.

Comparisons of Forecasting for Survival Outcome for Head and Neck Squamous Cell Carcinoma by using Machine Learning Models based on Multi-omics.

Background: Machine learning methods showed excellent predictive ability in a wide range of fields. For the survival of head and neck squamous cell carcinoma (HNSC), its multi-omics influence is crucial. This study attempts to establish a variety of machine learning multi-omics models to predict the survival of HNSC and find the most suitable machine learning prediction method. Methods: The HNSC clinical data and multi-omics data were downloaded from the TCGA database. The important variables were screened by the LASSO algorithm. We used a total of 12 supervised machine learning models to predict the outcome of HNSC survival and compared the results. In vitro qPCR was performed to verify core genes predicted by the random forest algorithm. Results: For omics of HNSC, the results of the twelve models showed that the performance of multi-omics was better than each single-omic alone. Results were presented, which showed that the Bayesian network(BN) model (area under the curve [AUC] 0.8250, F1 score=0.7917) and random forest(RF) model (area under the curve [AUC] 0.8002,F1 score=0.7839) played good prediction performance in HNSC multi-omics data. The results of in vitro qPCR were consistent with the RF algorithm. Conclusion: Machine learning methods could better forecast the survival outcome of HNSC. Meanwhile, this study found that the BN model and the RF model were the most superior. Moreover, the forecast result of multi-omics was better than single-omic alone in HNSC.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Current Genomics
Current Genomics 生物-生化与分子生物学
CiteScore
5.20
自引率
0.00%
发文量
29
审稿时长
>0 weeks
期刊介绍: Current Genomics is a peer-reviewed journal that provides essential reading about the latest and most important developments in genome science and related fields of research. Systems biology, systems modeling, machine learning, network inference, bioinformatics, computational biology, epigenetics, single cell genomics, extracellular vesicles, quantitative biology, and synthetic biology for the study of evolution, development, maintenance, aging and that of human health, human diseases, clinical genomics and precision medicine are topics of particular interest. The journal covers plant genomics. The journal will not consider articles dealing with breeding and livestock. Current Genomics publishes three types of articles including: i) Research papers from internationally-recognized experts reporting on new and original data generated at the genome scale level. Position papers dealing with new or challenging methodological approaches, whether experimental or mathematical, are greatly welcome in this section. ii) Authoritative and comprehensive full-length or mini reviews from widely recognized experts, covering the latest developments in genome science and related fields of research such as systems biology, statistics and machine learning, quantitative biology, and precision medicine. Proposals for mini-hot topics (2-3 review papers) and full hot topics (6-8 review papers) guest edited by internationally-recognized experts are welcome in this section. Hot topic proposals should not contain original data and they should contain articles originating from at least 2 different countries. iii) Opinion papers from internationally recognized experts addressing contemporary questions and issues in the field of genome science and systems biology and basic and clinical research practices.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信