Discovery of Stomach Adenocarcinoma Biomarkers by Consensus Scoring of Random Sampling and Machine Learning Modeling

Ji Chen, Yang Hao, Tianjun Wang, Daiyun Huang, Xin Liu
{"title":"Discovery of Stomach Adenocarcinoma Biomarkers by Consensus Scoring of Random Sampling and Machine Learning Modeling","authors":"Ji Chen, Yang Hao, Tianjun Wang, Daiyun Huang, Xin Liu","doi":"10.1109/icbcb55259.2022.9802469","DOIUrl":null,"url":null,"abstract":"Stomach adenocarcinoma (STAD) is a subtype of gastric cancer with high incidence and mortality. Lack of early detection results in the poor prognosis of this cancer, leading to low survival rate of patients. In this study, machine learning methods, specifically support vector machine (SVM) based recursive feature elimination (SVM-RFE), were applied to discover the potential biomarkers of STAD with the data form the Cancer Genome Atlas (TCGA). After the optimal parameter set was determined, random sampling was conducted to minimize the limitation caused by small sample size (64 paired tumor and adjacent non-tumor samples). As a result, five genes (COL10A1, CST1, ESM1, HOXC11 and HOXC9) were identified to be essential to the predictive model built by SVM-RFE. In addition, other three genes GAD1, HOXA11 and PRKCG are of less importance but still could be potential biomarkers of STAD.","PeriodicalId":429633,"journal":{"name":"2022 10th International Conference on Bioinformatics and Computational Biology (ICBCB)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 10th International Conference on Bioinformatics and Computational Biology (ICBCB)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icbcb55259.2022.9802469","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Stomach adenocarcinoma (STAD) is a subtype of gastric cancer with high incidence and mortality. Lack of early detection results in the poor prognosis of this cancer, leading to low survival rate of patients. In this study, machine learning methods, specifically support vector machine (SVM) based recursive feature elimination (SVM-RFE), were applied to discover the potential biomarkers of STAD with the data form the Cancer Genome Atlas (TCGA). After the optimal parameter set was determined, random sampling was conducted to minimize the limitation caused by small sample size (64 paired tumor and adjacent non-tumor samples). As a result, five genes (COL10A1, CST1, ESM1, HOXC11 and HOXC9) were identified to be essential to the predictive model built by SVM-RFE. In addition, other three genes GAD1, HOXA11 and PRKCG are of less importance but still could be potential biomarkers of STAD.
通过随机抽样和机器学习建模的共识评分发现胃腺癌生物标志物
胃腺癌(STAD)是一种发病率高、死亡率高的胃癌亚型。由于缺乏早期发现,导致该癌症预后差,导致患者生存率低。本研究采用机器学习方法,特别是基于支持向量机(SVM)的递归特征消除(SVM- rfe),利用来自癌症基因组图谱(TCGA)的数据发现STAD的潜在生物标志物。在确定最优参数集后,进行随机抽样,以最大限度地减少样本量小(64个成对的肿瘤和相邻的非肿瘤样本)所带来的限制。结果发现,5个基因(COL10A1、CST1、ESM1、HOXC11和HOXC9)对SVM-RFE构建的预测模型至关重要。此外,GAD1、HOXA11和PRKCG三个基因的重要性较低,但仍可能是STAD的潜在生物标志物。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信