Application of Machine Learning to Development of Copy Number Variation-based Prediction of Cancer Risk.

Genomics insights Pub Date : 2014-06-26 eCollection Date: 2014-01-01 DOI:10.4137/GEI.S15002
Xiaofan Ding, Shui-Ying Tsang, Siu-Kin Ng, Hong Xue
{"title":"Application of Machine Learning to Development of Copy Number Variation-based Prediction of Cancer Risk.","authors":"Xiaofan Ding,&nbsp;Shui-Ying Tsang,&nbsp;Siu-Kin Ng,&nbsp;Hong Xue","doi":"10.4137/GEI.S15002","DOIUrl":null,"url":null,"abstract":"<p><p>In the present study, recurrent copy number variations (CNVs) from non-tumor blood cell DNAs of Caucasian non-cancer subjects and glioma, myeloma, and colorectal cancer-patients, and Korean non-cancer subjects and hepatocellular carcinoma, gastric cancer, and colorectal cancer patients, were found to reveal for each of the two ethnic cohorts highly significant differences between cancer patients and controls with respect to the number of CN-losses and size-distribution of CN-gains, suggesting the existence of recurrent constitutional CNV-features useful for prediction of predisposition to cancer. Upon identification by machine learning, such CNV-features could extensively discriminate between cancer-patient and control DNAs. When the CNV-features selected from a learning-group of Caucasian or Korean mixed DNAs consisting of both cancer-patient and control DNAs were employed to make predictions on the cancer predisposition of an unseen test group of mixed DNAs, the average prediction accuracy was 93.6% for the Caucasian cohort and 86.5% for the Korean cohort. </p>","PeriodicalId":88494,"journal":{"name":"Genomics insights","volume":"7 ","pages":"1-11"},"PeriodicalIF":0.0000,"publicationDate":"2014-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.4137/GEI.S15002","citationCount":"18","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genomics insights","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4137/GEI.S15002","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2014/1/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 18

Abstract

In the present study, recurrent copy number variations (CNVs) from non-tumor blood cell DNAs of Caucasian non-cancer subjects and glioma, myeloma, and colorectal cancer-patients, and Korean non-cancer subjects and hepatocellular carcinoma, gastric cancer, and colorectal cancer patients, were found to reveal for each of the two ethnic cohorts highly significant differences between cancer patients and controls with respect to the number of CN-losses and size-distribution of CN-gains, suggesting the existence of recurrent constitutional CNV-features useful for prediction of predisposition to cancer. Upon identification by machine learning, such CNV-features could extensively discriminate between cancer-patient and control DNAs. When the CNV-features selected from a learning-group of Caucasian or Korean mixed DNAs consisting of both cancer-patient and control DNAs were employed to make predictions on the cancer predisposition of an unseen test group of mixed DNAs, the average prediction accuracy was 93.6% for the Caucasian cohort and 86.5% for the Korean cohort.

Abstract Image

Abstract Image

Abstract Image

机器学习在基于拷贝数变化的癌症风险预测中的应用。
在本研究中,发现高加索非癌症受试者和胶质瘤、骨髓瘤和结直肠癌患者,以及韩国非癌症受试者和肝细胞癌、胃癌和结直肠癌患者的非肿瘤血细胞dna的复发性拷贝数变异(CNVs)显示,在两个种族队列中,癌症患者和对照组之间在cn -损失的数量和cn -获得的大小分布方面存在高度显著差异。提示存在可用于预测癌症易感性的复发性体质cnv特征。通过机器学习识别,这种cnv特征可以广泛区分癌症患者和控制dna。当从由癌症患者和对照dna组成的白人或韩国混合dna学习组中选择cnv特征用于预测未知混合dna测试组的癌症易感时,白人队列的平均预测准确率为93.6%,韩国队列的平均预测准确率为86.5%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信