Subtyping glioblastoma by combining miRNA and mRNA expression data using compressed sensing-based approach.

Wenlong Tang, Junbo Duan, Ji-Gang Zhang, Yu-Ping Wang
{"title":"Subtyping glioblastoma by combining miRNA and mRNA expression data using compressed sensing-based approach.","authors":"Wenlong Tang,&nbsp;Junbo Duan,&nbsp;Ji-Gang Zhang,&nbsp;Yu-Ping Wang","doi":"10.1186/1687-4153-2013-2","DOIUrl":null,"url":null,"abstract":"<p><p>In the clinical practice, many diseases such as glioblastoma, leukemia, diabetes, and prostates have multiple subtypes. Classifying subtypes accurately using genomic data will provide individualized treatments to target-specific disease subtypes. However, it is often difficult to obtain satisfactory classification accuracy using only one type of data, because the subtypes of a disease can exhibit similar patterns in one data type. Fortunately, multiple types of genomic data are often available due to the rapid development of genomic techniques. This raises the question on whether the classification performance can significantly be improved by combining multiple types of genomic data. In this article, we classified four subtypes of glioblastoma multiforme (GBM) with multiple types of genome-wide data (e.g., mRNA and miRNA expression) from The Cancer Genome Atlas (TCGA) project. We proposed a multi-class compressed sensing-based detector (MCSD) for this study. The MCSD was trained with data from TCGA and then applied to subtype GBM patients using an independent testing data. We performed the classification on the same patient subjects with three data types, i.e., miRNA expression data, mRNA (or gene expression) data, and their combinations. The classification accuracy is 69.1% with the miRNA expression data, 52.7% with mRNA expression data, and 90.9% with the combination of both mRNA and miRNA expression data. In addition, some biomarkers identified by the integrated approaches have been confirmed with results from the published literatures. These results indicate that the combined analysis can significantly improve the accuracy of classifying GBM subtypes and identify potential biomarkers for disease diagnosis.</p>","PeriodicalId":72957,"journal":{"name":"EURASIP journal on bioinformatics & systems biology","volume":"2013 1","pages":"2"},"PeriodicalIF":0.0000,"publicationDate":"2013-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1687-4153-2013-2","citationCount":"15","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"EURASIP journal on bioinformatics & systems biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/1687-4153-2013-2","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 15

Abstract

In the clinical practice, many diseases such as glioblastoma, leukemia, diabetes, and prostates have multiple subtypes. Classifying subtypes accurately using genomic data will provide individualized treatments to target-specific disease subtypes. However, it is often difficult to obtain satisfactory classification accuracy using only one type of data, because the subtypes of a disease can exhibit similar patterns in one data type. Fortunately, multiple types of genomic data are often available due to the rapid development of genomic techniques. This raises the question on whether the classification performance can significantly be improved by combining multiple types of genomic data. In this article, we classified four subtypes of glioblastoma multiforme (GBM) with multiple types of genome-wide data (e.g., mRNA and miRNA expression) from The Cancer Genome Atlas (TCGA) project. We proposed a multi-class compressed sensing-based detector (MCSD) for this study. The MCSD was trained with data from TCGA and then applied to subtype GBM patients using an independent testing data. We performed the classification on the same patient subjects with three data types, i.e., miRNA expression data, mRNA (or gene expression) data, and their combinations. The classification accuracy is 69.1% with the miRNA expression data, 52.7% with mRNA expression data, and 90.9% with the combination of both mRNA and miRNA expression data. In addition, some biomarkers identified by the integrated approaches have been confirmed with results from the published literatures. These results indicate that the combined analysis can significantly improve the accuracy of classifying GBM subtypes and identify potential biomarkers for disease diagnosis.

Abstract Image

Abstract Image

Abstract Image

利用基于压缩感知的方法结合miRNA和mRNA表达数据进行胶质母细胞瘤亚型分型。
在临床实践中,许多疾病如胶质母细胞瘤、白血病、糖尿病和前列腺癌都有多个亚型。利用基因组数据准确地分类亚型将为针对特定目标的疾病亚型提供个性化治疗。然而,仅使用一种类型的数据往往难以获得令人满意的分类准确性,因为一种疾病的亚型可能在一种数据类型中表现出相似的模式。幸运的是,由于基因组技术的快速发展,多种类型的基因组数据通常是可用的。这就提出了一个问题,即结合多种类型的基因组数据是否可以显著提高分类性能。在这篇文章中,我们从癌症基因组图谱(TCGA)项目中使用多种类型的全基因组数据(例如mRNA和miRNA表达)将多形性胶质母细胞瘤(GBM)分类为四种亚型。为此,我们提出了一种基于多类压缩感知的检测器(MCSD)。MCSD使用TCGA数据进行训练,然后使用独立测试数据应用于亚型GBM患者。我们使用三种数据类型对同一患者受试者进行分类,即miRNA表达数据、mRNA(或基因表达)数据及其组合。miRNA表达数据的分类准确率为69.1%,mRNA表达数据的分类准确率为52.7%,mRNA和miRNA联合表达数据的分类准确率为90.9%。此外,通过综合方法鉴定的一些生物标志物已与已发表的文献结果相证实。这些结果表明,联合分析可以显著提高GBM亚型分类的准确性,并识别潜在的疾病诊断生物标志物。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信