Detection of Novel Gene Biomarkers in non-small cell lung cancer using integrated approaches in DNA methylation expression

Tun-Wen Pai
{"title":"Detection of Novel Gene Biomarkers in non-small cell lung cancer using integrated approaches in DNA methylation expression","authors":"Tun-Wen Pai","doi":"10.1145/3571532.3571534","DOIUrl":null,"url":null,"abstract":"Lung cancer is one of primal and ubiquitous cause of cancer related fatalities in the world. Leading cause of these fatalities is non-small cell lung cancer (NSCLC) with a percentage of 85. The major subtypes of NSCLC are Lung Adenocarcinoma (LUAD), Lung Squamous cell carcinoma (LUSC), and Large cell carcinoma. Early-stage surgical detection and removal of tumor offers a favorable prognosis and better survival rates. However, more than 75% patients have stage III/IV at the time of diagnosis and despite advanced major developments in oncology survival rates remain poor. Carcinogens produce widespread DNA methylation changes within cells. These changes are characterized by globally hyper or hypo methylated regions around CpG islands. Many of these changes occur early in tumorigenesis and are highly prevalent across a tumor type. In this study, DNA methylation profiles were extracted from TCGA for 418 LUAD and 370 LUSC tissue samples from patients compared with 32 and 42 non-malignant ones respectively. A standard pipeline was performed to consider significant differentially methylated sites as primary biomarkers, while secondary biomarkers were obtained from associated comorbidities and associated disease genes from meta-analysis study. Concordant candidates were utilized for NSCLC relevant biomarker candidates. Gene ontology annotations were used to calculate gene-pair distance matrix for all candidate biomarkers. Clustering algorithms were utilized to categorize candidate genes into different functional groups using gene distance matrix. There were 35 CpG loci identified as key biomarkers by comparing TCGA training cohort with GEO testing cohort from these functional groups.","PeriodicalId":355088,"journal":{"name":"Proceedings of the 2022 11th International Conference on Bioinformatics and Biomedical Science","volume":"100 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 11th International Conference on Bioinformatics and Biomedical Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3571532.3571534","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Lung cancer is one of primal and ubiquitous cause of cancer related fatalities in the world. Leading cause of these fatalities is non-small cell lung cancer (NSCLC) with a percentage of 85. The major subtypes of NSCLC are Lung Adenocarcinoma (LUAD), Lung Squamous cell carcinoma (LUSC), and Large cell carcinoma. Early-stage surgical detection and removal of tumor offers a favorable prognosis and better survival rates. However, more than 75% patients have stage III/IV at the time of diagnosis and despite advanced major developments in oncology survival rates remain poor. Carcinogens produce widespread DNA methylation changes within cells. These changes are characterized by globally hyper or hypo methylated regions around CpG islands. Many of these changes occur early in tumorigenesis and are highly prevalent across a tumor type. In this study, DNA methylation profiles were extracted from TCGA for 418 LUAD and 370 LUSC tissue samples from patients compared with 32 and 42 non-malignant ones respectively. A standard pipeline was performed to consider significant differentially methylated sites as primary biomarkers, while secondary biomarkers were obtained from associated comorbidities and associated disease genes from meta-analysis study. Concordant candidates were utilized for NSCLC relevant biomarker candidates. Gene ontology annotations were used to calculate gene-pair distance matrix for all candidate biomarkers. Clustering algorithms were utilized to categorize candidate genes into different functional groups using gene distance matrix. There were 35 CpG loci identified as key biomarkers by comparing TCGA training cohort with GEO testing cohort from these functional groups.
利用DNA甲基化表达的综合方法检测非小细胞肺癌中新的基因生物标志物
肺癌是世界上癌症相关死亡的主要和普遍原因之一。这些死亡的主要原因是非小细胞肺癌(NSCLC),占85%。非小细胞肺癌的主要亚型有肺腺癌(LUAD)、肺鳞状细胞癌(LUSC)和大细胞癌。早期手术发现和切除肿瘤提供了良好的预后和更好的生存率。然而,在诊断时,超过75%的患者处于III/IV期,尽管肿瘤学取得了重大进展,但生存率仍然很低。致癌物在细胞内产生广泛的DNA甲基化变化。这些变化的特征是CpG岛周围的全球高甲基化或低甲基化区域。许多这些变化发生在肿瘤发生的早期,并且在肿瘤类型中非常普遍。在本研究中,从TCGA中提取了418例LUAD和370例LUSC患者的DNA甲基化谱,与32例和42例非恶性患者的DNA甲基化谱进行比较。采用标准管道将显著差异甲基化位点作为主要生物标志物,而从meta分析研究的相关合并症和相关疾病基因中获得次要生物标志物。一致性候选物用于NSCLC相关生物标志物候选物。使用基因本体注释计算所有候选生物标记物的基因对距离矩阵。采用聚类算法,利用基因距离矩阵将候选基因划分为不同的功能群。通过比较这些功能群的TCGA训练组和GEO测试组,共鉴定出35个CpG位点为关键生物标志物。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信