An 8-Gene Signature for Classifying Major Subtypes of Non-Small-Cell Lung Cancer.

IF 2.5 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY
Cancer Informatics Pub Date : 2022-06-14 eCollection Date: 2022-01-01 DOI:10.1177/11769351221100718
Mehdi Hamaneh, Yi-Kuo Yu
{"title":"An 8-Gene Signature for Classifying Major Subtypes of Non-Small-Cell Lung Cancer.","authors":"Mehdi Hamaneh,&nbsp;Yi-Kuo Yu","doi":"10.1177/11769351221100718","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>The precise diagnosis of the major subtypes, lung adenocarcinoma and lung squamous cell carcinoma, of non-small-cell lung cancer is of practical importance as some treatments are subtype-specific. However, in some cases diagnosis via the commonly-used method, that is staining the specimen using immunohistochemical markers, may be challenging. Hence, having a computational method that complements the diagnosis is desirable. In this paper, we propose a gene signature for this purpose.</p><p><strong>Results: </strong>We developed an expression-based method that systematically suggests a huge set of candidate gene signatures and finds the best candidate. By applying this method to a training set, the optimal gene signature was found by considering close to 765 billion candidate signatures. The 8-gene signature found for classifying the 2 aforementioned subtypes comprises TP63, CALML3, KRT5, PKP1, TESC, SPINK1, C9orf152, and KRT7. The signature achieved a high overall prediction accuracy of 0.936 when tested using 34 independent gene expression datasets obtained using different technologies and comprising 2556 adenocarcinoma and 1630 squamous cell carcinoma samples. Additionally, the signature performed well in clinically challenging cases, that is poorly differentiated tumors and specimens obtained from biopsies. In comparison with 2 previously reported signatures, our signature performed better in terms of overall accuracy and especially accuracy of classifying lung squamous cell carcinoma.</p><p><strong>Conclusions: </strong>Our signature is easy to use and accurate regardless of the technology used to obtain the gene expression profiles. It performs well even in clinically challenging cases and thus can assist pathologists in diagnosis of the ambiguous cases.</p>","PeriodicalId":35418,"journal":{"name":"Cancer Informatics","volume":" ","pages":"11769351221100718"},"PeriodicalIF":2.5000,"publicationDate":"2022-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/b9/ce/10.1177_11769351221100718.PMC9201361.pdf","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cancer Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/11769351221100718","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2022/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 3

Abstract

Motivation: The precise diagnosis of the major subtypes, lung adenocarcinoma and lung squamous cell carcinoma, of non-small-cell lung cancer is of practical importance as some treatments are subtype-specific. However, in some cases diagnosis via the commonly-used method, that is staining the specimen using immunohistochemical markers, may be challenging. Hence, having a computational method that complements the diagnosis is desirable. In this paper, we propose a gene signature for this purpose.

Results: We developed an expression-based method that systematically suggests a huge set of candidate gene signatures and finds the best candidate. By applying this method to a training set, the optimal gene signature was found by considering close to 765 billion candidate signatures. The 8-gene signature found for classifying the 2 aforementioned subtypes comprises TP63, CALML3, KRT5, PKP1, TESC, SPINK1, C9orf152, and KRT7. The signature achieved a high overall prediction accuracy of 0.936 when tested using 34 independent gene expression datasets obtained using different technologies and comprising 2556 adenocarcinoma and 1630 squamous cell carcinoma samples. Additionally, the signature performed well in clinically challenging cases, that is poorly differentiated tumors and specimens obtained from biopsies. In comparison with 2 previously reported signatures, our signature performed better in terms of overall accuracy and especially accuracy of classifying lung squamous cell carcinoma.

Conclusions: Our signature is easy to use and accurate regardless of the technology used to obtain the gene expression profiles. It performs well even in clinically challenging cases and thus can assist pathologists in diagnosis of the ambiguous cases.

Abstract Image

Abstract Image

Abstract Image

非小细胞肺癌主要亚型分类的8基因标记。
动机:非小细胞肺癌的主要亚型肺腺癌和肺鳞状细胞癌的精确诊断具有重要的现实意义,因为一些治疗方法是针对亚型的。然而,在某些情况下,通过常用的方法进行诊断,即使用免疫组织化学标记物对标本进行染色,可能具有挑战性。因此,需要有一种补充诊断的计算方法。在本文中,我们为此提出了一个基因标记。结果:我们开发了一种基于表达的方法,系统地提出了大量的候选基因签名,并找到了最佳的候选基因。将该方法应用于训练集,通过考虑近7650亿个候选签名,找到了最优的基因签名。用于上述两种亚型分类的8个基因特征包括TP63、CALML3、KRT5、PKP1、TESC、SPINK1、C9orf152和KRT7。当使用使用不同技术获得的34个独立基因表达数据集,包括2556个腺癌和1630个鳞状细胞癌样本进行测试时,该特征获得了0.936的高总体预测精度。此外,在临床上具有挑战性的情况下,即从活检中获得的低分化肿瘤和标本,该特征表现良好。与先前报道的2个签名相比,我们的签名在总体准确性方面表现更好,特别是在肺鳞状细胞癌分类的准确性方面。结论:无论使用何种技术获取基因表达谱,我们的标记都易于使用且准确。它表现良好,即使在临床上具有挑战性的情况下,因此可以协助病理学家在诊断模棱两可的情况下。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Cancer Informatics
Cancer Informatics Medicine-Oncology
CiteScore
3.00
自引率
5.00%
发文量
30
审稿时长
8 weeks
期刊介绍: The field of cancer research relies on advances in many other disciplines, including omics technology, mass spectrometry, radio imaging, computer science, and biostatistics. Cancer Informatics provides open access to peer-reviewed high-quality manuscripts reporting bioinformatics analysis of molecular genetics and/or clinical data pertaining to cancer, emphasizing the use of machine learning, artificial intelligence, statistical algorithms, advanced imaging techniques, data visualization, and high-throughput technologies. As the leading journal dedicated exclusively to the report of the use of computational methods in cancer research and practice, Cancer Informatics leverages methodological improvements in systems biology, genomics, proteomics, metabolomics, and molecular biochemistry into the fields of cancer detection, treatment, classification, risk-prediction, prevention, outcome, and modeling.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信