Integrated Bioinformatics and Machine Learning Analysis Identify ACADL as a Potent Biomarker of Reactive Mesothelial Cells

IF 4.7 2区 医学 Q1 PATHOLOGY
Yige Yin , Qianwen Cui , Jiarong Zhao , Qiang Wu , Qiuyan Sun , Hong-qiang Wang , Wulin Yang
{"title":"Integrated Bioinformatics and Machine Learning Analysis Identify ACADL as a Potent Biomarker of Reactive Mesothelial Cells","authors":"Yige Yin ,&nbsp;Qianwen Cui ,&nbsp;Jiarong Zhao ,&nbsp;Qiang Wu ,&nbsp;Qiuyan Sun ,&nbsp;Hong-qiang Wang ,&nbsp;Wulin Yang","doi":"10.1016/j.ajpath.2024.03.013","DOIUrl":null,"url":null,"abstract":"<div><p>Mesothelial cells with reactive hyperplasia are difficult to distinguish from malignant mesothelioma cells based on cell morphology. This study aimed to identify and validate potential biomarkers that distinguish mesothelial cells from mesothelioma cells through machine learning combined with immunohistochemistry. It integrated the gene expression matrix from three Gene Expression Omnibus data sets (GSE2549, GSE12345, and GSE51024) to analyze the differently expressed genes between normal and mesothelioma tissues. Then, three machine learning algorithms, least absolute shrinkage and selection operator, support vector machine recursive feature elimination, and random forest were used to screen and obtain four shared candidate markers, including <em>ACADL</em>, <em>EMP2</em>, <em>GPD1L</em>, and <em>HMMR</em>. The receiver operating characteristic curve analysis showed that the area under the curve for distinguishing normal mesothelial cells from mesothelioma was 0.976, 0.943, 0.962, and 0.956, respectively. The expression and diagnostic performance of these candidate genes were validated in two additional independent data sets (GSE42977 and GSE112154), indicating that the performances of <em>ACADL</em>, <em>GPD1L</em>, and <em>HMMR</em> were consistent between the training and validation data sets. Finally, the optimal candidate marker <em>ACADL</em> was verified by immunohistochemistry assay. Acyl-CoA dehydrogenase long chain (ACADL) was stained strongly in mesothelial cells, especially for reactive hyperplasic mesothelial cells, but was negative in malignant mesothelioma cells. Therefore, <em>ACADL</em> has the potential to be used as a specific marker of reactive hyperplasic mesothelial cells in the differential diagnosis of mesothelioma.</p></div>","PeriodicalId":7623,"journal":{"name":"American Journal of Pathology","volume":null,"pages":null},"PeriodicalIF":4.7000,"publicationDate":"2024-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"American Journal of Pathology","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0002944024001615","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PATHOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Mesothelial cells with reactive hyperplasia are difficult to distinguish from malignant mesothelioma cells based on cell morphology. This study aimed to identify and validate potential biomarkers that distinguish mesothelial cells from mesothelioma cells through machine learning combined with immunohistochemistry. It integrated the gene expression matrix from three Gene Expression Omnibus data sets (GSE2549, GSE12345, and GSE51024) to analyze the differently expressed genes between normal and mesothelioma tissues. Then, three machine learning algorithms, least absolute shrinkage and selection operator, support vector machine recursive feature elimination, and random forest were used to screen and obtain four shared candidate markers, including ACADL, EMP2, GPD1L, and HMMR. The receiver operating characteristic curve analysis showed that the area under the curve for distinguishing normal mesothelial cells from mesothelioma was 0.976, 0.943, 0.962, and 0.956, respectively. The expression and diagnostic performance of these candidate genes were validated in two additional independent data sets (GSE42977 and GSE112154), indicating that the performances of ACADL, GPD1L, and HMMR were consistent between the training and validation data sets. Finally, the optimal candidate marker ACADL was verified by immunohistochemistry assay. Acyl-CoA dehydrogenase long chain (ACADL) was stained strongly in mesothelial cells, especially for reactive hyperplasic mesothelial cells, but was negative in malignant mesothelioma cells. Therefore, ACADL has the potential to be used as a specific marker of reactive hyperplasic mesothelial cells in the differential diagnosis of mesothelioma.

Abstract Image

Abstract Image

综合生物信息学和机器学习分析确定 ACADL 是反应性间皮细胞的有效生物标志物。
根据细胞形态很难将反应性增生的间皮细胞与恶性间皮瘤细胞区分开来。本研究旨在通过机器学习与免疫组化相结合,识别并验证区分间皮细胞与间皮瘤细胞的潜在生物标记物。它整合了三个基因表达总库数据集(GSE2549、GSE12345 和 GSE51024)中的基因表达矩阵,分析了正常组织和间皮瘤组织中表达不同的基因。然后,使用最小绝对收缩和选择算子、支持向量机递归特征消除和随机森林三种机器学习算法筛选并获得了四个共享候选标记,包括ACADL、EMP2、GPD1L和HMMR。接受者操作特征曲线分析表明,区分正常间皮细胞和间皮瘤的曲线下面积分别为0.976、0.943、0.962和0.956。这些候选基因的表达和诊断性能在另外两个独立数据集(GSE42977 和 GSE112154)中得到了验证,表明 ACADL、GPD1L 和 HMMR 在训练数据集和验证数据集之间的性能是一致的。最后,通过免疫组化检测验证了最佳候选标记物 ACADL。乙酰辅酶脱氢酶长链(ACADL)在间皮细胞,尤其是反应性增生的间皮细胞中染色强烈,但在恶性间皮瘤细胞中呈阴性。因此,ACADL 有可能被用作间皮瘤鉴别诊断中反应性增生间皮细胞的特异性标志物。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
11.40
自引率
0.00%
发文量
178
审稿时长
30 days
期刊介绍: The American Journal of Pathology, official journal of the American Society for Investigative Pathology, published by Elsevier, Inc., seeks high-quality original research reports, reviews, and commentaries related to the molecular and cellular basis of disease. The editors will consider basic, translational, and clinical investigations that directly address mechanisms of pathogenesis or provide a foundation for future mechanistic inquiries. Examples of such foundational investigations include data mining, identification of biomarkers, molecular pathology, and discovery research. Foundational studies that incorporate deep learning and artificial intelligence are also welcome. High priority is given to studies of human disease and relevant experimental models using molecular, cellular, and organismal approaches.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信