A three-gene random forest model for diagnosing idiopathic pulmonary fibrosis based on circadian rhythm-related genes in lung tissue.

Expert review of respiratory medicine Pub Date : 2023-12-01 Epub Date: 2024-01-31 DOI:10.1080/17476348.2024.2311262
Jie He, Jun Hu, Hairong Liu
{"title":"A three-gene random forest model for diagnosing idiopathic pulmonary fibrosis based on circadian rhythm-related genes in lung tissue.","authors":"Jie He, Jun Hu, Hairong Liu","doi":"10.1080/17476348.2024.2311262","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The disorder of circadian rhythm could be a key factor mediating fibrotic lung disease Therefore, our study aims to determine the diagnostic value of circadian rhythm-related genes (CRRGs) in IPF.</p><p><strong>Methods: </strong>We retrieved the data on CRRGs from previous studies and the GSE150910 dataset. The participants from the GSE150910 dataset were divided into training and internal validation sets. Next, we used several various bioinformatics methods and machine learning algorithms to screen genes. Next, we identified SEMA5A, COL7A1, and TUBB3, which were included in the random forest (RF) diagnostic model. Finally, external validation was conducted on data retrieved from the GSE184316 datasets.</p><p><strong>Results: </strong>The results revealed that the RF diagnostic model could diagnose patients with IPF in the internal validation set with the area under the ROC curve (AUC) value of 0.905 and in the external validation with the AUC value of 0.767. Furthermore, real-time quantitative PCR and western blotting results revealed a significant decrease in SEMA5A (<i>p</i> < 0.05) expression level and an increase in COL7A1 and TUBB3 expression levels in TGF-β1-treated normal human lung fibroblasts.</p><p><strong>Conclusion: </strong>We constructed an RF diagnostic model based on SEMA5A, COL7A1, and TUBB3 expression in lung tissue for diagnosing patients with IPF.</p>","PeriodicalId":94007,"journal":{"name":"Expert review of respiratory medicine","volume":" ","pages":"1307-1320"},"PeriodicalIF":0.0000,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert review of respiratory medicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/17476348.2024.2311262","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/31 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Background: The disorder of circadian rhythm could be a key factor mediating fibrotic lung disease Therefore, our study aims to determine the diagnostic value of circadian rhythm-related genes (CRRGs) in IPF.

Methods: We retrieved the data on CRRGs from previous studies and the GSE150910 dataset. The participants from the GSE150910 dataset were divided into training and internal validation sets. Next, we used several various bioinformatics methods and machine learning algorithms to screen genes. Next, we identified SEMA5A, COL7A1, and TUBB3, which were included in the random forest (RF) diagnostic model. Finally, external validation was conducted on data retrieved from the GSE184316 datasets.

Results: The results revealed that the RF diagnostic model could diagnose patients with IPF in the internal validation set with the area under the ROC curve (AUC) value of 0.905 and in the external validation with the AUC value of 0.767. Furthermore, real-time quantitative PCR and western blotting results revealed a significant decrease in SEMA5A (p < 0.05) expression level and an increase in COL7A1 and TUBB3 expression levels in TGF-β1-treated normal human lung fibroblasts.

Conclusion: We constructed an RF diagnostic model based on SEMA5A, COL7A1, and TUBB3 expression in lung tissue for diagnosing patients with IPF.

基于肺组织中昼夜节律相关基因诊断特发性肺纤维化的三基因随机森林模型。
背景:昼夜节律紊乱可能是导致肺纤维化疾病的关键因素:因此,我们的研究旨在确定昼夜节律相关基因(CRRGs)在 IPF 中的诊断价值:我们从以往的研究和 GSE150910 数据集中检索了有关 CRRGs 的数据。我们将 GSE150910 数据集中的参与者分为训练集和内部验证集。接下来,我们使用了多种生物信息学方法和机器学习算法来筛选基因。接着,我们确定了 SEMA5A、COL7A1 和 TUBB3,并将其纳入随机森林(RF)诊断模型。最后,我们对从 GSE184316 数据集中获取的数据进行了外部验证:结果表明,RF 诊断模型在内部验证集中可诊断出 IPF 患者,其 ROC 曲线下面积(AUC)值为 0.905,在外部验证中的 AUC 值为 0.767。此外,实时定量 PCR 和 Western 印迹检测结果显示,SEMA5A 的含量显著下降(p 结论:SEMA5A 的检测结果显示,IFF 的发病率明显增加:我们根据肺组织中 SEMA5A、COL7A1 和 TUBB3 的表达构建了一个射频诊断模型,用于诊断 IPF 患者。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信