Integrative machine learning and RT-qPCR analysis identify key stress-responsive genes in Thermus thermophilus HB8.

IF 1.3 4区 生物学 Q4 GENETICS & HEREDITY
Abbas Karimi-Fard, Abbas Saidi, Masoud Tohidfar, Seyedeh Noushin Emami
{"title":"Integrative machine learning and RT-qPCR analysis identify key stress-responsive genes in Thermus thermophilus HB8.","authors":"Abbas Karimi-Fard, Abbas Saidi, Masoud Tohidfar, Seyedeh Noushin Emami","doi":"10.1007/s10709-025-00243-6","DOIUrl":null,"url":null,"abstract":"<p><p>Bacteria are constantly exposed to diverse environmental stresses, necessitating complex adaptive mechanisms for survival. Thermus thermophilus, a thermophilic extremophile, serves as an excellent model for investigating these responses due to its remarkable resilience to harsh conditions. Recent advances in artificial intelligence, particularly in machine learning, have transformed the identification of novel stress-responsive biomarkers. In this study, we analyzed transcriptomic data from 65 T. thermophilus HB8 samples subjected to various abiotic stresses to identify key genes involved in stress adaptation. We applied a suite of supervised machine learning algorithms to classify samples and prioritize informative features. Among the tested models, Extreme Gradient Boosting (XGBoost) and Random Forest (RF) achieved the highest classification performance, with XGBoost attaining perfect discrimination between stressed and control samples (AUC = 1.00) and RF closely following (AUC = 0.99). Feature importance analysis consistently identified three candidate genes: TTHA0029, TTHA1720, and TTHA1359. Functional validation using RT-qPCR confirmed the significant upregulation of TTHA0029 and TTHA1720 under salt and hydrogen peroxide stress, suggesting roles in redox regulation and ionic homeostasis. Phylogenetic analysis further revealed the specificity of these genes to the Thermus genus. Overall, our findings highlight central molecular players in stress tolerance in T. thermophilus and demonstrate the utility of machine learning in biomarker discovery. The identified genes, TTHA0029 and TTHA1720, may serve as promising targets for genetic engineering to improve stress resilience in both crops and industrially relevant microorganisms.</p>","PeriodicalId":55121,"journal":{"name":"Genetica","volume":"153 1","pages":"28"},"PeriodicalIF":1.3000,"publicationDate":"2025-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genetica","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1007/s10709-025-00243-6","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

Bacteria are constantly exposed to diverse environmental stresses, necessitating complex adaptive mechanisms for survival. Thermus thermophilus, a thermophilic extremophile, serves as an excellent model for investigating these responses due to its remarkable resilience to harsh conditions. Recent advances in artificial intelligence, particularly in machine learning, have transformed the identification of novel stress-responsive biomarkers. In this study, we analyzed transcriptomic data from 65 T. thermophilus HB8 samples subjected to various abiotic stresses to identify key genes involved in stress adaptation. We applied a suite of supervised machine learning algorithms to classify samples and prioritize informative features. Among the tested models, Extreme Gradient Boosting (XGBoost) and Random Forest (RF) achieved the highest classification performance, with XGBoost attaining perfect discrimination between stressed and control samples (AUC = 1.00) and RF closely following (AUC = 0.99). Feature importance analysis consistently identified three candidate genes: TTHA0029, TTHA1720, and TTHA1359. Functional validation using RT-qPCR confirmed the significant upregulation of TTHA0029 and TTHA1720 under salt and hydrogen peroxide stress, suggesting roles in redox regulation and ionic homeostasis. Phylogenetic analysis further revealed the specificity of these genes to the Thermus genus. Overall, our findings highlight central molecular players in stress tolerance in T. thermophilus and demonstrate the utility of machine learning in biomarker discovery. The identified genes, TTHA0029 and TTHA1720, may serve as promising targets for genetic engineering to improve stress resilience in both crops and industrially relevant microorganisms.

综合机器学习和RT-qPCR分析鉴定了嗜热热菌HB8的关键应激反应基因。
细菌不断暴露于不同的环境压力下,需要复杂的适应机制来生存。嗜热热菌是一种嗜热的极端微生物,由于其对恶劣条件的显著恢复能力,它可以作为研究这些反应的极好模型。人工智能的最新进展,特别是在机器学习方面,已经改变了对新型应激反应生物标志物的识别。在这项研究中,我们分析了65份受各种非生物胁迫的嗜热T. HB8样本的转录组学数据,以确定参与胁迫适应的关键基因。我们应用了一套有监督的机器学习算法来对样本进行分类并对信息特征进行优先排序。在测试的模型中,极端梯度增强(Extreme Gradient boost, XGBoost)和随机森林(Random Forest, RF)的分类性能最高,其中XGBoost在压力样本和对照样本之间获得了完美的区分(AUC = 1.00), RF紧随其后(AUC = 0.99)。特征重要性分析一致确定了三个候选基因:TTHA0029、TTHA1720和TTHA1359。RT-qPCR功能验证证实,TTHA0029和TTHA1720在盐和过氧化氢胁迫下显著上调,提示其参与氧化还原调控和离子稳态。系统发育分析进一步揭示了这些基因对热蝇属的特异性。总的来说,我们的研究结果突出了嗜热t菌耐受性的核心分子,并证明了机器学习在生物标志物发现中的实用性。所鉴定的基因TTHA0029和TTHA1720可能成为基因工程提高作物和工业相关微生物抗逆性的有希望的靶点。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Genetica
Genetica 生物-遗传学
CiteScore
2.70
自引率
0.00%
发文量
32
审稿时长
>12 weeks
期刊介绍: Genetica publishes papers dealing with genetics, genomics, and evolution. Our journal covers novel advances in the fields of genomics, conservation genetics, genotype-phenotype interactions, evo-devo, population and quantitative genetics, and biodiversity. Genetica publishes original research articles addressing novel conceptual, experimental, and theoretical issues in these areas, whatever the taxon considered. Biomedical papers and papers on breeding animal and plant genetics are not within the scope of Genetica, unless framed in an evolutionary context. Recent advances in genetics, genomics and evolution are also published in thematic issues and synthesis papers published by experts in the field.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信