关键基因的鉴定和败血症识别机器学习模型的开发。

IF 5.4 3区 医学 Q2 CELL BIOLOGY
Zhonghao Li, Shengsong Chen, Nan Gao, Jie Chen, Ying Qin, Guoqiang Zhang
{"title":"关键基因的鉴定和败血症识别机器学习模型的开发。","authors":"Zhonghao Li, Shengsong Chen, Nan Gao, Jie Chen, Ying Qin, Guoqiang Zhang","doi":"10.1007/s00011-025-02068-7","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective and design: </strong>This study aims to identify key genes of sepsis and construct a model for sepsis identification through integrated multi-organ single-cell RNA sequencing (scRNA-seq) and machine learning.</p><p><strong>Material or subjects: </strong>Datasets downloaded from the Gene Expression Omnibus (GSE207363, GSE207651, GSE185263, GSE69063 and GSE134347) were used.</p><p><strong>Methods: </strong>ScRNA-seq data extracted from heart (GSE207363) and lung tissues (GSE207651) of septic mice were processed and analyzed using the Seurat package in R. Key genes were identified as present in both heart and lung tissues, resulting from the overlap of three analyses along with differential expression analyses. We then used support vector machine recursive feature elimination to construct a model for sepsis identification based on these key genes. The GSE185263 dataset was used for training, while GSE69063 and GSE134347 were used for testing. The accuracy of the model in identifying of sepsis was validated by analyzing the area under the receiver operating characteristic curve (AUROC) using the test datasets.</p><p><strong>Results: </strong>Thirteen genes were initially identified as key genes, and after translation to their human homologs, ten genes remained. The optimal SVM-RFE model incorporated eight of these genes (CAMP, CD74, HLA-DQA1, HLA-DQB1, HLA-DMA, HLA-DRB5, and LYZ). In the two test datasets, the AUROC value for the accuracy of the model in identifying of sepsis was 0.904 and 0.924, respectively.</p><p><strong>Conclusions: </strong>We have identified several key genes and developed a machine learning model for sepsis identification. Further studies are needed to validate our findings.</p>","PeriodicalId":13550,"journal":{"name":"Inflammation Research","volume":"74 1","pages":"100"},"PeriodicalIF":5.4000,"publicationDate":"2025-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Identification of key genes and development of an identifying machine learning model for sepsis.\",\"authors\":\"Zhonghao Li, Shengsong Chen, Nan Gao, Jie Chen, Ying Qin, Guoqiang Zhang\",\"doi\":\"10.1007/s00011-025-02068-7\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objective and design: </strong>This study aims to identify key genes of sepsis and construct a model for sepsis identification through integrated multi-organ single-cell RNA sequencing (scRNA-seq) and machine learning.</p><p><strong>Material or subjects: </strong>Datasets downloaded from the Gene Expression Omnibus (GSE207363, GSE207651, GSE185263, GSE69063 and GSE134347) were used.</p><p><strong>Methods: </strong>ScRNA-seq data extracted from heart (GSE207363) and lung tissues (GSE207651) of septic mice were processed and analyzed using the Seurat package in R. Key genes were identified as present in both heart and lung tissues, resulting from the overlap of three analyses along with differential expression analyses. We then used support vector machine recursive feature elimination to construct a model for sepsis identification based on these key genes. The GSE185263 dataset was used for training, while GSE69063 and GSE134347 were used for testing. The accuracy of the model in identifying of sepsis was validated by analyzing the area under the receiver operating characteristic curve (AUROC) using the test datasets.</p><p><strong>Results: </strong>Thirteen genes were initially identified as key genes, and after translation to their human homologs, ten genes remained. The optimal SVM-RFE model incorporated eight of these genes (CAMP, CD74, HLA-DQA1, HLA-DQB1, HLA-DMA, HLA-DRB5, and LYZ). In the two test datasets, the AUROC value for the accuracy of the model in identifying of sepsis was 0.904 and 0.924, respectively.</p><p><strong>Conclusions: </strong>We have identified several key genes and developed a machine learning model for sepsis identification. Further studies are needed to validate our findings.</p>\",\"PeriodicalId\":13550,\"journal\":{\"name\":\"Inflammation Research\",\"volume\":\"74 1\",\"pages\":\"100\"},\"PeriodicalIF\":5.4000,\"publicationDate\":\"2025-06-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Inflammation Research\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1007/s00011-025-02068-7\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CELL BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Inflammation Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00011-025-02068-7","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CELL BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

目的与设计:本研究旨在通过集成多器官单细胞RNA测序(scRNA-seq)和机器学习技术,鉴定脓毒症的关键基因,构建脓毒症鉴定模型。材料或研究对象:使用从Gene Expression Omnibus下载的数据集(GSE207363、GSE207651、GSE185263、GSE69063和GSE134347)。方法:从脓毒症小鼠的心脏(GSE207363)和肺组织(GSE207651)中提取ScRNA-seq数据,使用r中的Seurat包进行处理和分析,发现关键基因存在于心脏和肺组织中,这是由于三种分析的重叠以及差异表达分析。然后,我们使用支持向量机递归特征消去来构建基于这些关键基因的脓毒症识别模型。使用GSE185263数据集进行训练,使用GSE69063和GSE134347进行测试。通过分析受试者工作特征曲线(AUROC)下的面积,验证了该模型识别脓毒症的准确性。结果:13个基因最初被鉴定为关键基因,翻译成人类同源基因后,剩下10个基因。最优SVM-RFE模型包含了这些基因中的8个(CAMP、CD74、HLA-DQA1、HLA-DQB1、HLA-DMA、HLA-DRB5和LYZ)。在两个测试数据集中,该模型识别脓毒症的准确率AUROC值分别为0.904和0.924。结论:我们已经确定了几个关键基因,并开发了一种用于败血症鉴定的机器学习模型。需要进一步的研究来验证我们的发现。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Identification of key genes and development of an identifying machine learning model for sepsis.

Objective and design: This study aims to identify key genes of sepsis and construct a model for sepsis identification through integrated multi-organ single-cell RNA sequencing (scRNA-seq) and machine learning.

Material or subjects: Datasets downloaded from the Gene Expression Omnibus (GSE207363, GSE207651, GSE185263, GSE69063 and GSE134347) were used.

Methods: ScRNA-seq data extracted from heart (GSE207363) and lung tissues (GSE207651) of septic mice were processed and analyzed using the Seurat package in R. Key genes were identified as present in both heart and lung tissues, resulting from the overlap of three analyses along with differential expression analyses. We then used support vector machine recursive feature elimination to construct a model for sepsis identification based on these key genes. The GSE185263 dataset was used for training, while GSE69063 and GSE134347 were used for testing. The accuracy of the model in identifying of sepsis was validated by analyzing the area under the receiver operating characteristic curve (AUROC) using the test datasets.

Results: Thirteen genes were initially identified as key genes, and after translation to their human homologs, ten genes remained. The optimal SVM-RFE model incorporated eight of these genes (CAMP, CD74, HLA-DQA1, HLA-DQB1, HLA-DMA, HLA-DRB5, and LYZ). In the two test datasets, the AUROC value for the accuracy of the model in identifying of sepsis was 0.904 and 0.924, respectively.

Conclusions: We have identified several key genes and developed a machine learning model for sepsis identification. Further studies are needed to validate our findings.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Inflammation Research
Inflammation Research 医学-免疫学
CiteScore
9.90
自引率
1.50%
发文量
134
审稿时长
3-8 weeks
期刊介绍: Inflammation Research (IR) publishes peer-reviewed papers on all aspects of inflammation and related fields including histopathology, immunological mechanisms, gene expression, mediators, experimental models, clinical investigations and the effect of drugs. Related fields are broadly defined and include for instance, allergy and asthma, shock, pain, joint damage, skin disease as well as clinical trials of relevant drugs.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信