复发性小儿胶质母细胞瘤的细胞异质性特征:机器学习增强型单细胞RNA-Seq揭示调控特征

IF 0.5 Q4 GENETICS & HEREDITY
Shikha Suman , Anurag Kulshrestha
{"title":"复发性小儿胶质母细胞瘤的细胞异质性特征:机器学习增强型单细胞RNA-Seq揭示调控特征","authors":"Shikha Suman ,&nbsp;Anurag Kulshrestha","doi":"10.1016/j.humgen.2024.201300","DOIUrl":null,"url":null,"abstract":"<div><p>The most common oncologic cause of mortality in children is pediatric glioblastoma, an extremely dangerous brain tumor. The tumor progress is almost inevitable and recurs after first-line standard care. Because surgical resection is often more effective when tumors are localized and smaller, early identification and action may be essential to assure favourable outcomes for the recurring disease. This study aims to employ single-cell RNA-Sequencing data (scRNA-Seq data) for clustering and explainable Artificial intelligence framework to find gene biomarkers and signature cell types for the diagnosis and prognosis of reoccurring pediatric glioblastoma. Distinct cell types and statistically significant DEGs were found using scRNA-Seq data retrieved from the Gene Expression Omnibus database. Random forest (RF) and extreme gradient boosting (XGBoost) machine learning (ML) classifiers were constructed to select genes significantly contributing to the disease using Shapley (SHAP) values, an explainable artificial intelligence (EAI) framework. Potential biomarkers were chosen based on the shared genes among statistically discovered DEGs and SHAP-based relevance. B cells, macrophages, CD8+ T cells, T cells, and NK cells were identified as distinct cell types, which played an essential role in disease recurrence. Also, five significant genes, namely HMGB2, H2AFZ, HIST1H4C, KIAA0101, and DUT, were screened and in silico validated through survival analysis and feature plot, hence, proposed as biomarkers for recurring pediatric glioblastoma. Utilising these five genes may improve disease prognosis and provide a crucial understanding of the molecular causes of recurrent pediatric glioblastoma.</p></div>","PeriodicalId":29686,"journal":{"name":"Human Gene","volume":"41 ","pages":"Article 201300"},"PeriodicalIF":0.5000,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Characterization of cellular heterogeneity in recurrent pediatric glioblastoma: Machine learning-enhanced single-cell RNA-Seq unveils regulatory signatures\",\"authors\":\"Shikha Suman ,&nbsp;Anurag Kulshrestha\",\"doi\":\"10.1016/j.humgen.2024.201300\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The most common oncologic cause of mortality in children is pediatric glioblastoma, an extremely dangerous brain tumor. The tumor progress is almost inevitable and recurs after first-line standard care. Because surgical resection is often more effective when tumors are localized and smaller, early identification and action may be essential to assure favourable outcomes for the recurring disease. This study aims to employ single-cell RNA-Sequencing data (scRNA-Seq data) for clustering and explainable Artificial intelligence framework to find gene biomarkers and signature cell types for the diagnosis and prognosis of reoccurring pediatric glioblastoma. Distinct cell types and statistically significant DEGs were found using scRNA-Seq data retrieved from the Gene Expression Omnibus database. Random forest (RF) and extreme gradient boosting (XGBoost) machine learning (ML) classifiers were constructed to select genes significantly contributing to the disease using Shapley (SHAP) values, an explainable artificial intelligence (EAI) framework. Potential biomarkers were chosen based on the shared genes among statistically discovered DEGs and SHAP-based relevance. B cells, macrophages, CD8+ T cells, T cells, and NK cells were identified as distinct cell types, which played an essential role in disease recurrence. Also, five significant genes, namely HMGB2, H2AFZ, HIST1H4C, KIAA0101, and DUT, were screened and in silico validated through survival analysis and feature plot, hence, proposed as biomarkers for recurring pediatric glioblastoma. Utilising these five genes may improve disease prognosis and provide a crucial understanding of the molecular causes of recurrent pediatric glioblastoma.</p></div>\",\"PeriodicalId\":29686,\"journal\":{\"name\":\"Human Gene\",\"volume\":\"41 \",\"pages\":\"Article 201300\"},\"PeriodicalIF\":0.5000,\"publicationDate\":\"2024-05-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Human Gene\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2773044124000445\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"GENETICS & HEREDITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Human Gene","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2773044124000445","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

摘要

导致儿童死亡的最常见肿瘤原因是小儿胶质母细胞瘤,这是一种极其危险的脑肿瘤。肿瘤几乎不可避免地会发展,并在一线标准治疗后复发。由于手术切除通常在肿瘤局部较小的情况下更为有效,因此早期识别和采取行动对于确保复发疾病的良好治疗效果至关重要。本研究旨在利用单细胞RNA测序数据(scRNA-Seq数据)进行聚类和可解释人工智能框架,寻找基因生物标记物和特征细胞类型,用于诊断和预后复发的小儿胶质母细胞瘤。利用从基因表达总库(Gene Expression Omnibus)数据库中检索到的 scRNA-Seq 数据,发现了不同的细胞类型和具有统计学意义的 DEGs。利用可解释人工智能(EAI)框架沙普利(SHAP)值构建了随机森林(RF)和极端梯度提升(XGBoost)机器学习(ML)分类器,以选择对疾病有显著贡献的基因。根据统计发现的 DEGs 中的共享基因和基于 SHAP 的相关性选择潜在的生物标记物。B细胞、巨噬细胞、CD8+ T细胞、T细胞和NK细胞被确定为不同的细胞类型,它们在疾病复发中起着至关重要的作用。此外,研究人员还筛选出了五个重要基因,即HMGB2、H2AFZ、HIST1H4C、KIAA0101和DUT,并通过生存分析和特征图进行了硅验证,因此提出将这五个基因作为小儿胶质母细胞瘤复发的生物标记物。利用这五个基因可改善疾病预后,并为了解复发性小儿胶质母细胞瘤的分子原因提供重要依据。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Characterization of cellular heterogeneity in recurrent pediatric glioblastoma: Machine learning-enhanced single-cell RNA-Seq unveils regulatory signatures

The most common oncologic cause of mortality in children is pediatric glioblastoma, an extremely dangerous brain tumor. The tumor progress is almost inevitable and recurs after first-line standard care. Because surgical resection is often more effective when tumors are localized and smaller, early identification and action may be essential to assure favourable outcomes for the recurring disease. This study aims to employ single-cell RNA-Sequencing data (scRNA-Seq data) for clustering and explainable Artificial intelligence framework to find gene biomarkers and signature cell types for the diagnosis and prognosis of reoccurring pediatric glioblastoma. Distinct cell types and statistically significant DEGs were found using scRNA-Seq data retrieved from the Gene Expression Omnibus database. Random forest (RF) and extreme gradient boosting (XGBoost) machine learning (ML) classifiers were constructed to select genes significantly contributing to the disease using Shapley (SHAP) values, an explainable artificial intelligence (EAI) framework. Potential biomarkers were chosen based on the shared genes among statistically discovered DEGs and SHAP-based relevance. B cells, macrophages, CD8+ T cells, T cells, and NK cells were identified as distinct cell types, which played an essential role in disease recurrence. Also, five significant genes, namely HMGB2, H2AFZ, HIST1H4C, KIAA0101, and DUT, were screened and in silico validated through survival analysis and feature plot, hence, proposed as biomarkers for recurring pediatric glioblastoma. Utilising these five genes may improve disease prognosis and provide a crucial understanding of the molecular causes of recurrent pediatric glioblastoma.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Human Gene
Human Gene Biochemistry, Genetics and Molecular Biology (General), Genetics
CiteScore
1.60
自引率
0.00%
发文量
0
审稿时长
54 days
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信