[Construction and discussion of risk prediction model for allergic asthma in children].

Q3 Medicine
J Fan, J M Xu, C H Zhu, H Wang
{"title":"[Construction and discussion of risk prediction model for allergic asthma in children].","authors":"J Fan, J M Xu, C H Zhu, H Wang","doi":"10.3760/cma.j.cn112150-20241014-00812","DOIUrl":null,"url":null,"abstract":"<p><p>A prediction model for the risk of childhood allergic asthma was established through the analysis of public datasets. By using bioinformatics analysis methods, two datasets, GSE40732 and GSE40888, were selected, which included the whole-genome expression profile data of 222 children. Among them, GSE40732 was used as the training dataset to detect differentially expressed genes in peripheral blood mononuclear cells of children with the disease, and the master regulator analysis (MRA) algorithm was used to screen the master regulator genes in the inflammation-related pathway (GO: 0006954). After obtaining the master regulator genes, the expression of these master regulator genes in the GSE40732 and GSE40888 datasets was detected, and a prediction model was constructed through logistic regression, based on which risk scores were assigned to children. By comparing the risk scores of healthy children and children with the disease, the area under the curve (AUC) was used to evaluate the classification performance of the model. The average value of the risk scores of all children with the disease output by the model was calculated as the threshold. According to this threshold, the children with the disease in the two datasets were divided into high-risk and low-risk groups. The CIBERSORT algorithm was applied to analyze the infiltration of immune cells in the high-risk and low-risk groups, and the enrichment analysis of signaling pathways was completed using the msigdbr package in R software. The results showed that compared with healthy children, there were 377 up-regulated genes and 255 down-regulated genes in the peripheral blood mono-nuclear cells of children with the disease. The MRA algorithm analysis showed that there were five genes (<i>MUC5B</i>, <i>CST4</i>, <i>CCR7</i>, <i>TNF-α</i>, and <i>THBS1</i>) that were the master regulator genes in the regulatory network. Risk score=<i>MUC5B</i>×3.47<i>+CST4</i>×2.17<i>+CCR7</i>×0.59<i>+TNF</i>-<i>α</i>×0.54<i>+THBS1</i>×1.67. The AUC in the GSE40732 and GSE40888 datasets were 0.874 and 0.682, respectively. Compared with the low-risk group, the resting memory CD4<sup>+</sup>T cells and regulatory T cells in the peripheral blood of children with the disease in the high-risk group significantly decreased (<i>P</i><0.05), and both the IL-33 and IL-13 pathways were highly enriched. In conclusion, the model constructed in this study has a good predictive efficiency for the risk of allergic asthma and also has a certain effect on risk stratification.</p>","PeriodicalId":24033,"journal":{"name":"中华预防医学杂志","volume":"59 6","pages":"864-871"},"PeriodicalIF":0.0000,"publicationDate":"2025-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"中华预防医学杂志","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3760/cma.j.cn112150-20241014-00812","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0

Abstract

A prediction model for the risk of childhood allergic asthma was established through the analysis of public datasets. By using bioinformatics analysis methods, two datasets, GSE40732 and GSE40888, were selected, which included the whole-genome expression profile data of 222 children. Among them, GSE40732 was used as the training dataset to detect differentially expressed genes in peripheral blood mononuclear cells of children with the disease, and the master regulator analysis (MRA) algorithm was used to screen the master regulator genes in the inflammation-related pathway (GO: 0006954). After obtaining the master regulator genes, the expression of these master regulator genes in the GSE40732 and GSE40888 datasets was detected, and a prediction model was constructed through logistic regression, based on which risk scores were assigned to children. By comparing the risk scores of healthy children and children with the disease, the area under the curve (AUC) was used to evaluate the classification performance of the model. The average value of the risk scores of all children with the disease output by the model was calculated as the threshold. According to this threshold, the children with the disease in the two datasets were divided into high-risk and low-risk groups. The CIBERSORT algorithm was applied to analyze the infiltration of immune cells in the high-risk and low-risk groups, and the enrichment analysis of signaling pathways was completed using the msigdbr package in R software. The results showed that compared with healthy children, there were 377 up-regulated genes and 255 down-regulated genes in the peripheral blood mono-nuclear cells of children with the disease. The MRA algorithm analysis showed that there were five genes (MUC5B, CST4, CCR7, TNF-α, and THBS1) that were the master regulator genes in the regulatory network. Risk score=MUC5B×3.47+CST4×2.17+CCR7×0.59+TNF-α×0.54+THBS1×1.67. The AUC in the GSE40732 and GSE40888 datasets were 0.874 and 0.682, respectively. Compared with the low-risk group, the resting memory CD4+T cells and regulatory T cells in the peripheral blood of children with the disease in the high-risk group significantly decreased (P<0.05), and both the IL-33 and IL-13 pathways were highly enriched. In conclusion, the model constructed in this study has a good predictive efficiency for the risk of allergic asthma and also has a certain effect on risk stratification.

[儿童过敏性哮喘风险预测模型的构建与探讨]。
通过对公共数据集的分析,建立儿童过敏性哮喘风险预测模型。采用生物信息学分析方法,选择GSE40732和GSE40888两个数据集,包含222例儿童的全基因组表达谱数据。其中,以GSE40732作为训练数据集检测患儿外周血单核细胞差异表达基因,并采用主调控分析(MRA)算法筛选炎症相关通路中的主调控基因(GO: 0006954)。获取主调控基因后,检测这些主调控基因在GSE40732和GSE40888数据集中的表达情况,并通过logistic回归构建预测模型,根据预测模型对儿童进行风险评分。通过比较健康儿童和患病儿童的风险评分,采用曲线下面积(area under the curve, AUC)来评价模型的分类效果。将模型输出的所有患病儿童的风险评分的平均值作为阈值。根据这一阈值,将两个数据集中的患病儿童分为高危组和低危组。采用CIBERSORT算法分析高危组和低危组免疫细胞浸润情况,利用R软件中的msigdbr包完成信号通路富集分析。结果显示,与健康儿童相比,患儿外周血单核细胞中有377个上调基因,255个下调基因。MRA算法分析显示,在该调控网络中有MUC5B、CST4、CCR7、TNF-α和THBS1 5个基因是主调控基因。风险评分= MUC5B×3.47 + CST4 CCR7××2.17 + 0.59 + TNF -α+ THBS1×0.54×1.67。GSE40732和GSE40888数据集的AUC分别为0.874和0.682。与低危组比较,高危组患儿外周血静息记忆性CD4+T细胞和调节性T细胞水平明显降低(P
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
中华预防医学杂志
中华预防医学杂志 Medicine-Medicine (all)
CiteScore
1.20
自引率
0.00%
发文量
12678
期刊介绍: Chinese Journal of Preventive Medicine (CJPM), the successor to Chinese Health Journal , was initiated on October 1, 1953. In 1960, it was amalgamated with the Chinese Medical Journal and the Journal of Medical History and Health Care , and thereafter, was renamed as People’s Care . On November 25, 1978, the publication was denominated as Chinese Journal of Preventive Medicine . The contents of CJPM deal with a wide range of disciplines and technologies including epidemiology, environmental health, nutrition and food hygiene, occupational health, hygiene for children and adolescents, radiological health, toxicology, biostatistics, social medicine, pathogenic and epidemiological research in malignant tumor, surveillance and immunization.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信