{"title":"[Construction and discussion of risk prediction model for allergic asthma in children].","authors":"J Fan, J M Xu, C H Zhu, H Wang","doi":"10.3760/cma.j.cn112150-20241014-00812","DOIUrl":null,"url":null,"abstract":"<p><p>A prediction model for the risk of childhood allergic asthma was established through the analysis of public datasets. By using bioinformatics analysis methods, two datasets, GSE40732 and GSE40888, were selected, which included the whole-genome expression profile data of 222 children. Among them, GSE40732 was used as the training dataset to detect differentially expressed genes in peripheral blood mononuclear cells of children with the disease, and the master regulator analysis (MRA) algorithm was used to screen the master regulator genes in the inflammation-related pathway (GO: 0006954). After obtaining the master regulator genes, the expression of these master regulator genes in the GSE40732 and GSE40888 datasets was detected, and a prediction model was constructed through logistic regression, based on which risk scores were assigned to children. By comparing the risk scores of healthy children and children with the disease, the area under the curve (AUC) was used to evaluate the classification performance of the model. The average value of the risk scores of all children with the disease output by the model was calculated as the threshold. According to this threshold, the children with the disease in the two datasets were divided into high-risk and low-risk groups. The CIBERSORT algorithm was applied to analyze the infiltration of immune cells in the high-risk and low-risk groups, and the enrichment analysis of signaling pathways was completed using the msigdbr package in R software. The results showed that compared with healthy children, there were 377 up-regulated genes and 255 down-regulated genes in the peripheral blood mono-nuclear cells of children with the disease. The MRA algorithm analysis showed that there were five genes (<i>MUC5B</i>, <i>CST4</i>, <i>CCR7</i>, <i>TNF-α</i>, and <i>THBS1</i>) that were the master regulator genes in the regulatory network. Risk score=<i>MUC5B</i>×3.47<i>+CST4</i>×2.17<i>+CCR7</i>×0.59<i>+TNF</i>-<i>α</i>×0.54<i>+THBS1</i>×1.67. The AUC in the GSE40732 and GSE40888 datasets were 0.874 and 0.682, respectively. Compared with the low-risk group, the resting memory CD4<sup>+</sup>T cells and regulatory T cells in the peripheral blood of children with the disease in the high-risk group significantly decreased (<i>P</i><0.05), and both the IL-33 and IL-13 pathways were highly enriched. In conclusion, the model constructed in this study has a good predictive efficiency for the risk of allergic asthma and also has a certain effect on risk stratification.</p>","PeriodicalId":24033,"journal":{"name":"中华预防医学杂志","volume":"59 6","pages":"864-871"},"PeriodicalIF":0.0000,"publicationDate":"2025-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"中华预防医学杂志","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3760/cma.j.cn112150-20241014-00812","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0
Abstract
A prediction model for the risk of childhood allergic asthma was established through the analysis of public datasets. By using bioinformatics analysis methods, two datasets, GSE40732 and GSE40888, were selected, which included the whole-genome expression profile data of 222 children. Among them, GSE40732 was used as the training dataset to detect differentially expressed genes in peripheral blood mononuclear cells of children with the disease, and the master regulator analysis (MRA) algorithm was used to screen the master regulator genes in the inflammation-related pathway (GO: 0006954). After obtaining the master regulator genes, the expression of these master regulator genes in the GSE40732 and GSE40888 datasets was detected, and a prediction model was constructed through logistic regression, based on which risk scores were assigned to children. By comparing the risk scores of healthy children and children with the disease, the area under the curve (AUC) was used to evaluate the classification performance of the model. The average value of the risk scores of all children with the disease output by the model was calculated as the threshold. According to this threshold, the children with the disease in the two datasets were divided into high-risk and low-risk groups. The CIBERSORT algorithm was applied to analyze the infiltration of immune cells in the high-risk and low-risk groups, and the enrichment analysis of signaling pathways was completed using the msigdbr package in R software. The results showed that compared with healthy children, there were 377 up-regulated genes and 255 down-regulated genes in the peripheral blood mono-nuclear cells of children with the disease. The MRA algorithm analysis showed that there were five genes (MUC5B, CST4, CCR7, TNF-α, and THBS1) that were the master regulator genes in the regulatory network. Risk score=MUC5B×3.47+CST4×2.17+CCR7×0.59+TNF-α×0.54+THBS1×1.67. The AUC in the GSE40732 and GSE40888 datasets were 0.874 and 0.682, respectively. Compared with the low-risk group, the resting memory CD4+T cells and regulatory T cells in the peripheral blood of children with the disease in the high-risk group significantly decreased (P<0.05), and both the IL-33 and IL-13 pathways were highly enriched. In conclusion, the model constructed in this study has a good predictive efficiency for the risk of allergic asthma and also has a certain effect on risk stratification.
期刊介绍:
Chinese Journal of Preventive Medicine (CJPM), the successor to Chinese Health Journal , was initiated on October 1, 1953. In 1960, it was amalgamated with the Chinese Medical Journal and the Journal of Medical History and Health Care , and thereafter, was renamed as People’s Care . On November 25, 1978, the publication was denominated as Chinese Journal of Preventive Medicine . The contents of CJPM deal with a wide range of disciplines and technologies including epidemiology, environmental health, nutrition and food hygiene, occupational health, hygiene for children and adolescents, radiological health, toxicology, biostatistics, social medicine, pathogenic and epidemiological research in malignant tumor, surveillance and immunization.