Jun He, Qingqing Hu, Sifeng Wang, Xiaohui Gong, Ni Gan, Li Huang, Haofeng Chen, Jie Dai, Hong Yu, Shuanglin Xiang, Xiangwen Peng
{"title":"Identifying Immune-related Molecular Biomarkers in Autism Spectrum Disorder Using Data-independent Acquisition Proteomics and Machine Learning.","authors":"Jun He, Qingqing Hu, Sifeng Wang, Xiaohui Gong, Ni Gan, Li Huang, Haofeng Chen, Jie Dai, Hong Yu, Shuanglin Xiang, Xiangwen Peng","doi":"10.3791/68949","DOIUrl":null,"url":null,"abstract":"<p><p>This study presents a reproducible protocol for identifying serum protein biomarkers associated with autism spectrum disorder (ASD) using data-independent acquisition (DIA) mass spectrometry combined with machine learning (ML). DIA enables unbiased, high-resolution profiling of the serum proteome, including low-abundance proteins, while ensuring reproducibility across samples. ML approaches were applied to select diagnostically informative protein panels and improve model robustness. The analysis included serum from 99 children with ASD and 70 age-matched controls. High-abundance proteins were depleted, peptides were prepared using standardized digestion and fractionation procedures, and DIA was performed on a high-resolution mass spectrometer. Data processing and quantification identified differentially expressed proteins, which underwent functional enrichment analysis. Eight immune-related proteins emerged as strong candidates for biomarker development. A logistic regression model trained on these proteins achieved 95.27% accuracy, a Kappa value of 0.9025, and an AUC of 1.000 in cross-validation. These findings demonstrate the potential of DIA-based proteomics, combined with machine learning, as a robust framework for biomarker discovery in ASD and for adaptation in broader clinical research.</p>","PeriodicalId":48787,"journal":{"name":"Jove-Journal of Visualized Experiments","volume":" 223","pages":""},"PeriodicalIF":1.2000,"publicationDate":"2025-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Jove-Journal of Visualized Experiments","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.3791/68949","RegionNum":4,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
This study presents a reproducible protocol for identifying serum protein biomarkers associated with autism spectrum disorder (ASD) using data-independent acquisition (DIA) mass spectrometry combined with machine learning (ML). DIA enables unbiased, high-resolution profiling of the serum proteome, including low-abundance proteins, while ensuring reproducibility across samples. ML approaches were applied to select diagnostically informative protein panels and improve model robustness. The analysis included serum from 99 children with ASD and 70 age-matched controls. High-abundance proteins were depleted, peptides were prepared using standardized digestion and fractionation procedures, and DIA was performed on a high-resolution mass spectrometer. Data processing and quantification identified differentially expressed proteins, which underwent functional enrichment analysis. Eight immune-related proteins emerged as strong candidates for biomarker development. A logistic regression model trained on these proteins achieved 95.27% accuracy, a Kappa value of 0.9025, and an AUC of 1.000 in cross-validation. These findings demonstrate the potential of DIA-based proteomics, combined with machine learning, as a robust framework for biomarker discovery in ASD and for adaptation in broader clinical research.
期刊介绍:
JoVE, the Journal of Visualized Experiments, is the world''s first peer reviewed scientific video journal. Established in 2006, JoVE is devoted to publishing scientific research in a visual format to help researchers overcome two of the biggest challenges facing the scientific research community today; poor reproducibility and the time and labor intensive nature of learning new experimental techniques.