Leonardo Cobuccio, Arnor I Sigurdsson, Kajsa-Lotta Georgii Hellberg, Morten Dybdahl Krebs, Jonas Meisner, Thomas Werge, Michael E Benros, Andrew J Schork, Simon Rasmussen
{"title":"基于深度学习的多基因评分提高了精神疾病预测的通用性。","authors":"Leonardo Cobuccio, Arnor I Sigurdsson, Kajsa-Lotta Georgii Hellberg, Morten Dybdahl Krebs, Jonas Meisner, Thomas Werge, Michael E Benros, Andrew J Schork, Simon Rasmussen","doi":"10.1101/2025.05.05.25326794","DOIUrl":null,"url":null,"abstract":"<p><p>Polygenic scores (PGSs) have emerged as promising tools for predicting complex traits from genetic data, however, their predictive performance for psychiatric disorders remains limited and the added value of deep learning (DL) over linear models is underexplored. In this study, we compared our DL model, Genome-Local-Net (GLN), with the linear model bigstatsr in predicting five psychiatric disorders-ADHD, ASD, BIP, MDD, and SCZ-using individual-level genotype data. We further assessed whether combining these <i>internal</i> (individual-based) PGSs with <i>external</i> (GWAS-derived) PGSs and family genetic risk scores (FGRSs) could improve prediction additively or synergistically. While GLN and bigstatsr performed similarly in-sample, GLN showed better generalization on an out-of-sample replication set for ADHD, ASD, and MDD, with an average AUROC gain of 0.026. Integrating <i>internal</i>, <i>external</i>, and family-based scores significantly improved ADHD prediction, though DL-based integration provided no consistent advantage over logistic models. These findings suggest that while DL may enhance generalizability for specific psychiatric traits, linear models remain competitive and effective for genetic risk prediction.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12083566/pdf/","citationCount":"0","resultStr":"{\"title\":\"Deep learning-based polygenic scores enhance generalizability of psychiatric disorders prediction.\",\"authors\":\"Leonardo Cobuccio, Arnor I Sigurdsson, Kajsa-Lotta Georgii Hellberg, Morten Dybdahl Krebs, Jonas Meisner, Thomas Werge, Michael E Benros, Andrew J Schork, Simon Rasmussen\",\"doi\":\"10.1101/2025.05.05.25326794\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Polygenic scores (PGSs) have emerged as promising tools for predicting complex traits from genetic data, however, their predictive performance for psychiatric disorders remains limited and the added value of deep learning (DL) over linear models is underexplored. In this study, we compared our DL model, Genome-Local-Net (GLN), with the linear model bigstatsr in predicting five psychiatric disorders-ADHD, ASD, BIP, MDD, and SCZ-using individual-level genotype data. We further assessed whether combining these <i>internal</i> (individual-based) PGSs with <i>external</i> (GWAS-derived) PGSs and family genetic risk scores (FGRSs) could improve prediction additively or synergistically. While GLN and bigstatsr performed similarly in-sample, GLN showed better generalization on an out-of-sample replication set for ADHD, ASD, and MDD, with an average AUROC gain of 0.026. Integrating <i>internal</i>, <i>external</i>, and family-based scores significantly improved ADHD prediction, though DL-based integration provided no consistent advantage over logistic models. These findings suggest that while DL may enhance generalizability for specific psychiatric traits, linear models remain competitive and effective for genetic risk prediction.</p>\",\"PeriodicalId\":94281,\"journal\":{\"name\":\"medRxiv : the preprint server for health sciences\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-05-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12083566/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"medRxiv : the preprint server for health sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1101/2025.05.05.25326794\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv : the preprint server for health sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2025.05.05.25326794","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Deep learning-based polygenic scores enhance generalizability of psychiatric disorders prediction.
Polygenic scores (PGSs) have emerged as promising tools for predicting complex traits from genetic data, however, their predictive performance for psychiatric disorders remains limited and the added value of deep learning (DL) over linear models is underexplored. In this study, we compared our DL model, Genome-Local-Net (GLN), with the linear model bigstatsr in predicting five psychiatric disorders-ADHD, ASD, BIP, MDD, and SCZ-using individual-level genotype data. We further assessed whether combining these internal (individual-based) PGSs with external (GWAS-derived) PGSs and family genetic risk scores (FGRSs) could improve prediction additively or synergistically. While GLN and bigstatsr performed similarly in-sample, GLN showed better generalization on an out-of-sample replication set for ADHD, ASD, and MDD, with an average AUROC gain of 0.026. Integrating internal, external, and family-based scores significantly improved ADHD prediction, though DL-based integration provided no consistent advantage over logistic models. These findings suggest that while DL may enhance generalizability for specific psychiatric traits, linear models remain competitive and effective for genetic risk prediction.