Joshua Pillai , Sophia Liu , Kijung Sung , Linda Shi , Chengbiao Wu
{"title":"Benchmarking AlphaMissense pathogenicity predictions against APP, PSEN1, and PSEN2 variants of unknown significance","authors":"Joshua Pillai , Sophia Liu , Kijung Sung , Linda Shi , Chengbiao Wu","doi":"10.1016/j.bbrep.2025.102049","DOIUrl":null,"url":null,"abstract":"<div><div>Alzheimer's disease (AD) is a progressive neurodegenerative disease characterized by progressive cognitive decline. Over 200 pathogenic mutations in <em>amyloid-β precursor protein (APP)</em>, <em>presenilin-1</em> (<em>PSEN1</em>), and <em>presenilin-2</em> (<em>PSEN2</em>), have been implicated in AD. Yet, many rare and common variants have not been completely classified as protective or benign, risk-modifiers, or pathogenic, which is important for research on the disease mechanisms and discovery of treatment methods. The majority of these variants are missense mutations, and there is an active need for computational approaches to accurately predict their molecular consequences. AlphaMissense (AM) is a novel technology that uses population frequency data along with structural and sequential contexts from AlphaFold to predict the pathogenicity of missense mutations. Herein, we sought to evaluate the capabilities of AM on 114 variants of unknown significance (VUS), including 56 missense variants of <em>PSEN1</em>, 25 of <em>APP</em>, and 33 of <em>PSEN2</em> by benchmarking its prediction against their respective Aβ isoform levels <em>in vitro</em>, respectively. We found that the AM scores correlated moderately well with the critical Aβ42/Aβ40 biomarker and Aβ40 levels in the transmembrane proteins compared to weaker correlations in traditional approaches, including Combined Annotation Dependent Depletion (CADD) v1.7, evolutionary model of variant effect (EVE), and Evolutionary Scale Modeling-1b (ESM-1B). Yet, there were non-significant correlations identified with Aβ42 levels in all models. Furthermore, we found that AM does not rely completely on structural contexts from AlphaFold2, as it accurately predicted the effects of known variants on residues with a low predicted local distance difference test (pLDDT) score. Additionally, based on the receiver operating characteristic-area under the curve analysis (ROC-AUC), we found that AM retained a high performance on 263 validated variants of these amyloidogenic genes, and performed the greatest compared to other models for the 114 VUS. We believe this is the first study to provide comprehensive characterization and validation of AM in comparison to the widely utilized pathogenicity scoring models for VUS involved in proteins implicated in AD.</div></div>","PeriodicalId":8771,"journal":{"name":"Biochemistry and Biophysics Reports","volume":"42 ","pages":"Article 102049"},"PeriodicalIF":2.3000,"publicationDate":"2025-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biochemistry and Biophysics Reports","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2405580825001360","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Alzheimer's disease (AD) is a progressive neurodegenerative disease characterized by progressive cognitive decline. Over 200 pathogenic mutations in amyloid-β precursor protein (APP), presenilin-1 (PSEN1), and presenilin-2 (PSEN2), have been implicated in AD. Yet, many rare and common variants have not been completely classified as protective or benign, risk-modifiers, or pathogenic, which is important for research on the disease mechanisms and discovery of treatment methods. The majority of these variants are missense mutations, and there is an active need for computational approaches to accurately predict their molecular consequences. AlphaMissense (AM) is a novel technology that uses population frequency data along with structural and sequential contexts from AlphaFold to predict the pathogenicity of missense mutations. Herein, we sought to evaluate the capabilities of AM on 114 variants of unknown significance (VUS), including 56 missense variants of PSEN1, 25 of APP, and 33 of PSEN2 by benchmarking its prediction against their respective Aβ isoform levels in vitro, respectively. We found that the AM scores correlated moderately well with the critical Aβ42/Aβ40 biomarker and Aβ40 levels in the transmembrane proteins compared to weaker correlations in traditional approaches, including Combined Annotation Dependent Depletion (CADD) v1.7, evolutionary model of variant effect (EVE), and Evolutionary Scale Modeling-1b (ESM-1B). Yet, there were non-significant correlations identified with Aβ42 levels in all models. Furthermore, we found that AM does not rely completely on structural contexts from AlphaFold2, as it accurately predicted the effects of known variants on residues with a low predicted local distance difference test (pLDDT) score. Additionally, based on the receiver operating characteristic-area under the curve analysis (ROC-AUC), we found that AM retained a high performance on 263 validated variants of these amyloidogenic genes, and performed the greatest compared to other models for the 114 VUS. We believe this is the first study to provide comprehensive characterization and validation of AM in comparison to the widely utilized pathogenicity scoring models for VUS involved in proteins implicated in AD.
期刊介绍:
Open access, online only, peer-reviewed international journal in the Life Sciences, established in 2014 Biochemistry and Biophysics Reports (BB Reports) publishes original research in all aspects of Biochemistry, Biophysics and related areas like Molecular and Cell Biology. BB Reports welcomes solid though more preliminary, descriptive and small scale results if they have the potential to stimulate and/or contribute to future research, leading to new insights or hypothesis. Primary criteria for acceptance is that the work is original, scientifically and technically sound and provides valuable knowledge to life sciences research. We strongly believe all results deserve to be published and documented for the advancement of science. BB Reports specifically appreciates receiving reports on: Negative results, Replication studies, Reanalysis of previous datasets.