{"title":"Pointwise prediction of protein diffusive properties using machine learning.","authors":"Rasched Haidari, Achillefs N Kapanidis","doi":"10.1088/2515-7647/adede9","DOIUrl":null,"url":null,"abstract":"<p><p>The understanding of cellular mechanisms benefits substantially from accurate determination of protein diffusive properties. Prior work in this field primarily focuses on traditional methods, such as mean square displacements, for calculation of protein diffusion coefficients and biological states. This proves difficult and error-prone for proteins undergoing heterogeneous behaviour, particularly in complex environments, limiting the exploration of new biological behaviours. The importance of determining protein diffusion coefficients, anomalous exponents, and biological behaviours led to the Anomalous Diffusion Challenge 2024, exploring machine learning methods to infer these variables in heterogeneous trajectories with time-dependent changepoints. In response to the challenge, we present M3, a machine learning method for pointwise inference of diffusive coefficients, anomalous exponents, and states along noisy heterogenous protein trajectories. M3 makes use of long short-term memory cells to achieve small mean absolute errors for the diffusion coefficient and anomalous exponent alongside high state accuracies (>90%). Subsequently, we implement changepoint detection to determine timepoints at which protein behaviour changes. M3 removes the need for expert fine-tuning required in most conventional statistical methods while being computationally inexpensive to train. The model finished in the Top 5 of the Anomalous Diffusive Challenge 2024, with small improvements made since challenge closure.</p>","PeriodicalId":44008,"journal":{"name":"Journal of Physics-Photonics","volume":"7 3","pages":"035025"},"PeriodicalIF":4.6000,"publicationDate":"2025-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12269547/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Physics-Photonics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1088/2515-7647/adede9","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/7/17 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"OPTICS","Score":null,"Total":0}
引用次数: 0
Abstract
The understanding of cellular mechanisms benefits substantially from accurate determination of protein diffusive properties. Prior work in this field primarily focuses on traditional methods, such as mean square displacements, for calculation of protein diffusion coefficients and biological states. This proves difficult and error-prone for proteins undergoing heterogeneous behaviour, particularly in complex environments, limiting the exploration of new biological behaviours. The importance of determining protein diffusion coefficients, anomalous exponents, and biological behaviours led to the Anomalous Diffusion Challenge 2024, exploring machine learning methods to infer these variables in heterogeneous trajectories with time-dependent changepoints. In response to the challenge, we present M3, a machine learning method for pointwise inference of diffusive coefficients, anomalous exponents, and states along noisy heterogenous protein trajectories. M3 makes use of long short-term memory cells to achieve small mean absolute errors for the diffusion coefficient and anomalous exponent alongside high state accuracies (>90%). Subsequently, we implement changepoint detection to determine timepoints at which protein behaviour changes. M3 removes the need for expert fine-tuning required in most conventional statistical methods while being computationally inexpensive to train. The model finished in the Top 5 of the Anomalous Diffusive Challenge 2024, with small improvements made since challenge closure.