Ulices Que-Salinas, Dulce Martinez-Peon, Gerardo Maximiliano Mendez, P Argüelles-Lucho, Angel D Reyes-Figueroa, Christian Quintus Scheckhuber
{"title":"Addressing the Problem of Lysine Glycation Prediction in Proteins via Recurrent Neural Networks.","authors":"Ulices Que-Salinas, Dulce Martinez-Peon, Gerardo Maximiliano Mendez, P Argüelles-Lucho, Angel D Reyes-Figueroa, Christian Quintus Scheckhuber","doi":"10.1155/bmri/2426944","DOIUrl":null,"url":null,"abstract":"<p><p>A distinguishing feature of the metabolic disorder diabetes involves elevated damage to cellular components. Glycation, in contrast to glycosylation, is regarded as a strictly nonenzymatic process that involves the reaction of sugars (e.g., glucose and fructose) and sugar-derived molecules (e.g., methylglyoxal) with amino groups of biologically highly relevant molecules, such as nucleic acids, lipids, and proteins. The primary form of alteration arises from the chemical interaction between glycating agents and proteinaceous arginine/cysteine/lysine residues. Glycation may result in the formation of advanced glycation end-products (AGEs) which are mostly detrimental and compromise the function of the target molecule irreversibly. There are no clear sequence motifs in proteins that allow a straightforward identification of potential glycation sites. However, the physicochemical properties of amino acids that flank the glycated residue seem to play a key role in determining if glycation occurs or not. Here, we used a curated version of the CPLM database to implement a recurrent neural network strategy for the classification of lysine glycation to better understand which of eight physicochemical properties might influence glycation more than others. By using the most promising property for the characterization of amino acids next to lysine sites, isoelectric point, it was possible to obtain a 59.6% accuracy for correctly predicting lysine glycation. When the properties mass and torsion angle were used together, the accuracy increased to approximately 60%. Overall, our approach contributes to the understanding of glycation principles and can aid the task of narrowing down possible sites of lysine glycation in protein targets for further analysis.</p>","PeriodicalId":9007,"journal":{"name":"BioMed Research International","volume":"2025 ","pages":"2426944"},"PeriodicalIF":2.3000,"publicationDate":"2025-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12436683/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BioMed Research International","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1155/bmri/2426944","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q3","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
A distinguishing feature of the metabolic disorder diabetes involves elevated damage to cellular components. Glycation, in contrast to glycosylation, is regarded as a strictly nonenzymatic process that involves the reaction of sugars (e.g., glucose and fructose) and sugar-derived molecules (e.g., methylglyoxal) with amino groups of biologically highly relevant molecules, such as nucleic acids, lipids, and proteins. The primary form of alteration arises from the chemical interaction between glycating agents and proteinaceous arginine/cysteine/lysine residues. Glycation may result in the formation of advanced glycation end-products (AGEs) which are mostly detrimental and compromise the function of the target molecule irreversibly. There are no clear sequence motifs in proteins that allow a straightforward identification of potential glycation sites. However, the physicochemical properties of amino acids that flank the glycated residue seem to play a key role in determining if glycation occurs or not. Here, we used a curated version of the CPLM database to implement a recurrent neural network strategy for the classification of lysine glycation to better understand which of eight physicochemical properties might influence glycation more than others. By using the most promising property for the characterization of amino acids next to lysine sites, isoelectric point, it was possible to obtain a 59.6% accuracy for correctly predicting lysine glycation. When the properties mass and torsion angle were used together, the accuracy increased to approximately 60%. Overall, our approach contributes to the understanding of glycation principles and can aid the task of narrowing down possible sites of lysine glycation in protein targets for further analysis.
期刊介绍:
BioMed Research International is a peer-reviewed, Open Access journal that publishes original research articles, review articles, and clinical studies covering a wide range of subjects in life sciences and medicine. The journal is divided into 55 subject areas.