{"title":"Structural characterization of length-varying peptide sequences for peptide quantitative structure-activity relationship.","authors":"Y Zhang, K Li, Y Gan, P Zhou","doi":"10.1080/1062936X.2025.2552141","DOIUrl":null,"url":null,"abstract":"<p><p>Peptide quantitative structure-activity relationship (pQSAR) has been widely used in the computational peptidology community to model, predict and explain the activity and function of bioactive peptides. Various amino acid descriptors (AADs) have been developed to characterize the residue building blocks of peptides at sequence level. However, a significant issue is that the total number of AAD-characterized descriptors is proportional to peptide length, thus causing inconsistency in the resulting descriptor vector matrix for a panel of length-varying peptide sequences (LVPSs), which cannot be engaged in pQSAR modelling. Currently, only one AAD-based scaling approach, termed auto-cross covariance (ACC) that was proposed thirty years ago, is available for treating such issue. In this study, we described the second AAD-based multivariate method to do so, namely Residue Descriptor-Distance Vector (RDDV). The strategy characterizes a peptide sequence by using an inter-residue pseudo-interaction potential between different pre-assigned amino acid types involved in the sequence, which results in a given (invariable) number of descriptor parameters for different LVPSs. Here, the RDDV was tested, examined and validated in an in-house pQSAR-oriented bioactive peptide data cluster, which was explored systematically with combinations of different AADs and regression tools. We also compared RDDV with the traditional ACC in multiple aspects.</p>","PeriodicalId":21446,"journal":{"name":"SAR and QSAR in Environmental Research","volume":"36 8","pages":"727-751"},"PeriodicalIF":2.3000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"SAR and QSAR in Environmental Research","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1080/1062936X.2025.2552141","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/9/10 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Peptide quantitative structure-activity relationship (pQSAR) has been widely used in the computational peptidology community to model, predict and explain the activity and function of bioactive peptides. Various amino acid descriptors (AADs) have been developed to characterize the residue building blocks of peptides at sequence level. However, a significant issue is that the total number of AAD-characterized descriptors is proportional to peptide length, thus causing inconsistency in the resulting descriptor vector matrix for a panel of length-varying peptide sequences (LVPSs), which cannot be engaged in pQSAR modelling. Currently, only one AAD-based scaling approach, termed auto-cross covariance (ACC) that was proposed thirty years ago, is available for treating such issue. In this study, we described the second AAD-based multivariate method to do so, namely Residue Descriptor-Distance Vector (RDDV). The strategy characterizes a peptide sequence by using an inter-residue pseudo-interaction potential between different pre-assigned amino acid types involved in the sequence, which results in a given (invariable) number of descriptor parameters for different LVPSs. Here, the RDDV was tested, examined and validated in an in-house pQSAR-oriented bioactive peptide data cluster, which was explored systematically with combinations of different AADs and regression tools. We also compared RDDV with the traditional ACC in multiple aspects.
期刊介绍:
SAR and QSAR in Environmental Research is an international journal welcoming papers on the fundamental and practical aspects of the structure-activity and structure-property relationships in the fields of environmental science, agrochemistry, toxicology, pharmacology and applied chemistry. A unique aspect of the journal is the focus on emerging techniques for the building of SAR and QSAR models in these widely varying fields. The scope of the journal includes, but is not limited to, the topics of topological and physicochemical descriptors, mathematical, statistical and graphical methods for data analysis, computer methods and programs, original applications and comparative studies. In addition to primary scientific papers, the journal contains reviews of books and software and news of conferences. Special issues on topics of current and widespread interest to the SAR and QSAR community will be published from time to time.