{"title":"AI-guided discovery and optimization of antimicrobial peptides through species-aware language model.","authors":"Daehun Bae, Minsang Kim, Jiwon Seo, Hojung Nam","doi":"10.1093/bib/bbaf343","DOIUrl":null,"url":null,"abstract":"<p><p>The rise of antibiotic-resistant bacteria drives an urgent need for novel antimicrobial agents. Antimicrobial peptides (AMPs) show promise solutions due to their multiple mechanisms of action and reduced propensity for resistance development. This study introduces LLAMP (Large Language model for AMP activity prediction), a target species-aware AI model that leverages pre-trained language models to predict minimum inhibitory concentration values of AMPs. Using LLAMP, we screened approximately 5.5 million peptide sequences, identifying peptides 13 and 16 as the most selective and most potent candidates, respectively. Analysis of attention values allowed us to pinpoint critical amino acid residues (e.g., Trp, Lys, and Phe). Using the critical amino acids, the sequence of the most selective peptide 13 was engineered to increase amphipathicity through targeted modifications, yielding peptide 13-5 with an overall enhancement in antimicrobial activity but a reduction in selectively. Notably, peptides 13-5 and 16 demonstrated antimicrobial potency and selectivity comparable to the clinically investigated AMP pexiganan. Our work demonstrates the potential of AI to expedite the discovery of peptide-based antibiotics to combat antibiotic resistance.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":"26 4","pages":""},"PeriodicalIF":6.8000,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12271573/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Briefings in bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/bib/bbaf343","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
The rise of antibiotic-resistant bacteria drives an urgent need for novel antimicrobial agents. Antimicrobial peptides (AMPs) show promise solutions due to their multiple mechanisms of action and reduced propensity for resistance development. This study introduces LLAMP (Large Language model for AMP activity prediction), a target species-aware AI model that leverages pre-trained language models to predict minimum inhibitory concentration values of AMPs. Using LLAMP, we screened approximately 5.5 million peptide sequences, identifying peptides 13 and 16 as the most selective and most potent candidates, respectively. Analysis of attention values allowed us to pinpoint critical amino acid residues (e.g., Trp, Lys, and Phe). Using the critical amino acids, the sequence of the most selective peptide 13 was engineered to increase amphipathicity through targeted modifications, yielding peptide 13-5 with an overall enhancement in antimicrobial activity but a reduction in selectively. Notably, peptides 13-5 and 16 demonstrated antimicrobial potency and selectivity comparable to the clinically investigated AMP pexiganan. Our work demonstrates the potential of AI to expedite the discovery of peptide-based antibiotics to combat antibiotic resistance.
耐药细菌的增加促使人们迫切需要新型抗菌剂。抗菌肽(AMPs)由于其多种作用机制和降低耐药倾向而显示出很好的解决方案。本研究引入LLAMP (Large Language model for AMP activity prediction),这是一种目标物种感知的人工智能模型,利用预训练的语言模型来预测AMP的最小抑制浓度值。使用LLAMP,我们筛选了大约550万个肽序列,分别确定了肽13和肽16为最具选择性和最有效的候选。对注意值的分析使我们能够确定关键氨基酸残基(例如,色氨酸、赖氨酸和苯丙氨酸)。利用关键氨基酸,最具选择性的肽13序列通过靶向修饰增加了两致病性,产生的肽13-5总体上增强了抗菌活性,但选择性降低了。值得注意的是,肽13-5和16显示出与临床研究的AMP培昔加南相当的抗菌效力和选择性。我们的工作证明了人工智能在加速发现基于肽的抗生素以对抗抗生素耐药性方面的潜力。
期刊介绍:
Briefings in Bioinformatics is an international journal serving as a platform for researchers and educators in the life sciences. It also appeals to mathematicians, statisticians, and computer scientists applying their expertise to biological challenges. The journal focuses on reviews tailored for users of databases and analytical tools in contemporary genetics, molecular and systems biology. It stands out by offering practical assistance and guidance to non-specialists in computerized methodologies. Covering a wide range from introductory concepts to specific protocols and analyses, the papers address bacterial, plant, fungal, animal, and human data.
The journal's detailed subject areas include genetic studies of phenotypes and genotypes, mapping, DNA sequencing, expression profiling, gene expression studies, microarrays, alignment methods, protein profiles and HMMs, lipids, metabolic and signaling pathways, structure determination and function prediction, phylogenetic studies, and education and training.