{"title":"三模态蛋白质语言模型使高级蛋白质搜索成为可能。","authors":"Jin Su,Yan He,Shiyang You,Shiyu Jiang,Xibin Zhou,Xuting Zhang,Yuxuan Wang,Xining Su,Igor Tolstoy,Xing Chang,Hongyuan Lu,Fajie Yuan","doi":"10.1038/s41587-025-02836-0","DOIUrl":null,"url":null,"abstract":"ProTrek unifies protein sequence, structure and natural language function in a trimodal language model through contrastive learning, enabling comprehensive searches between any two modalities, including within modality. ProTrek surpasses current alignment tools (for example, Foldseek and MMseqs2) in speed and accuracy for identifying functionally related proteins. Computational and wet-lab experimental validations show that the ProTrek server ( www.search-protrek.com ), with precomputed embeddings for over 5 billion proteins, efficiently processes and analyzes large-scale protein repositories.","PeriodicalId":19084,"journal":{"name":"Nature biotechnology","volume":"23 1","pages":""},"PeriodicalIF":41.7000,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A trimodal protein language model enables advanced protein searches.\",\"authors\":\"Jin Su,Yan He,Shiyang You,Shiyu Jiang,Xibin Zhou,Xuting Zhang,Yuxuan Wang,Xining Su,Igor Tolstoy,Xing Chang,Hongyuan Lu,Fajie Yuan\",\"doi\":\"10.1038/s41587-025-02836-0\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"ProTrek unifies protein sequence, structure and natural language function in a trimodal language model through contrastive learning, enabling comprehensive searches between any two modalities, including within modality. ProTrek surpasses current alignment tools (for example, Foldseek and MMseqs2) in speed and accuracy for identifying functionally related proteins. Computational and wet-lab experimental validations show that the ProTrek server ( www.search-protrek.com ), with precomputed embeddings for over 5 billion proteins, efficiently processes and analyzes large-scale protein repositories.\",\"PeriodicalId\":19084,\"journal\":{\"name\":\"Nature biotechnology\",\"volume\":\"23 1\",\"pages\":\"\"},\"PeriodicalIF\":41.7000,\"publicationDate\":\"2025-10-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Nature biotechnology\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1038/s41587-025-02836-0\",\"RegionNum\":1,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOTECHNOLOGY & APPLIED MICROBIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature biotechnology","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1038/s41587-025-02836-0","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
A trimodal protein language model enables advanced protein searches.
ProTrek unifies protein sequence, structure and natural language function in a trimodal language model through contrastive learning, enabling comprehensive searches between any two modalities, including within modality. ProTrek surpasses current alignment tools (for example, Foldseek and MMseqs2) in speed and accuracy for identifying functionally related proteins. Computational and wet-lab experimental validations show that the ProTrek server ( www.search-protrek.com ), with precomputed embeddings for over 5 billion proteins, efficiently processes and analyzes large-scale protein repositories.
期刊介绍:
Nature Biotechnology is a monthly journal that focuses on the science and business of biotechnology. It covers a wide range of topics including technology/methodology advancements in the biological, biomedical, agricultural, and environmental sciences. The journal also explores the commercial, political, ethical, legal, and societal aspects of this research.
The journal serves researchers by providing peer-reviewed research papers in the field of biotechnology. It also serves the business community by delivering news about research developments. This approach ensures that both the scientific and business communities are well-informed and able to stay up-to-date on the latest advancements and opportunities in the field.
Some key areas of interest in which the journal actively seeks research papers include molecular engineering of nucleic acids and proteins, molecular therapy, large-scale biology, computational biology, regenerative medicine, imaging technology, analytical biotechnology, applied immunology, food and agricultural biotechnology, and environmental biotechnology.
In summary, Nature Biotechnology is a comprehensive journal that covers both the scientific and business aspects of biotechnology. It strives to provide researchers with valuable research papers and news while also delivering important scientific advancements to the business community.