PhaSepDB 3.0: a comprehensive knowledgebase of phase separation-related proteins from AI-assisted curation
Authors: Kaiqiang You, Runyu Li, Ruixin Lian, Yuxuan Li, Hongzhining Yang, Yiran Zhou, Yangsheng Chen, Likun Wang, Zhaoqing Fan, Liwei Ma, Tingting Li
Journal: Nucleic Acids Research (impact factor 13.1; JCR Q1, Biochemistry & Molecular Biology)
Publication date: 2025-10-08
Publication type: Journal Article
DOI: 10.1093/nar/gkaf973 (https://doi.org/10.1093/nar/gkaf973)
Abstract
Phase separation (PS) is a fundamental principle driving the formation of membraneless organelles (MLOs), which are critical for various cellular functions and pathological conditions. We present PhaSepDB 3.0 (https://db.phasep.pro/), a significantly updated knowledgebase of PS-related proteins. To address the challenge of curating a vast body of literature, we implemented a novel human-AI collaborative workflow that integrates a large language model (LLM)-based agentic system with expert verification, enabling a major expansion and enrichment of the database. PhaSepDB 3.0 now contains 3,484 expert-curated entries for 1,849 PS-related proteins, more than doubling the content of the previous version. The annotation framework has been restructured to capture deeper insights, including functional relevance, experimental evidence, and the intrinsic and extrinsic regulation of PS. A key new feature is the protein-wise summary page, which synthesizes data from multiple publications to provide a comprehensive overview of each protein's PS behaviour and functional relevance. With redesigned, user-friendly web interfaces, PhaSepDB 3.0 serves as a critical resource for the community, helping researchers explore the basis of PS and its biological implications in greater detail.
About the journal:
Nucleic Acids Research (NAR) is a scientific journal that publishes research on various aspects of nucleic acids and proteins involved in nucleic acid metabolism and interactions. It covers areas such as chemistry and synthetic biology, computational biology, gene regulation, chromatin and epigenetics, genome integrity, repair and replication, genomics, molecular biology, nucleic acid enzymes, RNA, and structural biology. The journal also includes a Survey and Summary section for brief reviews. Additionally, each year, the first issue is dedicated to biological databases, and an issue in July focuses on web-based software resources for the biological community. Nucleic Acids Research is indexed by several services including Abstracts on Hygiene and Communicable Diseases, Animal Breeding Abstracts, Agricultural Engineering Abstracts, Agbiotech News and Information, BIOSIS Previews, CAB Abstracts, and EMBASE.