Database: The Journal of Biological Databases and Curation最新文献

筛选
英文 中文
RegulaTome: a corpus of typed, directed, and signed relations between biomedical entities in the scientific literature. RegulaTome:科学文献中生物医学实体之间的类型化、定向和签名关系语料库。
IF 3.4 4区 生物学
Database: The Journal of Biological Databases and Curation Pub Date : 2024-09-12 DOI: 10.1093/database/baae095
Katerina Nastou, Farrokh Mehryary, Tomoko Ohta, Jouni Luoma, Sampo Pyysalo, Lars Juhl Jensen
{"title":"RegulaTome: a corpus of typed, directed, and signed relations between biomedical entities in the scientific literature.","authors":"Katerina Nastou, Farrokh Mehryary, Tomoko Ohta, Jouni Luoma, Sampo Pyysalo, Lars Juhl Jensen","doi":"10.1093/database/baae095","DOIUrl":"10.1093/database/baae095","url":null,"abstract":"<p><p>In the field of biomedical text mining, the ability to extract relations from the literature is crucial for advancing both theoretical research and practical applications. There is a notable shortage of corpora designed to enhance the extraction of multiple types of relations, particularly focusing on proteins and protein-containing entities such as complexes and families, as well as chemicals. In this work, we present RegulaTome, a corpus that overcomes the limitations of several existing biomedical relation extraction (RE) corpora, many of which concentrate on single-type relations at the sentence level. RegulaTome stands out by offering 16 961 relations annotated in >2500 documents, making it the most extensive dataset of its kind to date. This corpus is specifically designed to cover a broader spectrum of >40 relation types beyond those traditionally explored, setting a new benchmark in the complexity and depth of biomedical RE tasks. Our corpus both broadens the scope of detected relations and allows for achieving noteworthy accuracy in RE. A transformer-based model trained on this corpus has demonstrated a promising F1-score (66.6%) for a task of this complexity, underscoring the effectiveness of our approach in accurately identifying and categorizing a wide array of biological relations. This achievement highlights RegulaTome's potential to significantly contribute to the development of more sophisticated, efficient, and accurate RE systems to tackle biomedical tasks. Finally, a run of the trained RE system on all PubMed abstracts and PMC Open Access full-text documents resulted in >18 million relations, extracted from the entire biomedical literature.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":null,"pages":null},"PeriodicalIF":3.4,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11394941/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142281706","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Transformer-based approach for symptom recognition and multilingual linking 基于变换器的症状识别和多语言链接方法
IF 5.8 4区 生物学
Database: The Journal of Biological Databases and Curation Pub Date : 2024-09-11 DOI: 10.1093/database/baae090
Sylvia Vassileva, Georgi Grazhdanski, Ivan Koychev, Svetla Boytcheva
{"title":"Transformer-based approach for symptom recognition and multilingual linking","authors":"Sylvia Vassileva, Georgi Grazhdanski, Ivan Koychev, Svetla Boytcheva","doi":"10.1093/database/baae090","DOIUrl":"https://doi.org/10.1093/database/baae090","url":null,"abstract":"This paper presents a transformer-based approach for symptom Named Entity Recognition (NER) in Spanish clinical texts and multilingual entity linking on the SympTEMIST dataset. For Spanish NER, we fine tune a RoBERTa-based token-level classifier with Bidirectional Long Short-Term Memory and conditional random field layers on an augmented train set, achieving an F1 score of 0.73. Entity linking is performed via a hybrid approach with dictionaries, generating candidates from a knowledge base containing Unified Medical Language System aliases using the cross-lingual SapBERT and reranking the top candidates using GPT-3.5. The entity linking approach shows consistent results for multiple languages of 0.73 accuracy on the SympTEMIST multilingual dataset and also achieves an accuracy of 0.6123 on the Spanish entity linking task surpassing the current top score for this subtask. Database URL: https://github.com/svassileva/symptemist-multilingual-linking","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142204871","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Interactive tools for functional annotation of bacterial genomes. 细菌基因组功能注释互动工具。
IF 3.4 4区 生物学
Database: The Journal of Biological Databases and Curation Pub Date : 2024-09-06 DOI: 10.1093/database/baae089
Morgan N Price, Adam P Arkin
{"title":"Interactive tools for functional annotation of bacterial genomes.","authors":"Morgan N Price, Adam P Arkin","doi":"10.1093/database/baae089","DOIUrl":"10.1093/database/baae089","url":null,"abstract":"<p><p>Automated annotations of protein functions are error-prone because of our lack of knowledge of protein functions. For example, it is often impossible to predict the correct substrate for an enzyme or a transporter. Furthermore, much of the knowledge that we do have about the functions of proteins is missing from the underlying databases. We discuss how to use interactive tools to quickly find different kinds of information relevant to a protein's function. Many of these tools are available via PaperBLAST (http://papers.genomics.lbl.gov). Combining these tools often allows us to infer a protein's function. Ideally, accurate annotations would allow us to predict a bacterium's capabilities from its genome sequence, but in practice, this remains challenging. We describe interactive tools that infer potential capabilities from a genome sequence or that search a genome to find proteins that might perform a specific function of interest. Database URL: http://papers.genomics.lbl.gov.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":null,"pages":null},"PeriodicalIF":3.4,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11378808/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142143039","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Global Globin Network and adopting genomic variant database requirements for thalassemia. 全球球蛋白网络和采用地中海贫血基因组变异数据库要求。
IF 3.4 4区 生物学
Database: The Journal of Biological Databases and Curation Pub Date : 2024-09-04 DOI: 10.1093/database/baae080
Hashim Halim-Fikri, Ninie Nadia Zulkipli, Hafiza Alauddin, Celeste Bento, Carsten W Lederer, Petros Kountouris, Marina Kleanthous, Yetti Hernaningsih, Meow-Keong Thong, Muhammad Hamdi Mahmood, Norafiza Mohd Yasin, Ezalia Esa, Jacques Elion, Domenico Coviello, Raja-Zahratul-Azma Raja-Sabudin, Ghada El-Kamah, John Burn, Narazah Mohd Yusoff, Raj Ramesar, Bin Alwi Zilfalil
{"title":"Global Globin Network and adopting genomic variant database requirements for thalassemia.","authors":"Hashim Halim-Fikri, Ninie Nadia Zulkipli, Hafiza Alauddin, Celeste Bento, Carsten W Lederer, Petros Kountouris, Marina Kleanthous, Yetti Hernaningsih, Meow-Keong Thong, Muhammad Hamdi Mahmood, Norafiza Mohd Yasin, Ezalia Esa, Jacques Elion, Domenico Coviello, Raja-Zahratul-Azma Raja-Sabudin, Ghada El-Kamah, John Burn, Narazah Mohd Yusoff, Raj Ramesar, Bin Alwi Zilfalil","doi":"10.1093/database/baae080","DOIUrl":"10.1093/database/baae080","url":null,"abstract":"<p><p>Thalassemia is one of the most prevalent monogenic disorders in low- and middle-income countries (LMICs). There are an estimated 270 million carriers of hemoglobinopathies (abnormal hemoglobins and/or thalassemia) worldwide, necessitating global methods and solutions for effective and optimal therapy. LMICs are disproportionately impacted by thalassemia, and due to disparities in genomics awareness and diagnostic resources, certain LMICs lag behind high-income countries (HICs). This spurred the establishment of the Global Globin Network (GGN) in 2015 at UNESCO, Paris, as a project-wide endeavor within the Human Variome Project (HVP). Primarily aimed at enhancing thalassemia clinical services, research, and genomic diagnostic capabilities with a focus on LMIC needs, GGN aims to foster data collection in a shared database by all affected nations, thus improving data sharing and thalassemia management. In this paper, we propose a minimum requirement for establishing a genomic database in thalassemia based on the HVP database guidelines. We suggest using an existing platform recommended by HVP, the Leiden Open Variation Database (LOVD) (https://www.lovd.nl/). Adoption of our proposed criteria will assist in improving or supplementing the existing databases, allowing for better-quality services for individuals with thalassemia. Database URL: https://www.lovd.nl/.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":null,"pages":null},"PeriodicalIF":3.4,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11373567/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142132087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CuPCA: a web server for pan-cancer association analysis of large-scale cuproptosis-related genes. CuPCA:用于大规模杯突相关基因泛癌症关联分析的网络服务器。
IF 3.4 4区 生物学
Database: The Journal of Biological Databases and Curation Pub Date : 2024-09-03 DOI: 10.1093/database/baae075
Yishu Xu, Zhenshu Ma, Yajie Wang, Long Zhang, Jiaming Ye, Yuan Chen, Zhengrong Yuan
{"title":"CuPCA: a web server for pan-cancer association analysis of large-scale cuproptosis-related genes.","authors":"Yishu Xu, Zhenshu Ma, Yajie Wang, Long Zhang, Jiaming Ye, Yuan Chen, Zhengrong Yuan","doi":"10.1093/database/baae075","DOIUrl":"10.1093/database/baae075","url":null,"abstract":"<p><p>Copper-induced cell death is a novel mechanism of cell death, which is defined as cuproptosis. The increasing level of copper can produce toxicity in cells and may cause the occurrence of cell death. Several previous studies have proved that cuproptosis has a tight association with various cancers. Thus, the discovery of relationships between cuproptosis-related genes (CRGs) and human cancers is of great importance. Pan-cancer analysis can efficiently help researchers find out the relationship between multiple cancers and target genes precisely and make various prognostic analyses on cancers and cancer patients. Pan-cancer web servers can provide researchers with direct results of pan-cancer prognostic analyses, which can greatly improve the efficiency of their work. However, to date, no web server provides pan-cancer analysis about CRGs. Therefore, we introduce the cuproptosis pan-cancer analysis database (CuPCA), the first database for various analysis results of CRGs through 33 cancer types. CuPCA is a user-friendly resource for cancer researchers to gain various prognostic analyses between cuproptosis and cancers. It provides single CRG pan-cancer analysis, multi-CRGs pan-cancer analysis, multi-CRlncRNA pan-cancer analysis, and mRNA-circRNA-lncRNA conjoint analysis. These analysis results can not only indicate the relationship between cancers and cuproptosis at both gene level and protein level, but also predict the conditions of different cancer patients, which include their clinical condition, survival condition, and their immunological condition. CuPCA procures the delivery of analyzed data to end users, which improves the efficiency of wide research as well as releases the value of data resources. Database URL: http://cupca.cn/.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":null,"pages":null},"PeriodicalIF":3.4,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11373563/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142132086","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CBGDA: a manually curated resource for gene-disease associations based on genome-wide CRISPR. CBGDA:基于全基因组 CRISPR 的人工编辑的基因-疾病关联资源。
IF 3.4 4区 生物学
Database: The Journal of Biological Databases and Curation Pub Date : 2024-08-30 DOI: 10.1093/database/baae077
Qingsong Du, Zhiyu Zhang, Wanyi Yang, Xunyu Zhou, Nan Zhou, Chuanfang Wu, Jinku Bao
{"title":"CBGDA: a manually curated resource for gene-disease associations based on genome-wide CRISPR.","authors":"Qingsong Du, Zhiyu Zhang, Wanyi Yang, Xunyu Zhou, Nan Zhou, Chuanfang Wu, Jinku Bao","doi":"10.1093/database/baae077","DOIUrl":"10.1093/database/baae077","url":null,"abstract":"<p><p>The field of understanding the association between genes and diseases is rapidly expanding, making it challenging for researchers to keep up with the influx of new publications and genetic datasets. Fortunately, there are now several regularly updated databases available that focus on cataloging gene-disease relationships. The development of the Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)-Cas9 system has revolutionized the field of gene editing, providing a highly efficient, accurate, and reliable method for exploring gene-disease associations. However, currently, there is no resource specifically dedicated to collecting and integrating the latest experimentally supported gene-disease association data derived from genome-wide CRISPR screening. To address this gap, we have developed the CRISPR-Based Gene-Disease Associations (CBGDA) database, which includes over 200 manually curated gene-disease association data derived from genome-wide CRISPR screening studies. Through CBGDA, users can explore gene-disease association data derived from genome-wide CRISPR screening, gaining insights into the expression patterns of genes in different diseases, associated chemical data, and variant information. This provides a novel perspective on understanding the associations between genes and diseases. What is more, CBGDA integrates data from several other databases and resources, enhancing its comprehensiveness and utility. In summary, CBGDA offers a fresh perspective and comprehensive insights into the research on gene-disease associations. It fills the gap by providing a dedicated resource for accessing up-to-date, experimentally supported gene-disease association data derived from genome-wide CRISPR screening. Database URL: http://cbgda.zhounan.org/main.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":null,"pages":null},"PeriodicalIF":3.4,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11363955/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142105216","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Landscape of racial and ethnic health disparities in the All of Us Research Program. 我们所有人 "研究计划中的种族和民族健康差异情况。
IF 3.4 4区 生物学
Database: The Journal of Biological Databases and Curation Pub Date : 2024-08-30 DOI: 10.1093/database/baae082
Vincent Lam, Shivam Sharma, John L Spouge, I King Jordan, Leonardo Mariño-Ramírez
{"title":"Landscape of racial and ethnic health disparities in the All of Us Research Program.","authors":"Vincent Lam, Shivam Sharma, John L Spouge, I King Jordan, Leonardo Mariño-Ramírez","doi":"10.1093/database/baae082","DOIUrl":"10.1093/database/baae082","url":null,"abstract":"<p><p>The All of Us Research Program (\"All of Us\") is an initiative led by the National Institutes of Health whose goal is to advance research on personalized medicine and health equity through the collection of genetic, environmental, demographic, and health data from volunteer participants who reside in the USA. The program's emphasis on recruiting a diverse participant cohort makes \"All of Us\" an effective platform for investigating health disparities. In this work, we analyzed participant electronic health record (EHR) data to identify the diseases and disease categories in the \"All of Us\" cohort for which racial and ethnic prevalence disparities can be observed. In conjunction with these analyses, we developed the US Health Disparities Browser as an interactive web application that enables users to visualize differences in race- and ethnic-group-specific prevalence estimates for 1755 different diseases: https://usdisparities.biosci.gatech.edu/. The web application features a catalog of all diseases represented in the browser, which can be sorted by overall prevalence as well as the variance in prevalence across racial and ethnic groups. The analyses outlined here provide details on the nature and extent of racial and ethnic health disparities in the \"All of Us\" participant cohort, and the accompanying browser can serve as a resource through which researchers can explore these disparities Database URL: https://usdisparities.biosci.gatech.edu.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":null,"pages":null},"PeriodicalIF":3.4,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11363958/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142105217","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automatic extraction of transcriptional regulatory interactions of bacteria from biomedical literature using a BERT-based approach. 使用基于 BERT 的方法从生物医学文献中自动提取细菌的转录调控相互作用。
IF 3.4 4区 生物学
Database: The Journal of Biological Databases and Curation Pub Date : 2024-08-30 DOI: 10.1093/database/baae094
Alfredo Varela-Vega, Ali-Berenice Posada-Reyes, Carlos-Francisco Méndez-Cruz
{"title":"Automatic extraction of transcriptional regulatory interactions of bacteria from biomedical literature using a BERT-based approach.","authors":"Alfredo Varela-Vega, Ali-Berenice Posada-Reyes, Carlos-Francisco Méndez-Cruz","doi":"10.1093/database/baae094","DOIUrl":"10.1093/database/baae094","url":null,"abstract":"<p><p>Transcriptional regulatory networks (TRNs) give a global view of the regulatory mechanisms of bacteria to respond to environmental signals. These networks are published in biological databases as a valuable resource for experimental and bioinformatics researchers. Despite the efforts to publish TRNs of diverse bacteria, many of them still lack one and many of the existing TRNs are incomplete. In addition, the manual extraction of information from biomedical literature (\"literature curation\") has been the traditional way to extract these networks, despite this being demanding and time-consuming. Recently, language models based on pretrained transformers have been used to extract relevant knowledge from biomedical literature. Moreover, the benefit of fine-tuning a large pretrained model with new limited data for a specific task (\"transfer learning\") opens roads to address new problems of biomedical information extraction. Here, to alleviate this lack of knowledge and assist literature curation, we present a new approach based on the Bidirectional Transformer for Language Understanding (BERT) architecture to classify transcriptional regulatory interactions of bacteria as a first step to extract TRNs from literature. The approach achieved a significant performance in a test dataset of sentences of Escherichia coli (F1-Score: 0.8685, Matthew's correlation coefficient: 0.8163). The examination of model predictions revealed that the model learned different ways to express the regulatory interaction. The approach was evaluated to extract a TRN of Salmonella using 264 complete articles. The evaluation showed that the approach was able to accurately extract 82% of the network and that it was able to extract interactions absent in curation data. To the best of our knowledge, the present study is the first effort to obtain a BERT-based approach to extract this specific kind of interaction. This approach is a starting point to address the limitations of reconstructing TRNs of bacteria and diseases of biological interest. Database URL: https://github.com/laigen-unam/BERT-trn-extraction.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":null,"pages":null},"PeriodicalIF":3.4,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11363960/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142105215","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Acupuncture indication knowledge bases: meridian entity recognition and classification based on ACUBERT. 针灸适应症知识库:基于 ACUBERT 的经络实体识别与分类。
IF 3.4 4区 生物学
Database: The Journal of Biological Databases and Curation Pub Date : 2024-08-30 DOI: 10.1093/database/baae083
TianCheng Xu, Jing Wen, Lei Wang, YueYing Huang, ZiJing Zhu, Qian Zhu, Yi Fang, ChengBiao Yang, YouBing Xia
{"title":"Acupuncture indication knowledge bases: meridian entity recognition and classification based on ACUBERT.","authors":"TianCheng Xu, Jing Wen, Lei Wang, YueYing Huang, ZiJing Zhu, Qian Zhu, Yi Fang, ChengBiao Yang, YouBing Xia","doi":"10.1093/database/baae083","DOIUrl":"10.1093/database/baae083","url":null,"abstract":"<p><p>In acupuncture diagnosis and treatment, non-quantitative clinical descriptions have limited the development of standardized treatment methods. This study explores the effectiveness and the reasons for discrepancies in the entity recognition and classification of meridians in acupuncture indication using the Acupuncture Bidirectional Encoder Representations from Transformers (ACUBERT) model. During the research process, we selected 54 593 different entities from 82 acupuncture medical books as the pretraining corpus for medical literature, conducting classification research on Chinese medical literature using the BERT model. Additionally, we employed the support vector machine and Random Forest models as comparative benchmarks and optimized them through parameter tuning, ultimately leading to the development of the ACUBERT model. The results show that the ACUBERT model outperforms other baseline models in classification effectiveness, achieving the best performance at Epoch = 5. The model's \"precision,\" \"recall,\" and F1 scores reached above 0.8. Moreover, our study has a unique feature: it trains the meridian differentiation model based on the eight principles of differentiation and zang-fu differentiation as foundational labels. It establishes an acupuncture-indication knowledge base (ACU-IKD) and ACUBERT model with traditional Chinese medicine characteristics. In summary, the ACUBERT model significantly enhances the classification effectiveness of meridian attribution in the acupuncture indication database and also demonstrates the classification advantages of deep learning methods based on BERT in multi-category, large-scale training sets. Database URL: http://acuai.njucm.edu.cn:8081/#/user/login?tenantUrl=default.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":null,"pages":null},"PeriodicalIF":3.4,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11363959/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142105214","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Autoinhibited Protein Database: a curated database of autoinhibitory domains and their autoinhibition mechanisms. 自动抑制蛋白质数据库:一个关于自动抑制结构域及其自动抑制机制的编辑数据库。
IF 3.4 4区 生物学
Database: The Journal of Biological Databases and Curation Pub Date : 2024-08-28 DOI: 10.1093/database/baae085
Daeahn Cho, Hyang-Mi Lee, Ji Ah Kim, Jae Gwang Song, Su-Hee Hwang, Bomi Lee, Jinsil Park, Kha Mong Tran, Jiwon Kim, Phuong Ngoc Lam Vo, Jooeun Bae, Teerapat Pimt, Kangseok Lee, Jörg Gsponer, Hyung Wook Kim, Dokyun Na
{"title":"Autoinhibited Protein Database: a curated database of autoinhibitory domains and their autoinhibition mechanisms.","authors":"Daeahn Cho, Hyang-Mi Lee, Ji Ah Kim, Jae Gwang Song, Su-Hee Hwang, Bomi Lee, Jinsil Park, Kha Mong Tran, Jiwon Kim, Phuong Ngoc Lam Vo, Jooeun Bae, Teerapat Pimt, Kangseok Lee, Jörg Gsponer, Hyung Wook Kim, Dokyun Na","doi":"10.1093/database/baae085","DOIUrl":"10.1093/database/baae085","url":null,"abstract":"<p><p>Autoinhibition, a crucial allosteric self-regulation mechanism in cell signaling, ensures signal propagation exclusively in the presence of specific molecular inputs. The heightened focus on autoinhibited proteins stems from their implication in human diseases, positioning them as potential causal factors or therapeutic targets. However, the absence of a comprehensive knowledgebase impedes a thorough understanding of their roles and applications in drug discovery. Addressing this gap, we introduce Autoinhibited Protein Database (AiPD), a curated database standardizing information on autoinhibited proteins. AiPD encompasses details on autoinhibitory domains (AIDs), their targets, regulatory mechanisms, experimental validation methods, and implications in diseases, including associated mutations and post-translational modifications. AiPD comprises 698 AIDs from 532 experimentally characterized autoinhibited proteins and 2695 AIDs from their 2096 homologs, which were retrieved from 864 published articles. AiPD also includes 42 520 AIDs of computationally predicted autoinhibited proteins. In addition, AiPD facilitates users in investigating potential AIDs within a query sequence through comparisons with documented autoinhibited proteins. As the inaugural autoinhibited protein repository, AiPD significantly aids researchers studying autoinhibition mechanisms and their alterations in human diseases. It is equally valuable for developing computational models, analyzing allosteric protein regulation, predicting new drug targets, and understanding intervention mechanisms AiPD serves as a valuable resource for diverse researchers, contributing to the understanding and manipulation of autoinhibition in cellular processes. Database URL: http://ssbio.cau.ac.kr/databases/AiPD.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":null,"pages":null},"PeriodicalIF":3.4,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11349611/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142079544","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信