Yichao Zhao, Ju Xiang, Xingyuan Shi, Pengzhen Jia, Yan Zhang, Min Li
{"title":"MDDOmics: multi-omics resource of major depressive disorder.","authors":"Yichao Zhao, Ju Xiang, Xingyuan Shi, Pengzhen Jia, Yan Zhang, Min Li","doi":"10.1093/database/baae042","DOIUrl":"10.1093/database/baae042","url":null,"abstract":"<p><p>Major depressive disorder (MDD) is a pressing global health issue. Its pathogenesis remains elusive, but numerous studies have revealed its intricate associations with various biological factors. Consequently, there is an urgent need for a comprehensive multi-omics resource to help researchers in conducting multi-omics data analysis for MDD. To address this issue, we constructed the MDDOmics database (Major Depressive Disorder Omics, (https://www.csuligroup.com/MDDOmics/), which integrates an extensive collection of published multi-omics data related to MDD. The database contains 41 222 entries of MDD research results and several original datasets, including Single Nucleotide Polymorphisms, genes, non-coding RNAs, DNA methylations, metabolites and proteins, and offers various interfaces for searching and visualization. We also provide extensive downstream analyses of the collected MDD data, including differential analysis, enrichment analysis and disease-gene prediction. Moreover, the database also incorporates multi-omics data for bipolar disorder, schizophrenia and anxiety disorder, due to the challenge in differentiating MDD from similar psychiatric disorders. In conclusion, by leveraging the rich content and online interfaces from MDDOmics, researchers can conduct more comprehensive analyses of MDD and its similar disorders from various perspectives, thereby gaining a deeper understanding of potential MDD biomarkers and intricate disease pathogenesis. Database URL: https://www.csuligroup.com/MDDOmics/.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":null,"pages":null},"PeriodicalIF":3.4,"publicationDate":"2024-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11197964/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141450008","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Correction to: An interactive web application for exploring systemic lupus erythematosus blood transcriptomic diversity.","authors":"","doi":"10.1093/database/baae059","DOIUrl":"10.1093/database/baae059","url":null,"abstract":"","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":null,"pages":null},"PeriodicalIF":3.4,"publicationDate":"2024-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11197958/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141450007","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"DMLS: an automated pipeline to extract the Drosophila modular transcription regulators and targets from massive literature articles.","authors":"Tzu-Hsien Yang, Yu-Huai Yu, Sheng-Hang Wu, Fang-Yuan Chang, Hsiu-Chun Tsai, Ya-Chiao Yang","doi":"10.1093/database/baae049","DOIUrl":"10.1093/database/baae049","url":null,"abstract":"<p><p>Transcription regulation in multicellular species is mediated by modular transcription factor (TF) binding site combinations termed cis-regulatory modules (CRMs). Such CRM-mediated transcription regulation determines the gene expression patterns during development. Biologists frequently investigate CRM transcription regulation on gene expressions. However, the knowledge of the target genes and regulatory TFs participating in the CRMs under study is mostly fragmentary throughout the literature. Researchers need to afford tremendous human resources to fully surf through the articles deposited in biomedical literature databases in order to obtain the information. Although several novel text-mining systems are now available for literature triaging, these tools do not specifically focus on CRM-related literature prescreening, failing to correctly extract the information of the CRM target genes and regulatory TFs from the literature. For this reason, we constructed a supportive auto-literature prescreener called Drosophila Modular transcription-regulation Literature Screener (DMLS) that achieves the following: (i) prescreens articles describing experiments on modular transcription regulation, (ii) identifies the described target genes and TFs of the CRMs under study for each modular transcription-regulation-describing article and (iii) features an automated and extendable pipeline to perform the task. We demonstrated that the final performance of DMLS in extracting the described target gene and regulatory TF lists of CRMs under study for given articles achieved test macro area under the ROC curve (auROC) = 89.7% and area under the precision-recall curve (auPRC) = 77.6%, outperforming the intuitive gene name-occurrence-counting method by at least 19.9% in auROC and 30.5% in auPRC. The web service and the command line versions of DMLS are available at https://cobis.bme.ncku.edu.tw/DMLS/ and https://github.com/cobisLab/DMLS/, respectively. Database Tool URL: https://cobis.bme.ncku.edu.tw/DMLS/.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":null,"pages":null},"PeriodicalIF":3.4,"publicationDate":"2024-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11188685/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141431635","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Correction to: A Terpenoids Database with the Chemical Content as A Novel Agronomic Trait.","authors":"","doi":"10.1093/database/baae050","DOIUrl":"10.1093/database/baae050","url":null,"abstract":"","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":null,"pages":null},"PeriodicalIF":3.4,"publicationDate":"2024-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11186476/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141426574","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Chong Peng, Xiaofeng Liu, Xiangbo Meng, Congge Chen, Xinming Wu, Lin Bai, Fuping Lu, Fufeng Liu
{"title":"IPAD-DB: a manually curated database for experimentally verified inhibitors of proteins associated with Alzheimer's disease.","authors":"Chong Peng, Xiaofeng Liu, Xiangbo Meng, Congge Chen, Xinming Wu, Lin Bai, Fuping Lu, Fufeng Liu","doi":"10.1093/database/baae048","DOIUrl":"10.1093/database/baae048","url":null,"abstract":"<p><p>Alzheimer's disease (AD) is a universal neurodegenerative disease with the feature of progressive dementia. Currently, there are only seven Food and Drug Administration-approved drugs for the treatment of AD, which merely offer temporary relief from symptom deterioration without reversing the underlying disease process. The identification of inhibitors capable of interacting with proteins associated with AD plays a pivotal role in the development of effective therapeutic interventions. However, a vast number of such inhibitors are dispersed throughout numerous published articles, rendering it inconvenient for researchers to explore potential drug candidates for AD. In light of this, we have manually compiled inhibitors targeting proteins associated with AD and constructed a comprehensive database known as IPAD-DB (Inhibitors of Proteins associated with Alzheimer's Disease Database). The curated inhibitors within this database encompass a diverse range of compounds, including natural compounds, synthetic compounds, drugs, natural extracts and nano-inhibitors. To date, the database has compiled >4800 entries, each representing a correspondent relationship between an inhibitor and its target protein. IPAD-DB offers a user-friendly interface that facilitates browsing, searching and downloading of its records. We firmly believe that IPAD-DB represents a valuable resource for screening potential AD drug candidates and investigating the underlying mechanisms of this debilitating disease. Access to IPAD-DB is freely available at http://www.lamee.cn/ipad-db/ and is compatible with all major web browsers. Database URL: http://www.lamee.cn/ipad-db/.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2024-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11168334/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141310308","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Leho Tedersoo, Mahdieh S Hosseyni Moghaddam, Vladimir Mikryukov, Ali Hakimzadeh, Mohammad Bahram, R Henrik Nilsson, Iryna Yatsiuk, Stefan Geisen, Arne Schwelm, Kasia Piwosz, Marko Prous, Sirje Sildever, Dominika Chmolowska, Sonja Rueckert, Pavel Skaloud, Peeter Laas, Marco Tines, Jae-Ho Jung, Ji Hye Choi, Saad Alkahtani, Sten Anslan
{"title":"EUKARYOME: the rRNA gene reference database for identification of all eukaryotes.","authors":"Leho Tedersoo, Mahdieh S Hosseyni Moghaddam, Vladimir Mikryukov, Ali Hakimzadeh, Mohammad Bahram, R Henrik Nilsson, Iryna Yatsiuk, Stefan Geisen, Arne Schwelm, Kasia Piwosz, Marko Prous, Sirje Sildever, Dominika Chmolowska, Sonja Rueckert, Pavel Skaloud, Peeter Laas, Marco Tines, Jae-Ho Jung, Ji Hye Choi, Saad Alkahtani, Sten Anslan","doi":"10.1093/database/baae043","DOIUrl":"10.1093/database/baae043","url":null,"abstract":"<p><p>Molecular identification of micro- and macroorganisms based on nuclear markers has revolutionized our understanding of their taxonomy, phylogeny and ecology. Today, research on the diversity of eukaryotes in global ecosystems heavily relies on nuclear ribosomal RNA (rRNA) markers. Here, we present the research community-curated reference database EUKARYOME for nuclear ribosomal 18S rRNA, internal transcribed spacer (ITS) and 28S rRNA markers for all eukaryotes, including metazoans (animals), protists, fungi and plants. It is particularly useful for the identification of arbuscular mycorrhizal fungi as it bridges the four commonly used molecular markers-ITS1, ITS2, 18S V4-V5 and 28S D1-D2 subregions. The key benefits of this database over other annotated reference sequence databases are that it is not restricted to certain taxonomic groups and it includes all rRNA markers. EUKARYOME also offers a number of reference long-read sequences that are derived from (meta)genomic and (meta)barcoding-a unique feature that can be used for taxonomic identification and chimera control of third-generation, long-read, high-throughput sequencing data. Taxonomic assignments of rRNA genes in the database are verified based on phylogenetic approaches. The reference datasets are available in multiple formats from the project homepage, http://www.eukaryome.org.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":null,"pages":null},"PeriodicalIF":3.4,"publicationDate":"2024-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11168333/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141310307","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Daniel Faria, Patrícia Eugénio, Marta Contreiras Silva, Laura Balbi, Georges Bedran, Ashwin Adrian Kallor, Susana Nunes, Aleksander Palkowski, Michal Waleron, Javier A Alfaro, Catia Pesquita
{"title":"The Immunopeptidomics Ontology (ImPO).","authors":"Daniel Faria, Patrícia Eugénio, Marta Contreiras Silva, Laura Balbi, Georges Bedran, Ashwin Adrian Kallor, Susana Nunes, Aleksander Palkowski, Michal Waleron, Javier A Alfaro, Catia Pesquita","doi":"10.1093/database/baae014","DOIUrl":"10.1093/database/baae014","url":null,"abstract":"<p><p>The adaptive immune response plays a vital role in eliminating infected and aberrant cells from the body. This process hinges on the presentation of short peptides by major histocompatibility complex Class I molecules on the cell surface. Immunopeptidomics, the study of peptides displayed on cells, delves into the wide variety of these peptides. Understanding the mechanisms behind antigen processing and presentation is crucial for effectively evaluating cancer immunotherapies. As an emerging domain, immunopeptidomics currently lacks standardization-there is neither an established terminology nor formally defined semantics-a critical concern considering the complexity, heterogeneity, and growing volume of data involved in immunopeptidomics studies. Additionally, there is a disconnection between how the proteomics community delivers the information about antigen presentation and its uptake by the clinical genomics community. Considering the significant relevance of immunopeptidomics in cancer, this shortcoming must be addressed to bridge the gap between research and clinical practice. In this work, we detail the development of the ImmunoPeptidomics Ontology, ImPO, the first effort at standardizing the terminology and semantics in the domain. ImPO aims to encapsulate and systematize data generated by immunopeptidomics experimental processes and bioinformatics analysis. ImPO establishes cross-references to 24 relevant ontologies, including the National Cancer Institute Thesaurus, Mondo Disease Ontology, Logical Observation Identifier Names and Codes and Experimental Factor Ontology. Although ImPO was developed using expert knowledge to characterize a large and representative data collection, it may be readily used to encode other datasets within the domain. Ultimately, ImPO facilitates data integration and analysis, enabling querying, inference and knowledge generation and importantly bridging the gap between the clinical proteomics and genomics communities. As the field of immunogenomics uses protein-level immunopeptidomics data, we expect ImPO to play a key role in supporting a rich and standardized description of the large-scale data that emerging high-throughput technologies are expected to bring in the near future. Ontology URL: https://zenodo.org/record/10237571 Project GitHub: https://github.com/liseda-lab/ImPO/blob/main/ImPO.owl.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2024-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11164101/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141300289","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Shumin Ren, Lin Yang, Jiale Du, Mengqiao He, Bairong Shen
{"title":"DRGKB: a knowledgebase of worldwide diagnosis-related groups' practices for comparison, evaluation and knowledge-guided application.","authors":"Shumin Ren, Lin Yang, Jiale Du, Mengqiao He, Bairong Shen","doi":"10.1093/database/baae046","DOIUrl":"10.1093/database/baae046","url":null,"abstract":"<p><p>As a prospective payment method, diagnosis-related groups (DRGs)'s implementation has varying effects on different regions and adopt different case classification systems. Our goal is to build a structured public online knowledgebase describing the worldwide practice of DRGs, which includes systematic indicators for DRGs' performance assessment. Therefore, we manually collected the qualified literature from PUBMED and constructed DRGKB website. We divided the evaluation indicators into four categories, including (i) medical service quality; (ii) medical service efficiency; (iii) profitability and sustainability; (iv) case grouping ability. Then we carried out descriptive analysis and comprehensive scoring on outcome measurements performance, improvement strategy and specialty performance. At last, the DRGKB finally contains 297 entries. It was found that DRGs generally have a considerable impact on hospital operations, including average length of stay, medical quality and use of medical resources. At the same time, the current DRGs also have many deficiencies, including insufficient reimbursement rates and the ability to classify complex cases. We analyzed these underperforming parts by domain. In conclusion, this research innovatively constructed a knowledgebase to quantify the practice effects of DRGs, analyzed and visualized the development trends and area performance from a comprehensive perspective. This study provides a data-driven research paradigm for following DRGs-related work along with a proposed DRGs evolution model. Availability and implementation: DRGKB is freely available at http://www.sysbio.org.cn/drgkb/. Database URL: http://www.sysbio.org.cn/drgkb/.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2024-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11155695/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141283277","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"MechanoProDB: a web-based database for exploring the mechanical properties of proteins.","authors":"Ismahene Mesbah, Bianca Habermann, Felix Rico","doi":"10.1093/database/baae047","DOIUrl":"10.1093/database/baae047","url":null,"abstract":"<p><p>The mechanical stability of proteins is crucial for biological processes. To understand the mechanical functions of proteins, it is important to know the protein structure and mechanical properties. Protein mechanics is usually investigated through force spectroscopy experiments and simulations that probe the forces required to unfold the protein of interest. While there is a wealth of data in the literature on force spectroscopy experiments and steered molecular dynamics simulations of forced protein unfolding, this information is spread and difficult to access by non-experts. Here, we introduce MechanoProDB, a novel web-based database resource for collecting and mining data obtained from experimental and computational works. MechanoProDB provides a curated repository for a wide range of proteins, including muscle proteins, adhesion molecules and membrane proteins. The database incorporates relevant parameters that provide insights into the mechanical stability of proteins and their conformational stability such as the unfolding forces, energy landscape parameters and contour lengths of unfolding steps. Additionally, it provides intuitive annotations of the unfolding pathways of each protein, allowing users to explore the individual steps during mechanical unfolding. The user-friendly interface of MechanoProDB allows researchers to efficiently navigate, search and download data pertaining to specific protein folds or experimental conditions. Users can visualize protein structures using interactive tools integrated within the database, such as Mol*, and plot available data through integrated plotting tools. To ensure data quality and reliability, we have carefully manually verified and curated the data currently available on MechanoProDB. Furthermore, the database also features an interface that enables users to contribute new data and annotations, promoting community-driven comprehensiveness. The freely available MechanoProDB aims to streamline and accelerate research in the field of mechanobiology and biophysics by offering a unique platform for data sharing and analysis. MechanoProDB is freely available at https://mechanoprodb.ibdm.univ-amu.fr.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2024-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11152175/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141260928","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"FishTEDB 2.0: an update fish transposable element (TE) database with new functions to facilitate TE research.","authors":"Feng Shao, Minzhi Zeng, Xiaofei Xu, Huahao Zhang, Zuogang Peng","doi":"10.1093/database/baae044","DOIUrl":"10.1093/database/baae044","url":null,"abstract":"<p><p>We launched the initial version of FishTEDB in 2018, which aimed to establish an open-source, user-friendly, data-rich transposable element (TE) database. Over the past 5 years, FishTEDB 1.0 has gained approximately 10 000 users, accumulating more than 450 000 interactions. With the unveiling of extensive fish genome data and the increasing emphasis on TE research, FishTEDB needs to extend the richness of data and functions. To achieve the above goals, we introduced 33 new fish species to FishTEDB 2.0, encompassing a wide array of fish belonging to 48 orders. To make the updated database more functional, we added a genome browser to visualize the positional relationship between TEs and genes and the estimated TE insertion time in different species. In conclusion, we released a new version of the fish TE database, FishTEDB 2.0, designed to assist researchers in the future study of TE functions and promote the progress of biological theories related to TEs. Database URL: https://www.fishtedb.com/.</p>","PeriodicalId":10923,"journal":{"name":"Database: The Journal of Biological Databases and Curation","volume":null,"pages":null},"PeriodicalIF":5.8,"publicationDate":"2024-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11146639/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141237492","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}