{"title":"Discoveries by the genome profiling, symbolic powers of non-next generation sequencing methods.","authors":"Koichi Nishigaki","doi":"10.1093/bfgp/elae047","DOIUrl":"10.1093/bfgp/elae047","url":null,"abstract":"<p><p>Next-generation sequencing and other sequencing approaches have made significant progress in DNA analysis. However, there are indispensable advantages in the nonsequencing methods. They have their justifications such as being speedy, cost-effective, multi-applicable, and straightforward. Among the nonsequencing methods, the genome profiling method is worthy of reviewing because of its high potential. This article first reviews its basic properties, highlights the key concept of species identification dots (spiddos), and then summarizes its various applications.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"775-797"},"PeriodicalIF":2.5,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142741507","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yuanfeng Xu, Fan Yu, Wenrong Feng, Jia Wei, Shengyan Su, Jianlin Li, Guoan Hua, Wenjing Li, Yongkai Tang
{"title":"Genetic variation mining of the Chinese mitten crab (Eriocheir sinensis) based on transcriptome data from public databases.","authors":"Yuanfeng Xu, Fan Yu, Wenrong Feng, Jia Wei, Shengyan Su, Jianlin Li, Guoan Hua, Wenjing Li, Yongkai Tang","doi":"10.1093/bfgp/elae030","DOIUrl":"10.1093/bfgp/elae030","url":null,"abstract":"<p><p>At present, public databases house an extensive repository of transcriptome data, with the volume continuing to grow at an accelerated pace. Utilizing these data effectively is a shared interest within the scientific community. In this study, we introduced a novel strategy that harnesses SNPs and InDels identified from transcriptome data, combined with sample metadata from databases, to effectively screen for molecular markers correlated with traits. We utilized 228 transcriptome datasets of Eriocheir sinensis from the NCBI database and employed the Genome Analysis Toolkit software to identify 96 388 SNPs and 20 645 InDels. Employing the genome-wide association study analysis, in conjunction with the gender information from databases, we identified 3456 sex-biased SNPs and 639 sex-biased InDels. The KOG and KEGG annotations of the sex-biased SNPs and InDels revealed that these genes were primarily involved in the metabolic processes of E. sinensis. Combined with SnpEff annotation and PCR experimental validation, a highly sex-biased SNP located in the Kelch domain containing 4 (Klhdc4) gene, CHR67-6415071, was found to alter the splicing sites of Klhdc4, generating two splice variants, Klhdc4_a and Klhdc4_b. Additionally, Klhdc4 exhibited robust expression across the ovaries, testes, and accessory glands. The sex-biased SNPs and InDels identified in this study are conducive to the development of unisexual cultivation methods for E. sinensis, and the alternative splicing event caused by the sex-biased SNP in Klhdc4 may serve as a potential mechanism for sex regulation in E. sinensis. The analysis strategy employed in this study represents a new direction for the rational exploitation and utilization of transcriptome data in public databases.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"816-827"},"PeriodicalIF":2.5,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141565172","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Gene regulatory network inference based on novel ensemble method.","authors":"Bin Yang, Jing Li, Xiang Li, Sanrong Liu","doi":"10.1093/bfgp/elae036","DOIUrl":"10.1093/bfgp/elae036","url":null,"abstract":"<p><p>Gene regulatory networks (GRNs) contribute toward understanding the function of genes and the development of cancer or the impact of key genes on diseases. Hence, this study proposes an ensemble method based on 13 basic classification methods and a flexible neural tree (FNT) to improve GRN identification accuracy. The primary classification methods contain ridge classification, stochastic gradient descent, Gaussian process classification, Bernoulli Naive Bayes, adaptive boosting, gradient boosting decision tree, hist gradient boosting classification, eXtreme gradient boosting (XGBoost), multilayer perceptron, light gradient boosting machine, random forest, support vector machine, and k-nearest neighbor algorithm, which are regarded as the input variable set of FNT model. Additionally, a hybrid evolutionary algorithm based on a gene programming variant and particle swarm optimization is developed to search for the optimal FNT model. Experiments on three simulation datasets and three real single-cell RNA-seq datasets demonstrate that the proposed ensemble feature outperforms 13 supervised algorithms, seven unsupervised algorithms (ARACNE, CLR, GENIE3, MRNET, PCACMI, GENECI, and EPCACMI) and four single cell-specific methods (SCODE, BiRGRN, LEAP, and BiGBoost) based on the area under the receiver operating characteristic curve, area under the precision-recall curve, and F1 metrics.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"866-878"},"PeriodicalIF":2.5,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142332842","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Diego A Forero, Diego A Bonilla, Yeimy González-Giraldo, George P Patrinos
{"title":"An overview of key online resources for human genomics: a powerful and open toolbox for in silico research.","authors":"Diego A Forero, Diego A Bonilla, Yeimy González-Giraldo, George P Patrinos","doi":"10.1093/bfgp/elae029","DOIUrl":"10.1093/bfgp/elae029","url":null,"abstract":"<p><p>Recent advances in high-throughput molecular methods have led to an extraordinary volume of genomics data. Simultaneously, the progress in the computational implementation of novel algorithms has facilitated the creation of hundreds of freely available online tools for their advanced analyses. However, a general overview of the most commonly used tools for the in silico analysis of genomics data is still missing. In the current article, we present an overview of commonly used online resources for genomics research, including over 50 tools. This selection will be helpful for scientists with basic or intermediate skills in the in silico analyses of genomics data, such as researchers and students from wet labs seeking to strengthen their computational competencies. In addition, we discuss current needs and future perspectives within this field.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"754-764"},"PeriodicalIF":2.5,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141592146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A comprehensive survey of dimensionality reduction and clustering methods for single-cell and spatial transcriptomics data.","authors":"Yidi Sun, Lingling Kong, Jiayi Huang, Hongyan Deng, Xinling Bian, Xingfeng Li, Feifei Cui, Lijun Dou, Chen Cao, Quan Zou, Zilong Zhang","doi":"10.1093/bfgp/elae023","DOIUrl":"10.1093/bfgp/elae023","url":null,"abstract":"<p><p>In recent years, the application of single-cell transcriptomics and spatial transcriptomics analysis techniques has become increasingly widespread. Whether dealing with single-cell transcriptomic or spatial transcriptomic data, dimensionality reduction and clustering are indispensable. Both single-cell and spatial transcriptomic data are often high-dimensional, making the analysis and visualization of such data challenging. Through dimensionality reduction, it becomes possible to visualize the data in a lower-dimensional space, allowing for the observation of relationships and differences between cell subpopulations. Clustering enables the grouping of similar cells into the same cluster, aiding in the identification of distinct cell subpopulations and revealing cellular diversity, providing guidance for downstream analyses. In this review, we systematically summarized the most widely recognized algorithms employed for the dimensionality reduction and clustering analysis of single-cell transcriptomic and spatial transcriptomic data. This endeavor provides valuable insights and ideas that can contribute to the development of novel tools in this rapidly evolving field.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"733-744"},"PeriodicalIF":2.5,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141302188","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Characterization of double-stranded RNA and its silencing efficiency for insects using hybrid deep-learning framework.","authors":"Han Cheng, Liping Xu, Cangzhi Jia","doi":"10.1093/bfgp/elae027","DOIUrl":"10.1093/bfgp/elae027","url":null,"abstract":"<p><p>RNA interference (RNAi) technology is widely used in the biological prevention and control of terrestrial insects. One of the main factors with the application of RNAi in insects is the difference in RNAi efficiency, which may vary not only in different insects, but also in different genes of the same insect, and even in different double-stranded RNAs (dsRNAs) of the same gene. This work focuses on the last question and establishes a bioinformatics software that can help researchers screen for the most efficient dsRNA targeting target genes. Among insects, the red flour beetle (Tribolium castaneum) is known to be one of the most sensitive to RNAi. From iBeetle-Base, we extracted 12 027 efficient dsRNA sequences with a lethality rate of ≥20% or with experimentation-induced phenotypic changes and processed these data to correspond to specific silence efficiency. Based on the first complied novel benchmark dataset, we specifically designed a deep neural network to identify and characterize efficient dsRNA for RNAi in insects. The dna2vec word embedding model was trained to extract distributed feature representations, and three powerful modules, namely convolutional neural network, bidirectional long short-term memory network, and self-attention mechanism, were integrated to form our predictor model to characterize the extracted dsRNAs and their silencing efficiencies for T. castaneum. Our model dsRNAPredictor showed reliable performance in multiple independent tests based on different species, including both T. castaneum and Aedes aegypti. This indicates that dsRNAPredictor can facilitate prescreening for designing high-efficiency dsRNA targeting target genes of insects in advance.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"858-865"},"PeriodicalIF":2.5,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141443790","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jinglei Zhang, Nan Zhang, Qingyun Mai, Canquan Zhou
{"title":"The frontier of precision medicine: application of single-cell multi-omics in preimplantation genetic diagnosis.","authors":"Jinglei Zhang, Nan Zhang, Qingyun Mai, Canquan Zhou","doi":"10.1093/bfgp/elae041","DOIUrl":"10.1093/bfgp/elae041","url":null,"abstract":"<p><p>The advent of single-cell multi-omics technologies has revolutionized the landscape of preimplantation genetic diagnosis (PGD), offering unprecedented insights into the genetic, transcriptomic, and proteomic profiles of individual cells in early-stage embryos. This breakthrough holds the promise of enhancing the accuracy, efficiency, and scope of PGD, thereby significantly improving outcomes in assisted reproductive technologies (ARTs) and genetic disease prevention. This review provides a comprehensive overview of the importance of PGD in the context of precision medicine and elucidates how single-cell multi-omics technologies have transformed this field. We begin with a brief history of PGD, highlighting its evolution and application in detecting genetic disorders and facilitating ART. Subsequently, we delve into the principles, methodologies, and applications of single-cell genomics, transcriptomics, and proteomics in PGD, emphasizing their role in improving diagnostic precision and efficiency. Furthermore, we review significant recent advances within this domain, including key experimental designs, findings, and their implications for PGD practices. The advantages and limitations of these studies are analyzed to assess their potential impact on the future development of PGD technologies. Looking forward, we discuss the emerging research directions and challenges, focusing on technological advancements, new application areas, and strategies to overcome existing limitations. In conclusion, this review underscores the pivotal role of single-cell multi-omics in PGD, highlighting its potential to drive the progress of precision medicine and personalized treatment strategies, thereby marking a new era in reproductive genetics and healthcare.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"726-732"},"PeriodicalIF":2.5,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142565100","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Sesame Genomic Web Resource (SesameGWR): a well-annotated data resource for transcriptomic signatures of abiotic and biotic stress responses in sesame (Sesamum indicum L.).","authors":"Himanshu Avashthi, Ulavappa Basavanneppa Angadi, Divya Chauhan, Anuj Kumar, Dwijesh Chandra Mishra, Parimalan Rangan, Rashmi Yadav, Dinesh Kumar","doi":"10.1093/bfgp/elae022","DOIUrl":"10.1093/bfgp/elae022","url":null,"abstract":"<p><p>Sesame (Sesamum indicum L.) is a globally cultivated oilseed crop renowned for its historical significance and widespread growth in tropical and subtropical regions. With notable nutritional and medicinal attributes, sesame has shown promising effects in combating malnutrition cancer, diabetes, and other diseases like cardiovascular problems. However, sesame production faces significant challenges from environmental threats such as charcoal rot, drought, salinity, and waterlogging stress, resulting in economic losses for farmers. The scarcity of information on stress-resistance genes and pathways exacerbates these challenges. Despite its immense importance, there is currently no platform available to provide comprehensive information on sesame, which significantly hinders the mining of various stress-associated genes and the molecular breeding of sesame. To address this gap, here a free, web-accessible, and user-friendly genomic web resource (SesameGWR, http://backlin.cabgrid.res.in/sesameGWR/) has been developed This platform provides key insights into differentially expressed genes, transcription factors, miRNAs, and molecular markers like simple sequence repeats, single nucleotide polymorphisms, and insertions and deletions associated with both biotic and abiotic stresses.. The functional genomics information and annotations embedded in this web resource were predicted through RNA-seq data analysis. Considering the impact of climate change and the nutritional and medicinal importance of sesame, this study is of utmost importance in understanding stress responses. SesameGWR will serve as a valuable tool for developing climate-resilient sesame varieties, thereby enhancing the productivity of this ancient oilseed crop.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"828-842"},"PeriodicalIF":2.5,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141238358","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A review: simulation tools for genome-wide interaction studies.","authors":"Junliang Shang, Anqi Xu, Mingyuan Bi, Yuanyuan Zhang, Feng Li, Jin-Xing Liu","doi":"10.1093/bfgp/elae034","DOIUrl":"10.1093/bfgp/elae034","url":null,"abstract":"<p><p>Genome-wide association study (GWAS) is essential for investigating the genetic basis of complex diseases; nevertheless, it usually ignores the interaction of multiple single nucleotide polymorphisms (SNPs). Genome-wide interaction studies provide crucial means for exploring complex genetic interactions that GWAS may miss. Although many interaction methods have been proposed, challenges still persist, including the lack of epistasis models and the inconsistency of benchmark datasets. SNP data simulation is a pivotal intermediary between interaction methods and real applications. Therefore, it is important to obtain epistasis models and benchmark datasets by simulation tools, which is helpful for further improving interaction methods. At present, many simulation tools have been widely employed in the field of population genetics. According to their basic principles, these existing tools can be divided into four categories: coalescent simulation, forward-time simulation, resampling simulation, and other simulation frameworks. In this paper, their basic principles and representative simulation tools are compared and analyzed in detail. Additionally, this paper provides a discussion and summary of the advantages and disadvantages of these frameworks and tools, offering technical insights for the design of new methods, and serving as valuable reference tools for researchers to comprehensively understand GWAS and genome-wide interaction studies.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"745-753"},"PeriodicalIF":2.5,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142037783","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multi-omics integration analysis reveals the role of N6-methyladenosine in lncRNA translation during glioma stem cell differentiation.","authors":"Meng Zhang, Runqiu Cai, Jingjing Liu, Yulan Wang, Shan He, Quan Wang, Xiaofeng Song, Jing Wu, Jian Zhao","doi":"10.1093/bfgp/elae037","DOIUrl":"10.1093/bfgp/elae037","url":null,"abstract":"<p><p>Glioblastoma is one of the most lethal brain diseases in humans. Although recent studies have shown reciprocal interactions between N6-methyladenosine (m6A) modifications and long noncoding RNAs (lncRNAs) in gliomagenesis and malignant progression, the mechanism of m6A-mediated lncRNA translational regulation in glioblastoma remains unclear. Herein, we profiled the transcriptomes, translatomes, and epitranscriptomics of glioma stem cells and differentiated glioma cells to investigate the role of m6A in lncRNA translation comprehensively. We found that lncRNAs with numerous m6A peaks exhibit reduced translation efficiency. Transcript-level expression analysis demonstrates an enrichment of m6A around short open reading frames (sORFs) of translatable lncRNA transcripts. Further comparison analysis of m6A modifications in different RNA regions indicates that m6A peaks downstream of sORFs inhibit lncRNA translation more than those upstream. Observations in glioma-associated lncRNAs H19, LINC00467, and GAS5 further confirm the negative effect of m6A methylation on lncRNA translation. Overall, these findings elucidate the dynamic profiles of the m6A methylome and enhance the understanding of the complexity of lncRNA translational regulation.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"806-815"},"PeriodicalIF":2.5,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142395488","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}