GigaSciencePub Date : 2024-01-02DOI: 10.1093/gigascience/giae055
César Piñeiro, Juan C Pichel
{"title":"Efficient phylogenetic tree inference for massive taxonomic datasets: harnessing the power of a server to analyze 1 million taxa.","authors":"César Piñeiro, Juan C Pichel","doi":"10.1093/gigascience/giae055","DOIUrl":"10.1093/gigascience/giae055","url":null,"abstract":"<p><strong>Background: </strong>Phylogenies play a crucial role in biological research. Unfortunately, the search for the optimal phylogenetic tree incurs significant computational costs, and most of the existing state-of-the-art tools cannot deal with extremely large datasets in reasonable times.</p><p><strong>Results: </strong>In this work, we introduce the new VeryFastTree code (version 4.0), which is able to construct a tree on 1 server using single-precision arithmetic from a massive 1 million alignment dataset in only 36 hours, which is 3 times and 3.2 times faster than its previous version and FastTree-2, respectively. This new version further boosts performance by parallelizing all tree traversal operations during the tree construction process, including subtree pruning and regrafting moves. Additionally, it introduces significant new features such as support for new and compressed file formats, enhanced compatibility across a broader range of operating systems, and the integration of disk computing functionality. The latter feature is particularly advantageous for users without access to high-end servers, as it allows them to manage very large datasets, albeit with an increase in computing time.</p><p><strong>Conclusions: </strong>Experimental results establish VeryFastTree as the fastest tool in the state-of-the-art for maximum likelihood phylogeny estimation. It is publicly available at https://github.com/citiususc/veryfasttree. In addition, VeryFastTree is included as a package in Bioconda, MacPorts, and all Debian-based Linux distributions.</p>","PeriodicalId":12581,"journal":{"name":"GigaScience","volume":null,"pages":null},"PeriodicalIF":11.8,"publicationDate":"2024-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11308190/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141906391","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Dual-Alpha: a large EEG study for dual-frequency SSVEP brain-computer interface.","authors":"Yike Sun, Liyan Liang, Yuhan Li, Xiaogang Chen, Xiaorong Gao","doi":"10.1093/gigascience/giae041","DOIUrl":"10.1093/gigascience/giae041","url":null,"abstract":"<p><strong>Background: </strong>The domain of brain-computer interface (BCI) technology has experienced significant expansion in recent years. However, the field continues to face a pivotal challenge due to the dearth of high-quality datasets. This lack of robust datasets serves as a bottleneck, constraining the progression of algorithmic innovations and, by extension, the maturation of the BCI field.</p><p><strong>Findings: </strong>This study details the acquisition and compilation of electroencephalogram data across 3 distinct dual-frequency steady-state visual evoked potential (SSVEP) paradigms, encompassing over 100 participants. Each experimental condition featured 40 individual targets with 5 repetitions per target, culminating in a comprehensive dataset consisting of 21,000 trials of dual-frequency SSVEP recordings. We performed an exhaustive validation of the dataset through signal-to-noise ratio analyses and task-related component analysis, thereby substantiating its reliability and effectiveness for classification tasks.</p><p><strong>Conclusions: </strong>The extensive dataset presented is set to be a catalyst for the accelerated development of BCI technologies. Its significance extends beyond the BCI sphere and holds considerable promise for propelling research in psychology and neuroscience. The dataset is particularly invaluable for discerning the complex dynamics of binocular visual resource distribution.</p>","PeriodicalId":12581,"journal":{"name":"GigaScience","volume":null,"pages":null},"PeriodicalIF":11.8,"publicationDate":"2024-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11304967/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141901425","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
GigaSciencePub Date : 2024-01-02DOI: 10.1093/gigascience/giae054
Giulia Zancolli, Björn Marcus von Reumont, Gregor Anderluh, Figen Caliskan, Maria Luisa Chiusano, Jacob Fröhlich, Evroula Hapeshi, Benjamin-Florian Hempel, Maria P Ikonomopoulou, Florence Jungo, Pascale Marchot, Tarcisio Mendes de Farias, Maria Vittoria Modica, Yehu Moran, Ayse Nalbantsoy, Jan Procházka, Andrea Tarallo, Fiorella Tonello, Rui Vitorino, Mark Lawrence Zammit, Agostinho Antunes
{"title":"Web of venom: exploration of big data resources in animal toxin research.","authors":"Giulia Zancolli, Björn Marcus von Reumont, Gregor Anderluh, Figen Caliskan, Maria Luisa Chiusano, Jacob Fröhlich, Evroula Hapeshi, Benjamin-Florian Hempel, Maria P Ikonomopoulou, Florence Jungo, Pascale Marchot, Tarcisio Mendes de Farias, Maria Vittoria Modica, Yehu Moran, Ayse Nalbantsoy, Jan Procházka, Andrea Tarallo, Fiorella Tonello, Rui Vitorino, Mark Lawrence Zammit, Agostinho Antunes","doi":"10.1093/gigascience/giae054","DOIUrl":"10.1093/gigascience/giae054","url":null,"abstract":"<p><p>Research on animal venoms and their components spans multiple disciplines, including biology, biochemistry, bioinformatics, pharmacology, medicine, and more. Manipulating and analyzing the diverse array of data required for venom research can be challenging, and relevant tools and resources are often dispersed across different online platforms, making them less accessible to nonexperts. In this article, we address the multifaceted needs of the scientific community involved in venom and toxin-related research by identifying and discussing web resources, databases, and tools commonly used in this field. We have compiled these resources into a comprehensive table available on the VenomZone website (https://venomzone.expasy.org/10897). Furthermore, we highlight the challenges currently faced by researchers in accessing and using these resources and emphasize the importance of community-driven interdisciplinary approaches. We conclude by underscoring the significance of enhancing standards, promoting interoperability, and encouraging data and method sharing within the venom research community.</p>","PeriodicalId":12581,"journal":{"name":"GigaScience","volume":null,"pages":null},"PeriodicalIF":11.8,"publicationDate":"2024-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11382406/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142153664","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
GigaSciencePub Date : 2024-01-02DOI: 10.1093/gigascience/giae071
Abigayle Simpson, Elisha M Wood-Charlson, Montana Smith, Benjamin J Koch, Kathleen Beilsmith, Jeffrey A Kimbrel, Matthew Kellom, Christopher I Hunter, Ramona L Walls, Lynn M Schriml, Roland C Wilhelm
{"title":"MISIP: a data standard for the reuse and reproducibility of any stable isotope probing-derived nucleic acid sequence and experiment.","authors":"Abigayle Simpson, Elisha M Wood-Charlson, Montana Smith, Benjamin J Koch, Kathleen Beilsmith, Jeffrey A Kimbrel, Matthew Kellom, Christopher I Hunter, Ramona L Walls, Lynn M Schriml, Roland C Wilhelm","doi":"10.1093/gigascience/giae071","DOIUrl":"https://doi.org/10.1093/gigascience/giae071","url":null,"abstract":"<p><p>DNA/RNA-stable isotope probing (SIP) is a powerful tool to link in situ microbial activity to sequencing data. Every SIP dataset captures distinct information about microbial community metabolism, process rates, and population dynamics, offering valuable insights for a wide range of research questions. Data reuse maximizes the information derived from the labor and resource-intensive SIP approaches. Yet, a review of publicly available SIP sequencing metadata showed that critical information necessary for reproducibility and reuse was often missing. Here, we outline the Minimum Information for any Stable Isotope Probing Sequence (MISIP) according to the Minimum Information for any (x) Sequence (MIxS) framework and include examples of MISIP reporting for common SIP experiments. Our objectives are to expand the capacity of MIxS to accommodate SIP-specific metadata and guide SIP users in metadata collection when planning and reporting an experiment. The MISIP standard requires 5 metadata fields-isotope, isotopolog, isotopolog label, labeling approach, and gradient position-and recommends several fields that represent best practices in acquiring and reporting SIP sequencing data (e.g., gradient density and nucleic acid amount). The standard is intended to be used in concert with other MIxS checklists to comprehensively describe the origin of sequence data, such as for marker genes (MISIP-MIMARKS) or metagenomes (MISIP-MIMS), in combination with metadata required by an environmental extension (e.g., soil). The adoption of the proposed data standard will improve the reuse of any sequence derived from a SIP experiment and, by extension, deepen understanding of in situ biogeochemical processes and microbial ecology.</p>","PeriodicalId":12581,"journal":{"name":"GigaScience","volume":null,"pages":null},"PeriodicalIF":11.8,"publicationDate":"2024-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11471955/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142462742","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
GigaSciencePub Date : 2024-01-02DOI: 10.1093/gigascience/giae067
Denghua Yin, Chunhai Chen, Danqing Lin, Zhong Hua, Congping Ying, Jialu Zhang, Chenxi Zhao, Yan Liu, Zhichen Cao, Han Zhang, Chenhe Wang, Liping Liang, Pao Xu, Jianbo Jian, Kai Liu
{"title":"Telomere-to-telomere gap-free genome assembly of the endangered Yangtze finless porpoise and East Asian finless porpoise.","authors":"Denghua Yin, Chunhai Chen, Danqing Lin, Zhong Hua, Congping Ying, Jialu Zhang, Chenxi Zhao, Yan Liu, Zhichen Cao, Han Zhang, Chenhe Wang, Liping Liang, Pao Xu, Jianbo Jian, Kai Liu","doi":"10.1093/gigascience/giae067","DOIUrl":"10.1093/gigascience/giae067","url":null,"abstract":"<p><strong>Background: </strong>The Yangtze finless porpoise (Neophocaena asiaeorientalis asiaeorientalis, YFP) and the East Asian finless porpoise (Neophocaena asiaeorientalis sunameri, EFP) are 2 subspecies of the narrow-ridged finless porpoise that live in freshwater and saltwater, respectively. The main objective of this study was to provide contiguous chromosome-level genome assemblies for YFP and EFP.</p><p><strong>Results: </strong>Here, we generated and upgraded the genomes of YFP and EFP at the telomere-to-telomere level through the integration of PacBio HiFi long reads, ultra-long ONT reads, and Hi-C sequencing data with a total size of 2.48 Gb and 2.50 Gb, respectively. The scaffold N50 of 2 genomes was 125.12 Mb (YFP) and 128 Mb (EFP) with 1 contig for 1 chromosome. The telomere repeat and centromere position were clearly identified in both YFP and EFP genomes. In total, 5,480 newfound genes were detected in the YFP genome, including 56 genes located in the newly identified centromere regions. Additionally, synteny blocks, structural similarities, phylogenetic relationships, gene family expansion, and inference of selection were studied in connection with the genomes of other related mammals.</p><p><strong>Conclusions: </strong>Our research findings provide evidence for the gradual adaptation of EFP in a marine environment and the potential sensitivity of YFP to genetic damage. Compared to the 34 cetacean genomes sourced from public databases, the 2 new assemblies demonstrate superior continuity with the longest contig N50 and scaffold N50 values, as well as the lowest number of contigs. The improvement of telomere-to-telomere gap-free reference genome resources supports conservation genetics and population management for finless porpoises.</p>","PeriodicalId":12581,"journal":{"name":"GigaScience","volume":null,"pages":null},"PeriodicalIF":11.8,"publicationDate":"2024-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11403816/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142283911","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
GigaSciencePub Date : 2024-01-02DOI: 10.1093/gigascience/giae048
Yiming Zhang, Yunxiao Shen, Pengyu Jin, Bingyue Zhu, Yejie Lin, Tongyao Jiang, Xianting Huang, Yang Wang, Zhe Zhao, Shuqiang Li
{"title":"A trade-off in evolution: the adaptive landscape of spiders without venom glands.","authors":"Yiming Zhang, Yunxiao Shen, Pengyu Jin, Bingyue Zhu, Yejie Lin, Tongyao Jiang, Xianting Huang, Yang Wang, Zhe Zhao, Shuqiang Li","doi":"10.1093/gigascience/giae048","DOIUrl":"10.1093/gigascience/giae048","url":null,"abstract":"<p><strong>Background: </strong>Venom glands play a key role in the predation and defense strategies of almost all spider groups. However, the spider family Uloboridae lacks venom glands and has evolved an adaptive strategy: they excessively wrap their prey directly with spider silk instead of paralyzing it first with toxins. This shift in survival strategy is very fascinating, but the genetic underpinnings behind it are poorly understood.</p><p><strong>Results: </strong>Spanning multiple spider groups, we conducted multiomics analyses on Octonoba sinensis and described the adaptive evolution of the Uloboridae family at the genome level. We observed the coding genes of myosin and twitchin in muscles are under positive selection, energy metabolism functions are enhanced, and gene families related to tracheal development and tissue mechanical strength are expanded or emerged, all of which are related to the unique anatomical structure and predatory behavior of spiders in the family Uloboridae. In addition, we also scanned the elements that are absent or under relaxed purifying selection, as well as toxin gene homologs in the genomes of 2 species in this family. The results show that the absence of regions and regions under relaxed selection in these spiders' genomes are concentrated in areas related to development and neurosystem. The search for toxin homologs reveals possible gene function shift between toxins and nontoxins and confirms that there are no reliable toxin genes in the genome of this group.</p><p><strong>Conclusions: </strong>This study demonstrates the trade-off between different predation strategies in spiders, using either chemical or physical strategy, and provides insights into the possible mechanism underlying this trade-off. Venomless spiders need to mobilize multiple developmental and metabolic pathways related to motor function and limb mechanical strength to cover the decline in adaptability caused by the absence of venom glands.</p>","PeriodicalId":12581,"journal":{"name":"GigaScience","volume":null,"pages":null},"PeriodicalIF":11.8,"publicationDate":"2024-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11299198/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141888975","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
GigaSciencePub Date : 2024-01-02DOI: 10.1093/gigascience/giae051
Eva Aßmann, Shelesh Agrawal, Laura Orschler, Sindy Böttcher, Susanne Lackner, Martin Hölzer
{"title":"Impact of reference design on estimating SARS-CoV-2 lineage abundances from wastewater sequencing data.","authors":"Eva Aßmann, Shelesh Agrawal, Laura Orschler, Sindy Böttcher, Susanne Lackner, Martin Hölzer","doi":"10.1093/gigascience/giae051","DOIUrl":"10.1093/gigascience/giae051","url":null,"abstract":"<p><strong>Background: </strong>Sequencing of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) RNA from wastewater samples has emerged as a valuable tool for detecting the presence and relative abundances of SARS-CoV-2 variants in a community. By analyzing the viral genetic material present in wastewater, researchers and public health authorities can gain early insights into the spread of virus lineages and emerging mutations. Constructing reference datasets from known SARS-CoV-2 lineages and their mutation profiles has become state-of-the-art for assigning viral lineages and their relative abundances from wastewater sequencing data. However, selecting reference sequences or mutations directly affects the predictive power.</p><p><strong>Results: </strong>Here, we show the impact of a mutation- and sequence-based reference reconstruction for SARS-CoV-2 abundance estimation. We benchmark 3 datasets: (i) synthetic \"spike-in\"' mixtures; (ii) German wastewater samples from early 2021, mainly comprising Alpha; and (iii) samples obtained from wastewater at an international airport in Germany from the end of 2021, including first signals of Omicron. The 2 approaches differ in sublineage detection, with the marker mutation-based method, in particular, being challenged by the increasing number of mutations and lineages. However, the estimations of both approaches depend on selecting representative references and optimized parameter settings. By performing parameter escalation experiments, we demonstrate the effects of reference size and alternative allele frequency cutoffs for abundance estimation. We show how different parameter settings can lead to different results for our test datasets and illustrate the effects of virus lineage composition of wastewater samples and references.</p><p><strong>Conclusions: </strong>Our study highlights current computational challenges, focusing on the general reference design, which directly impacts abundance allocations. We illustrate advantages and disadvantages that may be relevant for further developments in the wastewater community and in the context of defining robust quality metrics.</p>","PeriodicalId":12581,"journal":{"name":"GigaScience","volume":null,"pages":null},"PeriodicalIF":11.8,"publicationDate":"2024-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11308188/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141906392","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
GigaSciencePub Date : 2024-01-02DOI: 10.1093/gigascience/giae025
Mingzhou Bai, Sanjie Jiang, Shanshan Chu, Yangyang Yu, Dai Shan, Chun Liu, Liang Zong, Qun Liu, Nana Liu, Weisong Xu, Zhanlong Mei, Jianbo Jian, Chi Zhang, Shancen Zhao, Tsan-Yu Chiu, Henrik Toft Simonsen
{"title":"The telomere-to-telomere (T2T) genome of Peucedanum praeruptorum Dunn provides insights into the genome evolution and coumarin biosynthesis.","authors":"Mingzhou Bai, Sanjie Jiang, Shanshan Chu, Yangyang Yu, Dai Shan, Chun Liu, Liang Zong, Qun Liu, Nana Liu, Weisong Xu, Zhanlong Mei, Jianbo Jian, Chi Zhang, Shancen Zhao, Tsan-Yu Chiu, Henrik Toft Simonsen","doi":"10.1093/gigascience/giae025","DOIUrl":"10.1093/gigascience/giae025","url":null,"abstract":"<p><strong>Background: </strong>Traditional Chinese medicine has used Peucedanum praeruptorum Dunn (Apiaceae) for a long time. Various coumarins, including the significant constituents praeruptorin (A-E), are the active constituents in the dried roots of P. praeruptorum. Previous transcriptomic and metabolomic studies have attempted to elucidate the distribution and biosynthetic network of these medicinal-valuable compounds. However, the lack of a high-quality reference genome impedes an in-depth understanding of genetic traits and thus the development of better breeding strategies.</p><p><strong>Results: </strong>A telomere-to-telomere (T2T) genome was assembled for P. praeruptorum by combining PacBio HiFi, ONT ultra-long, and Hi-C data. The final genome assembly was approximately 1.798 Gb, assigned to 11 chromosomes with genome completeness >98%. Comparative genomic analysis suggested that P. praeruptorum experienced 2 whole-genome duplication events. By the transcriptomic and metabolomic analysis of the coumarin metabolic pathway, we presented coumarins' spatial and temporal distribution and the expression patterns of critical genes for its biosynthesis. Notably, the COSY and cytochrome P450 genes showed tandem duplications on several chromosomes, which may be responsible for the high accumulation of coumarins.</p><p><strong>Conclusions: </strong>A T2T genome for P. praeruptorum was obtained, providing molecular insights into the chromosomal distribution of the coumarin biosynthetic genes. This high-quality genome is an essential resource for designing engineering strategies for improving the production of these valuable compounds.</p>","PeriodicalId":12581,"journal":{"name":"GigaScience","volume":null,"pages":null},"PeriodicalIF":11.8,"publicationDate":"2024-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11152176/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141261607","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
GigaSciencePub Date : 2024-01-02DOI: 10.1093/gigascience/giae037
Jakub Laznovsky, Michaela Kavkova, Alice Helena Reis, Pavla Robovska-Havelkova, Lorena Agostini Maia, Jan Krivanek, Tomas Zikmund, Jozef Kaiser, Marcela Buchtova, Jakub Harnos
{"title":"Unveiling vertebrate development dynamics in frog Xenopus laevis using micro-CT imaging.","authors":"Jakub Laznovsky, Michaela Kavkova, Alice Helena Reis, Pavla Robovska-Havelkova, Lorena Agostini Maia, Jan Krivanek, Tomas Zikmund, Jozef Kaiser, Marcela Buchtova, Jakub Harnos","doi":"10.1093/gigascience/giae037","DOIUrl":"10.1093/gigascience/giae037","url":null,"abstract":"<p><strong>Background: </strong>Xenopus laevis, the African clawed frog, is a versatile vertebrate model organism in various biological disciplines, prominently in developmental biology to study body plan reorganization during metamorphosis. However, a notable gap exists in the availability of comprehensive datasets encompassing Xenopus' late developmental stages.</p><p><strong>Findings: </strong>This study utilized micro-computed tomography (micro-CT), a noninvasive 3-dimensional (3D) imaging technique with micrometer-scale resolution, to explore the developmental dynamics and morphological changes in Xenopus laevis. Our approach involved generating high-resolution images and computed 3D models of developing Xenopus specimens, spanning from premetamorphosis tadpoles to fully mature adults. This dataset enhances our understanding of vertebrate development and supports various analyses. We conducted a careful examination, analyzing body size, shape, and morphological features, focusing on skeletogenesis, teeth, and organs like the brain and gut at different stages. Our analysis yielded valuable insights into 3D morphological changes during Xenopus' development, documenting details previously unrecorded. These datasets hold the solid potential for further morphological and morphometric analyses, including segmentation of hard and soft tissues.</p><p><strong>Conclusions: </strong>Our repository of micro-CT scans represents a significant resource that can enhance our understanding of Xenopus' development and the associated morphological changes in the future. The widespread utility of this amphibian species, coupled with the exceptional quality of our scans, which encompass a comprehensive series of developmental stages, opens up extensive opportunities for their broader research application. Moreover, these scans can be used in virtual reality, 3D printing, and educational contexts, further expanding their value and impact.</p>","PeriodicalId":12581,"journal":{"name":"GigaScience","volume":null,"pages":null},"PeriodicalIF":11.8,"publicationDate":"2024-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141626445","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
GigaSciencePub Date : 2024-01-02DOI: 10.1093/gigascience/giad117
Junyi Chen, Danqing Yin, Harris Y H Wong, Xin Duan, Ken H O Yu, Joshua W K Ho
{"title":"Vulture: cloud-enabled scalable mining of microbial reads in public scRNA-seq data.","authors":"Junyi Chen, Danqing Yin, Harris Y H Wong, Xin Duan, Ken H O Yu, Joshua W K Ho","doi":"10.1093/gigascience/giad117","DOIUrl":"10.1093/gigascience/giad117","url":null,"abstract":"<p><p>The rapidly growing collection of public single-cell sequencing data has become a valuable resource for molecular, cellular, and microbial discovery. Previous studies mostly overlooked detecting pathogens in human single-cell sequencing data. Moreover, existing bioinformatics tools lack the scalability to deal with big public data. We introduce Vulture, a scalable cloud-based pipeline that performs microbial calling for single-cell RNA sequencing (scRNA-seq) data, enabling meta-analysis of host-microbial studies from the public domain. In our benchmarking experiments, Vulture is 66% to 88% faster than local tools (PathogenTrack and Venus) and 41% faster than the state-of-the-art cloud-based tool Cumulus, while achieving comparable microbial read identification. In terms of the cost on cloud computing systems, Vulture also shows a cost reduction of 83% ($12 vs. ${$}$70). We applied Vulture to 2 coronavirus disease 2019, 3 hepatocellular carcinoma (HCC), and 2 gastric cancer human patient cohorts with public sequencing reads data from scRNA-seq experiments and discovered cell type-specific enrichment of severe acute respiratory syndrome coronavirus 2, hepatitis B virus (HBV), and Helicobacter pylori-positive cells, respectively. In the HCC analysis, all cohorts showed hepatocyte-only enrichment of HBV, with cell subtype-associated HBV enrichment based on inferred copy number variations. In summary, Vulture presents a scalable and economical framework to mine unknown host-microbial interactions from large-scale public scRNA-seq data. Vulture is available via an open-source license at https://github.com/holab-hku/Vulture.</p>","PeriodicalId":12581,"journal":{"name":"GigaScience","volume":null,"pages":null},"PeriodicalIF":11.8,"publicationDate":"2024-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10776309/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139402560","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}