Mark Louie D Lopez, René L Warren, Michael J Allison, Lauren Coombe, Jacob J Imbery, Inanc Birol, Caren C Helbing
{"title":"使用“Unikseq2”的大型基因组数据集中的保守序列鉴定:在环境DNA分析开发中的应用。","authors":"Mark Louie D Lopez, René L Warren, Michael J Allison, Lauren Coombe, Jacob J Imbery, Inanc Birol, Caren C Helbing","doi":"10.1111/1755-0998.70014","DOIUrl":null,"url":null,"abstract":"<p><p>Identification of conserved genomic sequences and their utilisation as anchor points for clade detection and/or characterisation is a mainstay in ecological studies. For environmental DNA (eDNA) assays, effective processing of large genomic datasets is crucial for reliable species detection in biodiversity monitoring. While considerable focus has been on developing robust species-targeted assays, eDNA assays with broader taxonomic coverage (e.g., detecting any species within a taxonomic group such as fish), can significantly streamline environmental monitoring, especially when detecting individual species' DNA proves challenging. Designing such assays requires identifying conserved regions representing the target taxonomic group, a chiefly manual task that is often labor-intensive and error-prone, particularly when working with large sequence datasets. To address these challenges, we present unikseq2, an enhanced, alignment-free, k-mer-based tool for identifying unique and conserved sequences. It introduces a new functionality to identify sequence conservation among target species, enabling more informed marker selection for applications such as universal primer design. This automates sequence selection in large-scale mitochondrial genome datasets eliminating the need for manual inspection of computationally costly multiple sequence alignments. Herein, we demonstrate unikseq2's capabilities by developing and validating eDNA assays for various taxa, including Osteichthyes (bony fishes), the Salmonidae family (salmon and trout), Myotis bats and Cervus deer. Unikseq2-based eDNA assays allow for accurate detection across multiple taxonomic levels, from genus to class, enhancing the flexibility, scalability and reliability of eDNA tools in environmental monitoring. By leveraging genomic data from public repositories, unikseq2 supports efficient, reproducible assay design, making it an invaluable tool for a wide range of ecological and biodiversity research applications.</p>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":" ","pages":"e70014"},"PeriodicalIF":5.5000,"publicationDate":"2025-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Conserved Sequence Identification Within Large Genomic Datasets Using 'Unikseq2': Application in Environmental DNA Assay Development.\",\"authors\":\"Mark Louie D Lopez, René L Warren, Michael J Allison, Lauren Coombe, Jacob J Imbery, Inanc Birol, Caren C Helbing\",\"doi\":\"10.1111/1755-0998.70014\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Identification of conserved genomic sequences and their utilisation as anchor points for clade detection and/or characterisation is a mainstay in ecological studies. For environmental DNA (eDNA) assays, effective processing of large genomic datasets is crucial for reliable species detection in biodiversity monitoring. While considerable focus has been on developing robust species-targeted assays, eDNA assays with broader taxonomic coverage (e.g., detecting any species within a taxonomic group such as fish), can significantly streamline environmental monitoring, especially when detecting individual species' DNA proves challenging. Designing such assays requires identifying conserved regions representing the target taxonomic group, a chiefly manual task that is often labor-intensive and error-prone, particularly when working with large sequence datasets. To address these challenges, we present unikseq2, an enhanced, alignment-free, k-mer-based tool for identifying unique and conserved sequences. It introduces a new functionality to identify sequence conservation among target species, enabling more informed marker selection for applications such as universal primer design. This automates sequence selection in large-scale mitochondrial genome datasets eliminating the need for manual inspection of computationally costly multiple sequence alignments. Herein, we demonstrate unikseq2's capabilities by developing and validating eDNA assays for various taxa, including Osteichthyes (bony fishes), the Salmonidae family (salmon and trout), Myotis bats and Cervus deer. Unikseq2-based eDNA assays allow for accurate detection across multiple taxonomic levels, from genus to class, enhancing the flexibility, scalability and reliability of eDNA tools in environmental monitoring. By leveraging genomic data from public repositories, unikseq2 supports efficient, reproducible assay design, making it an invaluable tool for a wide range of ecological and biodiversity research applications.</p>\",\"PeriodicalId\":211,\"journal\":{\"name\":\"Molecular Ecology Resources\",\"volume\":\" \",\"pages\":\"e70014\"},\"PeriodicalIF\":5.5000,\"publicationDate\":\"2025-07-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Molecular Ecology Resources\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1111/1755-0998.70014\",\"RegionNum\":1,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular Ecology Resources","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1111/1755-0998.70014","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
Conserved Sequence Identification Within Large Genomic Datasets Using 'Unikseq2': Application in Environmental DNA Assay Development.
Identification of conserved genomic sequences and their utilisation as anchor points for clade detection and/or characterisation is a mainstay in ecological studies. For environmental DNA (eDNA) assays, effective processing of large genomic datasets is crucial for reliable species detection in biodiversity monitoring. While considerable focus has been on developing robust species-targeted assays, eDNA assays with broader taxonomic coverage (e.g., detecting any species within a taxonomic group such as fish), can significantly streamline environmental monitoring, especially when detecting individual species' DNA proves challenging. Designing such assays requires identifying conserved regions representing the target taxonomic group, a chiefly manual task that is often labor-intensive and error-prone, particularly when working with large sequence datasets. To address these challenges, we present unikseq2, an enhanced, alignment-free, k-mer-based tool for identifying unique and conserved sequences. It introduces a new functionality to identify sequence conservation among target species, enabling more informed marker selection for applications such as universal primer design. This automates sequence selection in large-scale mitochondrial genome datasets eliminating the need for manual inspection of computationally costly multiple sequence alignments. Herein, we demonstrate unikseq2's capabilities by developing and validating eDNA assays for various taxa, including Osteichthyes (bony fishes), the Salmonidae family (salmon and trout), Myotis bats and Cervus deer. Unikseq2-based eDNA assays allow for accurate detection across multiple taxonomic levels, from genus to class, enhancing the flexibility, scalability and reliability of eDNA tools in environmental monitoring. By leveraging genomic data from public repositories, unikseq2 supports efficient, reproducible assay design, making it an invaluable tool for a wide range of ecological and biodiversity research applications.
期刊介绍:
Molecular Ecology Resources promotes the creation of comprehensive resources for the scientific community, encompassing computer programs, statistical and molecular advancements, and a diverse array of molecular tools. Serving as a conduit for disseminating these resources, the journal targets a broad audience of researchers in the fields of evolution, ecology, and conservation. Articles in Molecular Ecology Resources are crafted to support investigations tackling significant questions within these disciplines.
In addition to original resource articles, Molecular Ecology Resources features Reviews, Opinions, and Comments relevant to the field. The journal also periodically releases Special Issues focusing on resource development within specific areas.