使用“Unikseq2”的大型基因组数据集中的保守序列鉴定:在环境DNA分析开发中的应用。

IF 5.5 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY
Mark Louie D Lopez, René L Warren, Michael J Allison, Lauren Coombe, Jacob J Imbery, Inanc Birol, Caren C Helbing
{"title":"使用“Unikseq2”的大型基因组数据集中的保守序列鉴定:在环境DNA分析开发中的应用。","authors":"Mark Louie D Lopez, René L Warren, Michael J Allison, Lauren Coombe, Jacob J Imbery, Inanc Birol, Caren C Helbing","doi":"10.1111/1755-0998.70014","DOIUrl":null,"url":null,"abstract":"<p><p>Identification of conserved genomic sequences and their utilisation as anchor points for clade detection and/or characterisation is a mainstay in ecological studies. For environmental DNA (eDNA) assays, effective processing of large genomic datasets is crucial for reliable species detection in biodiversity monitoring. While considerable focus has been on developing robust species-targeted assays, eDNA assays with broader taxonomic coverage (e.g., detecting any species within a taxonomic group such as fish), can significantly streamline environmental monitoring, especially when detecting individual species' DNA proves challenging. Designing such assays requires identifying conserved regions representing the target taxonomic group, a chiefly manual task that is often labor-intensive and error-prone, particularly when working with large sequence datasets. To address these challenges, we present unikseq2, an enhanced, alignment-free, k-mer-based tool for identifying unique and conserved sequences. It introduces a new functionality to identify sequence conservation among target species, enabling more informed marker selection for applications such as universal primer design. This automates sequence selection in large-scale mitochondrial genome datasets eliminating the need for manual inspection of computationally costly multiple sequence alignments. Herein, we demonstrate unikseq2's capabilities by developing and validating eDNA assays for various taxa, including Osteichthyes (bony fishes), the Salmonidae family (salmon and trout), Myotis bats and Cervus deer. Unikseq2-based eDNA assays allow for accurate detection across multiple taxonomic levels, from genus to class, enhancing the flexibility, scalability and reliability of eDNA tools in environmental monitoring. By leveraging genomic data from public repositories, unikseq2 supports efficient, reproducible assay design, making it an invaluable tool for a wide range of ecological and biodiversity research applications.</p>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":" ","pages":"e70014"},"PeriodicalIF":5.5000,"publicationDate":"2025-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Conserved Sequence Identification Within Large Genomic Datasets Using 'Unikseq2': Application in Environmental DNA Assay Development.\",\"authors\":\"Mark Louie D Lopez, René L Warren, Michael J Allison, Lauren Coombe, Jacob J Imbery, Inanc Birol, Caren C Helbing\",\"doi\":\"10.1111/1755-0998.70014\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Identification of conserved genomic sequences and their utilisation as anchor points for clade detection and/or characterisation is a mainstay in ecological studies. For environmental DNA (eDNA) assays, effective processing of large genomic datasets is crucial for reliable species detection in biodiversity monitoring. While considerable focus has been on developing robust species-targeted assays, eDNA assays with broader taxonomic coverage (e.g., detecting any species within a taxonomic group such as fish), can significantly streamline environmental monitoring, especially when detecting individual species' DNA proves challenging. Designing such assays requires identifying conserved regions representing the target taxonomic group, a chiefly manual task that is often labor-intensive and error-prone, particularly when working with large sequence datasets. To address these challenges, we present unikseq2, an enhanced, alignment-free, k-mer-based tool for identifying unique and conserved sequences. It introduces a new functionality to identify sequence conservation among target species, enabling more informed marker selection for applications such as universal primer design. This automates sequence selection in large-scale mitochondrial genome datasets eliminating the need for manual inspection of computationally costly multiple sequence alignments. Herein, we demonstrate unikseq2's capabilities by developing and validating eDNA assays for various taxa, including Osteichthyes (bony fishes), the Salmonidae family (salmon and trout), Myotis bats and Cervus deer. Unikseq2-based eDNA assays allow for accurate detection across multiple taxonomic levels, from genus to class, enhancing the flexibility, scalability and reliability of eDNA tools in environmental monitoring. By leveraging genomic data from public repositories, unikseq2 supports efficient, reproducible assay design, making it an invaluable tool for a wide range of ecological and biodiversity research applications.</p>\",\"PeriodicalId\":211,\"journal\":{\"name\":\"Molecular Ecology Resources\",\"volume\":\" \",\"pages\":\"e70014\"},\"PeriodicalIF\":5.5000,\"publicationDate\":\"2025-07-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Molecular Ecology Resources\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1111/1755-0998.70014\",\"RegionNum\":1,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular Ecology Resources","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1111/1755-0998.70014","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

鉴定保守的基因组序列并利用它们作为进化支检测和/或特征的锚点是生态学研究的主要内容。对于环境DNA (eDNA)分析,有效处理大型基因组数据集对于生物多样性监测中可靠的物种检测至关重要。虽然相当多的焦点一直放在开发强大的物种靶向分析上,但具有更广泛分类覆盖范围的eDNA分析(例如,检测一个分类群体中的任何物种,如鱼类)可以显著简化环境监测,特别是当检测单个物种的DNA被证明具有挑战性时。设计这样的分析需要识别代表目标分类群的保守区域,这是一项主要的手工任务,通常是劳动密集型的,容易出错,特别是在处理大型序列数据集时。为了解决这些挑战,我们提出了unikseq2,一个增强的,无比对的,基于k-mer的工具,用于识别独特和保守序列。它引入了一个新的功能来识别目标物种之间的序列保守性,为通用引物设计等应用提供了更明智的标记选择。这自动化了大规模线粒体基因组数据集的序列选择,消除了人工检查计算成本高的多个序列比对的需要。在这里,我们通过开发和验证不同分类群的eDNA分析来展示unikseq2的能力,包括骨鱼科(硬骨鱼类)、鲑鱼科(鲑鱼和鳟鱼)、Myotis蝙蝠和鹿。基于unikseq2的eDNA分析允许从属到类的多个分类水平的准确检测,增强环境监测中eDNA工具的灵活性,可扩展性和可靠性。通过利用来自公共存储库的基因组数据,unikseq2支持高效,可重复的分析设计,使其成为广泛的生态和生物多样性研究应用的宝贵工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Conserved Sequence Identification Within Large Genomic Datasets Using 'Unikseq2': Application in Environmental DNA Assay Development.

Identification of conserved genomic sequences and their utilisation as anchor points for clade detection and/or characterisation is a mainstay in ecological studies. For environmental DNA (eDNA) assays, effective processing of large genomic datasets is crucial for reliable species detection in biodiversity monitoring. While considerable focus has been on developing robust species-targeted assays, eDNA assays with broader taxonomic coverage (e.g., detecting any species within a taxonomic group such as fish), can significantly streamline environmental monitoring, especially when detecting individual species' DNA proves challenging. Designing such assays requires identifying conserved regions representing the target taxonomic group, a chiefly manual task that is often labor-intensive and error-prone, particularly when working with large sequence datasets. To address these challenges, we present unikseq2, an enhanced, alignment-free, k-mer-based tool for identifying unique and conserved sequences. It introduces a new functionality to identify sequence conservation among target species, enabling more informed marker selection for applications such as universal primer design. This automates sequence selection in large-scale mitochondrial genome datasets eliminating the need for manual inspection of computationally costly multiple sequence alignments. Herein, we demonstrate unikseq2's capabilities by developing and validating eDNA assays for various taxa, including Osteichthyes (bony fishes), the Salmonidae family (salmon and trout), Myotis bats and Cervus deer. Unikseq2-based eDNA assays allow for accurate detection across multiple taxonomic levels, from genus to class, enhancing the flexibility, scalability and reliability of eDNA tools in environmental monitoring. By leveraging genomic data from public repositories, unikseq2 supports efficient, reproducible assay design, making it an invaluable tool for a wide range of ecological and biodiversity research applications.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Molecular Ecology Resources
Molecular Ecology Resources 生物-进化生物学
CiteScore
15.60
自引率
5.20%
发文量
170
审稿时长
3 months
期刊介绍: Molecular Ecology Resources promotes the creation of comprehensive resources for the scientific community, encompassing computer programs, statistical and molecular advancements, and a diverse array of molecular tools. Serving as a conduit for disseminating these resources, the journal targets a broad audience of researchers in the fields of evolution, ecology, and conservation. Articles in Molecular Ecology Resources are crafted to support investigations tackling significant questions within these disciplines. In addition to original resource articles, Molecular Ecology Resources features Reviews, Opinions, and Comments relevant to the field. The journal also periodically releases Special Issues focusing on resource development within specific areas.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信