A bioinformatics pipeline for identifying homoplasmic and heteroplasmic mitochondrial DNA SNVs in single-cell RNA-Seq datasets.

IF 3 2区 生物学 Q2 BIOTECHNOLOGY & APPLIED MICROBIOLOGY
Zhiling Guan, Patrick Lindsey, Rick Kamps, Hubert J M Smeets
{"title":"A bioinformatics pipeline for identifying homoplasmic and heteroplasmic mitochondrial DNA SNVs in single-cell RNA-Seq datasets.","authors":"Zhiling Guan, Patrick Lindsey, Rick Kamps, Hubert J M Smeets","doi":"10.1016/j.ygeno.2025.111122","DOIUrl":null,"url":null,"abstract":"<p><p>Mitochondrial DNA (mtDNA) single nucleotide variants (SNVs) are associated with various pathologies, predominantly in energy-demanding tissues like muscles and brain. Characterizing these SNVs at the single-cell level is crucial for understanding their mechanism and clinical manifestation. Publicly available single-cell RNA sequencing (scRNA-seq) data could be an invaluable resource, but existing pipelines fall short in reliable detection of mtDNA SNVs from scRNA-seq data. Therefore, we developed a novel bioinformatics pipeline, that includes quality control, alignment to the mitochondrial genome, SNV calling, and annotation, and that filters-out sequencing errors. Coverage-dependent thresholds are customizable for detecting heteroplasmic SNVs. Duplicate reads can be retained as the majority were valid biological duplicates. Strand bias errors, exceeding a 1:3 ratio, RNA modification-induced errors, identified by the presence of multiple alternative alleles at the same position, and overrepresented SNVs were removed. Our data demonstrated that this pipeline effectively detects homoplasmic and heteroplasmic mtDNA SNVs in scRNA-Seq data.</p>","PeriodicalId":12521,"journal":{"name":"Genomics","volume":" ","pages":"111122"},"PeriodicalIF":3.0000,"publicationDate":"2025-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genomics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1016/j.ygeno.2025.111122","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Mitochondrial DNA (mtDNA) single nucleotide variants (SNVs) are associated with various pathologies, predominantly in energy-demanding tissues like muscles and brain. Characterizing these SNVs at the single-cell level is crucial for understanding their mechanism and clinical manifestation. Publicly available single-cell RNA sequencing (scRNA-seq) data could be an invaluable resource, but existing pipelines fall short in reliable detection of mtDNA SNVs from scRNA-seq data. Therefore, we developed a novel bioinformatics pipeline, that includes quality control, alignment to the mitochondrial genome, SNV calling, and annotation, and that filters-out sequencing errors. Coverage-dependent thresholds are customizable for detecting heteroplasmic SNVs. Duplicate reads can be retained as the majority were valid biological duplicates. Strand bias errors, exceeding a 1:3 ratio, RNA modification-induced errors, identified by the presence of multiple alternative alleles at the same position, and overrepresented SNVs were removed. Our data demonstrated that this pipeline effectively detects homoplasmic and heteroplasmic mtDNA SNVs in scRNA-Seq data.

在单细胞RNA-Seq数据集中鉴定同质和异质线粒体DNA snv的生物信息学管道。
线粒体DNA (mtDNA)单核苷酸变异(snv)与多种病理有关,主要发生在肌肉和大脑等高能量组织中。在单细胞水平上表征这些snv对于了解其机制和临床表现至关重要。公开的单细胞RNA测序(scRNA-seq)数据可能是一种宝贵的资源,但现有的管道在从scRNA-seq数据中可靠地检测mtDNA snv方面存在不足。因此,我们开发了一种新的生物信息学管道,包括质量控制、线粒体基因组比对、SNV调用和注释,并过滤掉测序错误。覆盖率相关的阈值可定制用于检测异质性snv。重复的读取可以保留,因为大多数是有效的生物重复。去除超过1:3比例的链偏置错误、RNA修饰引起的错误(通过在同一位置存在多个替代等位基因来识别)和过度代表的snv。我们的数据表明,该管道可以有效地检测scRNA-Seq数据中的同质和异质mtDNA snv。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Genomics
Genomics 生物-生物工程与应用微生物
CiteScore
9.60
自引率
2.30%
发文量
260
审稿时长
60 days
期刊介绍: Genomics is a forum for describing the development of genome-scale technologies and their application to all areas of biological investigation. As a journal that has evolved with the field that carries its name, Genomics focuses on the development and application of cutting-edge methods, addressing fundamental questions with potential interest to a wide audience. Our aim is to publish the highest quality research and to provide authors with rapid, fair and accurate review and publication of manuscripts falling within our scope.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信