Zhiling Guan, Patrick Lindsey, Rick Kamps, Hubert J M Smeets
{"title":"A bioinformatics pipeline for identifying homoplasmic and heteroplasmic mitochondrial DNA SNVs in single-cell RNA-Seq datasets.","authors":"Zhiling Guan, Patrick Lindsey, Rick Kamps, Hubert J M Smeets","doi":"10.1016/j.ygeno.2025.111122","DOIUrl":null,"url":null,"abstract":"<p><p>Mitochondrial DNA (mtDNA) single nucleotide variants (SNVs) are associated with various pathologies, predominantly in energy-demanding tissues like muscles and brain. Characterizing these SNVs at the single-cell level is crucial for understanding their mechanism and clinical manifestation. Publicly available single-cell RNA sequencing (scRNA-seq) data could be an invaluable resource, but existing pipelines fall short in reliable detection of mtDNA SNVs from scRNA-seq data. Therefore, we developed a novel bioinformatics pipeline, that includes quality control, alignment to the mitochondrial genome, SNV calling, and annotation, and that filters-out sequencing errors. Coverage-dependent thresholds are customizable for detecting heteroplasmic SNVs. Duplicate reads can be retained as the majority were valid biological duplicates. Strand bias errors, exceeding a 1:3 ratio, RNA modification-induced errors, identified by the presence of multiple alternative alleles at the same position, and overrepresented SNVs were removed. Our data demonstrated that this pipeline effectively detects homoplasmic and heteroplasmic mtDNA SNVs in scRNA-Seq data.</p>","PeriodicalId":12521,"journal":{"name":"Genomics","volume":" ","pages":"111122"},"PeriodicalIF":3.0000,"publicationDate":"2025-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genomics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1016/j.ygeno.2025.111122","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Mitochondrial DNA (mtDNA) single nucleotide variants (SNVs) are associated with various pathologies, predominantly in energy-demanding tissues like muscles and brain. Characterizing these SNVs at the single-cell level is crucial for understanding their mechanism and clinical manifestation. Publicly available single-cell RNA sequencing (scRNA-seq) data could be an invaluable resource, but existing pipelines fall short in reliable detection of mtDNA SNVs from scRNA-seq data. Therefore, we developed a novel bioinformatics pipeline, that includes quality control, alignment to the mitochondrial genome, SNV calling, and annotation, and that filters-out sequencing errors. Coverage-dependent thresholds are customizable for detecting heteroplasmic SNVs. Duplicate reads can be retained as the majority were valid biological duplicates. Strand bias errors, exceeding a 1:3 ratio, RNA modification-induced errors, identified by the presence of multiple alternative alleles at the same position, and overrepresented SNVs were removed. Our data demonstrated that this pipeline effectively detects homoplasmic and heteroplasmic mtDNA SNVs in scRNA-Seq data.
期刊介绍:
Genomics is a forum for describing the development of genome-scale technologies and their application to all areas of biological investigation.
As a journal that has evolved with the field that carries its name, Genomics focuses on the development and application of cutting-edge methods, addressing fundamental questions with potential interest to a wide audience. Our aim is to publish the highest quality research and to provide authors with rapid, fair and accurate review and publication of manuscripts falling within our scope.