Mario Banuelos, Lasith Adhikari, R. Almanza, Andrew Fujikawa, Jonathan Sahagun, Katharine Sanderson, M. Spence, Suzanne S. Sindi, Roummel F. Marcia
{"title":"Sparse diploid spatial biosignal recovery for genomic variation detection","authors":"Mario Banuelos, Lasith Adhikari, R. Almanza, Andrew Fujikawa, Jonathan Sahagun, Katharine Sanderson, M. Spence, Suzanne S. Sindi, Roummel F. Marcia","doi":"10.1109/MeMeA.2017.7985888","DOIUrl":null,"url":null,"abstract":"Structural variants (SVs) - such as duplications, deletions and inversions - are rearrangements of an individual's genome relative to a given reference. The common method for detection of SVs is to sequence fragments from an individual's genome, map them to the appropriate reference and, by identifying discordant mappings, predict the locations and type of SV. However, errors in both the sequencing and mapping process will result in signals that look like SVs, resulting in inaccurate predictions. In addition, because of variation in sequencing coverage even when the evidence of an SV is present, determining if an individual has the SV present on one or both of their chromosomes is challenging. In our work, we seek to improve upon standard methods for SV detection in three ways. First, to reduce false-positive predictions, we simultaneously predict SVs in a parent and child using properties of inheritance to constrain the space of possible SVs. Second, we predict if a variant is homozygous (SV is on two chromosomes) or heterozygous (SV is on one chromosome). Third, we utilize a gradient-based optimization approach and constrain our solution with a sparsity-promoting ℓ1 penalty (since SV instances should be rare). We demonstrate the improved performance of our computational approach on both simulated genomes as well as a parent-child trio from the 1000 Genomes Project.","PeriodicalId":235051,"journal":{"name":"2017 IEEE International Symposium on Medical Measurements and Applications (MeMeA)","volume":"97 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE International Symposium on Medical Measurements and Applications (MeMeA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MeMeA.2017.7985888","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Structural variants (SVs) - such as duplications, deletions and inversions - are rearrangements of an individual's genome relative to a given reference. The common method for detection of SVs is to sequence fragments from an individual's genome, map them to the appropriate reference and, by identifying discordant mappings, predict the locations and type of SV. However, errors in both the sequencing and mapping process will result in signals that look like SVs, resulting in inaccurate predictions. In addition, because of variation in sequencing coverage even when the evidence of an SV is present, determining if an individual has the SV present on one or both of their chromosomes is challenging. In our work, we seek to improve upon standard methods for SV detection in three ways. First, to reduce false-positive predictions, we simultaneously predict SVs in a parent and child using properties of inheritance to constrain the space of possible SVs. Second, we predict if a variant is homozygous (SV is on two chromosomes) or heterozygous (SV is on one chromosome). Third, we utilize a gradient-based optimization approach and constrain our solution with a sparsity-promoting ℓ1 penalty (since SV instances should be rare). We demonstrate the improved performance of our computational approach on both simulated genomes as well as a parent-child trio from the 1000 Genomes Project.