{"title":"Conversion of Mate-Pair Reads into Long Sequences for Improving Assembly Scaffolding","authors":"Chao-Hung Lee, Cheng-Wei Tsai, Yao-Ting Huang","doi":"10.1109/ICS.2016.0018","DOIUrl":null,"url":null,"abstract":"Mate-pair sequencing is a technology for sequencing two ends of long DNA fragments, which has been widely used in genome scaffolding. Although the cost of mate-pair sequencing is now affordable, its accuracy has been limited by the lower quality and contamination. The 3rd generation sequencing is able to generate long reads for genome scaffolding. However, the error rates and cost are still too high. This paper aims to convert low-cost mate-pair reads into long reads using computational approaches, which has the benefits of both mate-pair reads and long reads for scaffolding. We test our methods by using several real datasets and validate the accuracy of converted long reads. In addition, the scaffolding results are compared using mate-pair reads, long reads, and mixture of both material.","PeriodicalId":281088,"journal":{"name":"2016 International Computer Symposium (ICS)","volume":"100 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Computer Symposium (ICS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICS.2016.0018","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Mate-pair sequencing is a technology for sequencing two ends of long DNA fragments, which has been widely used in genome scaffolding. Although the cost of mate-pair sequencing is now affordable, its accuracy has been limited by the lower quality and contamination. The 3rd generation sequencing is able to generate long reads for genome scaffolding. However, the error rates and cost are still too high. This paper aims to convert low-cost mate-pair reads into long reads using computational approaches, which has the benefits of both mate-pair reads and long reads for scaffolding. We test our methods by using several real datasets and validate the accuracy of converted long reads. In addition, the scaffolding results are compared using mate-pair reads, long reads, and mixture of both material.