{"title":"Polony Identification Using the EM Algorithm Based on a Gaussian Mixture Model","authors":"Wei Li, Paul M. Ruegger, J. Borneman, Tao Jiang","doi":"10.1109/BIBE.2010.43","DOIUrl":null,"url":null,"abstract":"Polony technology is a low-cost, high-throughput platform employed in several applications such as DNA sequencing, haplotyping and alternative pre-mRNA splicing analysis. Owing to their random placement, however, overlapping polonies occur often and may result in inaccurate or unusable data. Accurately identifying polony positions and sizes is essential for maximizing the quantity and quality of data aquired in an image, however, most existing identification algorithms do not handle overlapping polonies well. In this paper, we present a novel polony identification approach combining both a Gaussian Mixture Model (GMM) and the Expectation-Maximization (EM) algorithm. Experiments on simulated and real images of highly overlapping polonies show that our algorithm has a 10% to 20% increase in recall compared with the existing algorithms, while keeping precision at the same level.","PeriodicalId":330904,"journal":{"name":"2010 IEEE International Conference on BioInformatics and BioEngineering","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 IEEE International Conference on BioInformatics and BioEngineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBE.2010.43","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Polony technology is a low-cost, high-throughput platform employed in several applications such as DNA sequencing, haplotyping and alternative pre-mRNA splicing analysis. Owing to their random placement, however, overlapping polonies occur often and may result in inaccurate or unusable data. Accurately identifying polony positions and sizes is essential for maximizing the quantity and quality of data aquired in an image, however, most existing identification algorithms do not handle overlapping polonies well. In this paper, we present a novel polony identification approach combining both a Gaussian Mixture Model (GMM) and the Expectation-Maximization (EM) algorithm. Experiments on simulated and real images of highly overlapping polonies show that our algorithm has a 10% to 20% increase in recall compared with the existing algorithms, while keeping precision at the same level.