{"title":"Pulse Coupled Neural Network Edge-Based Algorithm for Image Text Locating*","authors":"Zhang Xin (张昕), Sun Fuchun (孙富春)","doi":"10.1016/S1007-0214(11)70004-9","DOIUrl":null,"url":null,"abstract":"<div><p>This paper presents a method for locating text based on a simplified pulse coupled neural network (PCNN). The PCNN generates a firings map in a similar way to the human visual system<span> with non-linear image processing<span>. The PCNN is used to segment the original image into different planes and edges detected using both the PCNN firings map and a phase congruency detector. The different edges are integrated using an automatically adjusted weighting coefficient. Both the simplified PCNN and the phase congruency energy model in the frequency domain imitate the human visual system. This paper shows how to use PCNN by changing the compute space from the spatial domain to the frequency domain for solving the text location problem. The algorithm is a simplified PCNN edge-based (PCNNE) algorithm. Three comparison tests are used to evaluate the algorithm. Tests on large data sets show PCNNE efficiently detects texts with various colors, font sizes, positions, and uneven illumination. This method outperforms several traditional methods both in text detection rate and text detection accuracy.</span></span></p></div>","PeriodicalId":60306,"journal":{"name":"Tsinghua Science and Technology","volume":null,"pages":null},"PeriodicalIF":5.2000,"publicationDate":"2011-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/S1007-0214(11)70004-9","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Tsinghua Science and Technology","FirstCategoryId":"1093","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1007021411700049","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 6
Abstract
This paper presents a method for locating text based on a simplified pulse coupled neural network (PCNN). The PCNN generates a firings map in a similar way to the human visual system with non-linear image processing. The PCNN is used to segment the original image into different planes and edges detected using both the PCNN firings map and a phase congruency detector. The different edges are integrated using an automatically adjusted weighting coefficient. Both the simplified PCNN and the phase congruency energy model in the frequency domain imitate the human visual system. This paper shows how to use PCNN by changing the compute space from the spatial domain to the frequency domain for solving the text location problem. The algorithm is a simplified PCNN edge-based (PCNNE) algorithm. Three comparison tests are used to evaluate the algorithm. Tests on large data sets show PCNNE efficiently detects texts with various colors, font sizes, positions, and uneven illumination. This method outperforms several traditional methods both in text detection rate and text detection accuracy.