Study on video text processing
Tianxue Zhao, Guangmin Sun, Cheng Zhang, Deming Chen
2008 IEEE International Symposium on Industrial Electronics, 2008-11-18
DOI: 10.1109/ISIE.2008.4677025 (https://doi.org/10.1109/ISIE.2008.4677025)
This paper combines the discrete wavelet transform (DWT) with a neural network to locate text regions in video. After analyzing the characteristics of video text, kurtosis is used as a feature for text extraction for the first time, which improves the locating accuracy considerably. The OCR accuracy is also improved by enhancing the extracted video text with Shannon interpolation and Niblack adaptive thresholding. Experiments show that these methods raise the locating accuracy to about 90.4% and the OCR accuracy to about 85.1%.
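
To make two of the named techniques concrete, the following is a minimal Python sketch of a kurtosis feature and of Niblack's local threshold, T = local mean + k * local standard deviation. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not say which values the kurtosis is computed over, nor which window size or k is used for thresholding. The function names, the 3x3 window, and k = -0.2 are hypothetical choices made for this example.

```python
import math


def kurtosis(values):
    """Pearson kurtosis m4 / m2**2 of a sequence (a normal distribution gives 3).

    Assumption: the values could be, for example, wavelet coefficients of a
    candidate block; the abstract does not specify this.
    """
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    # A constant block has zero variance; return 0.0 instead of dividing by zero.
    return m4 / (m2 * m2) if m2 > 0 else 0.0


def niblack_binarize(image, window=3, k=-0.2):
    """Binarize a grayscale image (list of rows) with Niblack's local threshold.

    For each pixel, T = local_mean + k * local_std over a window x window
    neighbourhood clipped at the image borders. Pixels above T map to 1,
    all others to 0. The defaults for window and k are illustrative only.
    """
    h, w = len(image), len(image[0])
    r = window // 2
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            patch = [
                image[j][i]
                for j in range(max(0, y - r), min(h, y + r + 1))
                for i in range(max(0, x - r), min(w, x + r + 1))
            ]
            mean = sum(patch) / len(patch)
            std = math.sqrt(sum((p - mean) ** 2 for p in patch) / len(patch))
            out[y][x] = 1 if image[y][x] > mean + k * std else 0
    return out


# Usage: one dark "text" pixel on a bright background.
frame = [
    [200, 200, 200],
    [200, 10, 200],
    [200, 200, 200],
]
binary = niblack_binarize(frame)
flatness = kurtosis([0, 0, 1, 1])
```

Because the threshold follows the local mean and contrast, the dark pixel is separated from its bright surroundings without a global threshold. In perfectly uniform regions the local standard deviation is zero, so this sketch maps every such pixel to 0.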