自动视频文本定位和识别

Fourth International Conference on Image and Graphics (ICIG 2007) Pub Date : 2007-08-22 DOI:10.1109/ICIG.2007.62

Ge Guo, Jin Jin, X. Ping, Zhang Tao

{"title":"自动视频文本定位和识别","authors":"Ge Guo, Jin Jin, X. Ping, Zhang Tao","doi":"10.1109/ICIG.2007.62","DOIUrl":null,"url":null,"abstract":"Text in videos contains much semantic information that can be used for video indexing and summarization. In this paper, we design an integrated algorithm of locating horizontal text based on corner point detection and color clustering. First, we get candidate text regions by using the method based on corner point detection, and then identify candidate text regions and refine the bounding boxes by color clustering. Both the precision and recall rate of the new localization method are improved, and the processing time of the new method is less. On the aspect of locating accuracy, the new method gives tighter bounding boxes. We finally enhance the quality of the detected text region by multi-frame averaging and local thresholding. Our method can handle multi-language video text with complex background including a great range of font sizes and styles. The results after above steps can be directly processed by OCR system.","PeriodicalId":367106,"journal":{"name":"Fourth International Conference on Image and Graphics (ICIG 2007)","volume":"66 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"Automatic Video Text Localization and Recognition\",\"authors\":\"Ge Guo, Jin Jin, X. Ping, Zhang Tao\",\"doi\":\"10.1109/ICIG.2007.62\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Text in videos contains much semantic information that can be used for video indexing and summarization. In this paper, we design an integrated algorithm of locating horizontal text based on corner point detection and color clustering. First, we get candidate text regions by using the method based on corner point detection, and then identify candidate text regions and refine the bounding boxes by color clustering. Both the precision and recall rate of the new localization method are improved, and the processing time of the new method is less. On the aspect of locating accuracy, the new method gives tighter bounding boxes. We finally enhance the quality of the detected text region by multi-frame averaging and local thresholding. Our method can handle multi-language video text with complex background including a great range of font sizes and styles. The results after above steps can be directly processed by OCR system.\",\"PeriodicalId\":367106,\"journal\":{\"name\":\"Fourth International Conference on Image and Graphics (ICIG 2007)\",\"volume\":\"66 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-08-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Fourth International Conference on Image and Graphics (ICIG 2007)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICIG.2007.62\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fourth International Conference on Image and Graphics (ICIG 2007)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIG.2007.62","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 9

摘要

视频中的文本包含了大量的语义信息，这些信息可以用于视频索引和摘要。本文设计了一种基于角点检测和颜色聚类的水平文本定位综合算法。首先利用基于角点检测的方法得到候选文本区域，然后利用颜色聚类方法对候选文本区域进行识别，并对边界框进行细化。该方法不仅提高了定位的准确率和查全率，而且缩短了定位的处理时间。在定位精度方面，新方法给出了更紧密的边界框。最后，我们通过多帧平均和局部阈值分割来提高检测文本区域的质量。我们的方法可以处理复杂背景的多语言视频文本，包括大范围的字体大小和样式。以上步骤后的结果可直接由OCR系统处理。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Automatic Video Text Localization and Recognition

Text in videos contains much semantic information that can be used for video indexing and summarization. In this paper, we design an integrated algorithm of locating horizontal text based on corner point detection and color clustering. First, we get candidate text regions by using the method based on corner point detection, and then identify candidate text regions and refine the bounding boxes by color clustering. Both the precision and recall rate of the new localization method are improved, and the processing time of the new method is less. On the aspect of locating accuracy, the new method gives tighter bounding boxes. We finally enhance the quality of the detected text region by multi-frame averaging and local thresholding. Our method can handle multi-language video text with complex background including a great range of font sizes and styles. The results after above steps can be directly processed by OCR system.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Fourth International Conference on Image and Graphics (ICIG 2007)

自引率

0.00%

发文量