自动视频文本定位和识别

Ge Guo, Jin Jin, X. Ping, Zhang Tao
{"title":"自动视频文本定位和识别","authors":"Ge Guo, Jin Jin, X. Ping, Zhang Tao","doi":"10.1109/ICIG.2007.62","DOIUrl":null,"url":null,"abstract":"Text in videos contains much semantic information that can be used for video indexing and summarization. In this paper, we design an integrated algorithm of locating horizontal text based on corner point detection and color clustering. First, we get candidate text regions by using the method based on corner point detection, and then identify candidate text regions and refine the bounding boxes by color clustering. Both the precision and recall rate of the new localization method are improved, and the processing time of the new method is less. On the aspect of locating accuracy, the new method gives tighter bounding boxes. We finally enhance the quality of the detected text region by multi-frame averaging and local thresholding. Our method can handle multi-language video text with complex background including a great range of font sizes and styles. The results after above steps can be directly processed by OCR system.","PeriodicalId":367106,"journal":{"name":"Fourth International Conference on Image and Graphics (ICIG 2007)","volume":"66 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"Automatic Video Text Localization and Recognition\",\"authors\":\"Ge Guo, Jin Jin, X. Ping, Zhang Tao\",\"doi\":\"10.1109/ICIG.2007.62\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Text in videos contains much semantic information that can be used for video indexing and summarization. In this paper, we design an integrated algorithm of locating horizontal text based on corner point detection and color clustering. First, we get candidate text regions by using the method based on corner point detection, and then identify candidate text regions and refine the bounding boxes by color clustering. Both the precision and recall rate of the new localization method are improved, and the processing time of the new method is less. On the aspect of locating accuracy, the new method gives tighter bounding boxes. We finally enhance the quality of the detected text region by multi-frame averaging and local thresholding. Our method can handle multi-language video text with complex background including a great range of font sizes and styles. The results after above steps can be directly processed by OCR system.\",\"PeriodicalId\":367106,\"journal\":{\"name\":\"Fourth International Conference on Image and Graphics (ICIG 2007)\",\"volume\":\"66 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-08-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Fourth International Conference on Image and Graphics (ICIG 2007)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICIG.2007.62\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fourth International Conference on Image and Graphics (ICIG 2007)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIG.2007.62","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9

摘要

视频中的文本包含了大量的语义信息,这些信息可以用于视频索引和摘要。本文设计了一种基于角点检测和颜色聚类的水平文本定位综合算法。首先利用基于角点检测的方法得到候选文本区域,然后利用颜色聚类方法对候选文本区域进行识别,并对边界框进行细化。该方法不仅提高了定位的准确率和查全率,而且缩短了定位的处理时间。在定位精度方面,新方法给出了更紧密的边界框。最后,我们通过多帧平均和局部阈值分割来提高检测文本区域的质量。我们的方法可以处理复杂背景的多语言视频文本,包括大范围的字体大小和样式。以上步骤后的结果可直接由OCR系统处理。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Automatic Video Text Localization and Recognition
Text in videos contains much semantic information that can be used for video indexing and summarization. In this paper, we design an integrated algorithm of locating horizontal text based on corner point detection and color clustering. First, we get candidate text regions by using the method based on corner point detection, and then identify candidate text regions and refine the bounding boxes by color clustering. Both the precision and recall rate of the new localization method are improved, and the processing time of the new method is less. On the aspect of locating accuracy, the new method gives tighter bounding boxes. We finally enhance the quality of the detected text region by multi-frame averaging and local thresholding. Our method can handle multi-language video text with complex background including a great range of font sizes and styles. The results after above steps can be directly processed by OCR system.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信