Pseudo Zernike moments based approach for text detection and localisation from lecture videos

IF 1.4 Q4 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
Belkacem Soundes, Guezouli Larbi, Zidat Samir
{"title":"Pseudo Zernike moments based approach for text detection and localisation from lecture videos","authors":"Belkacem Soundes, Guezouli Larbi, Zidat Samir","doi":"10.1504/IJCSE.2016.10011674","DOIUrl":null,"url":null,"abstract":"Scene text presents challenging characteristics mainly related to acquisition circumstances and environmental changes resulting in low quality videos. In this paper, we present a scene text detection algorithm based on pseudo Zernike moments (PZMs) and stroke features from low resolution lecture videos. Algorithm mainly consists of three steps: slide detection, text detection and segmentation and non-text filtering. In lecture videos, slide region is a key object carrying almost all important information; hence slide region has to be extracted and segmented from other scene objects considered as background for later processing. Slide region detection and segmentation is done by applying pseudo Zernike moment's based on RGB frames. Text detection and extraction is performed using PZMs segmentation over V channel of HSV colour space, and then stroke feature is used to filter out non-text region and to remove false positives. The algorithm is robust to illumination, low resolution and uneven luminance from compressed videos. Effectiveness of PZM description leads to very few false positives comparing to other approached. Moreover resulting images can be used directly by OCR engines and no more processing is needed.","PeriodicalId":47380,"journal":{"name":"International Journal of Computational Science and Engineering","volume":"40 1","pages":"274-283"},"PeriodicalIF":1.4000,"publicationDate":"2016-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Computational Science and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1504/IJCSE.2016.10011674","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 1

Abstract

Scene text presents challenging characteristics mainly related to acquisition circumstances and environmental changes resulting in low quality videos. In this paper, we present a scene text detection algorithm based on pseudo Zernike moments (PZMs) and stroke features from low resolution lecture videos. Algorithm mainly consists of three steps: slide detection, text detection and segmentation and non-text filtering. In lecture videos, slide region is a key object carrying almost all important information; hence slide region has to be extracted and segmented from other scene objects considered as background for later processing. Slide region detection and segmentation is done by applying pseudo Zernike moment's based on RGB frames. Text detection and extraction is performed using PZMs segmentation over V channel of HSV colour space, and then stroke feature is used to filter out non-text region and to remove false positives. The algorithm is robust to illumination, low resolution and uneven luminance from compressed videos. Effectiveness of PZM description leads to very few false positives comparing to other approached. Moreover resulting images can be used directly by OCR engines and no more processing is needed.
基于伪泽尼克矩的演讲视频文本检测和定位方法
场景文本呈现出具有挑战性的特征,主要与获取环境和环境变化有关,导致视频质量低。在本文中,我们提出了一种基于伪泽尼克矩(PZMs)和笔画特征的低分辨率演讲视频场景文本检测算法。算法主要包括三个步骤:幻灯片检测、文本检测与分割和非文本过滤。在讲课视频中,幻灯片区域是承载几乎所有重要信息的关键对象;因此,必须从作为背景的其他场景对象中提取和分割滑动区域,以供后续处理。采用基于RGB帧的伪泽尼克矩进行滑动区域检测和分割。在HSV颜色空间的V通道上使用PZMs分割进行文本检测和提取,然后使用笔画特征过滤掉非文本区域并去除误报。该算法对压缩视频的光照、低分辨率和不均匀亮度具有较强的鲁棒性。与其他方法相比,PZM描述的有效性导致很少的误报。此外,生成的图像可以直接由OCR引擎使用,而不需要更多的处理。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
International Journal of Computational Science and Engineering
International Journal of Computational Science and Engineering COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS-
CiteScore
4.00
自引率
40.00%
发文量
73
期刊介绍: Computational science and engineering is an emerging and promising discipline in shaping future research and development activities in both academia and industry, in fields ranging from engineering, science, finance, and economics, to arts and humanities. New challenges arise in the modelling of complex systems, sophisticated algorithms, advanced scientific and engineering computing and associated (multidisciplinary) problem-solving environments. Because the solution of large and complex problems must cope with tight timing schedules, powerful algorithms and computational techniques, are inevitable. IJCSE addresses the state of the art of all aspects of computational science and engineering with emphasis on computational methods and techniques for science and engineering applications.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信