基于MSER的自然图像多方向文本识别与分类

R. P, Shamjiith, R. K
{"title":"基于MSER的自然图像多方向文本识别与分类","authors":"R. P, Shamjiith, R. K","doi":"10.1109/incet49848.2020.9154142","DOIUrl":null,"url":null,"abstract":"Text recognition is a vast field of research and experimentation under image processing domain. It is a process by which the system locates the area whichever any kind of text is present and to extract them. The extracted text must be converted to human readable form after several processing and to classify them into meaningful classes based on the content. The platform used here is MATLAB R2018a. Firstly, Pre-processing is done on the ICDAR 2017 dataset in order to remove noise content. Then Segmentation is done to get a rough idea of the textual content present. Needful features are extracted using MSER (Maximally stable extremal regions). The obtained result is then processed with Stroke width transform. Geometrical features of text are matched with the regions. Finally, all of the processed regions are merged to obtain the exact text and extract them with OCR (Optical Character Recognition). Classifying these into meaningful attributes makes more sense to the extracted text.","PeriodicalId":174411,"journal":{"name":"2020 International Conference for Emerging Technology (INCET)","volume":"92 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Multi-Oriented Text Recognition and Classification in Natural Images using MSER\",\"authors\":\"R. P, Shamjiith, R. K\",\"doi\":\"10.1109/incet49848.2020.9154142\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Text recognition is a vast field of research and experimentation under image processing domain. It is a process by which the system locates the area whichever any kind of text is present and to extract them. The extracted text must be converted to human readable form after several processing and to classify them into meaningful classes based on the content. The platform used here is MATLAB R2018a. Firstly, Pre-processing is done on the ICDAR 2017 dataset in order to remove noise content. Then Segmentation is done to get a rough idea of the textual content present. Needful features are extracted using MSER (Maximally stable extremal regions). The obtained result is then processed with Stroke width transform. Geometrical features of text are matched with the regions. Finally, all of the processed regions are merged to obtain the exact text and extract them with OCR (Optical Character Recognition). Classifying these into meaningful attributes makes more sense to the extracted text.\",\"PeriodicalId\":174411,\"journal\":{\"name\":\"2020 International Conference for Emerging Technology (INCET)\",\"volume\":\"92 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Conference for Emerging Technology (INCET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/incet49848.2020.9154142\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference for Emerging Technology (INCET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/incet49848.2020.9154142","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

摘要

文本识别是图像处理领域中一个广阔的研究和实验领域。这是一个过程,通过该过程,系统定位任何文本存在的区域并提取它们。提取的文本必须经过多次处理后转换为人类可读的形式,并根据内容将其分类为有意义的类。这里使用的平台是MATLAB R2018a。首先,对ICDAR 2017数据集进行预处理,去除噪声内容。然后进行分割,大致了解文本内容。使用最大稳定极值区域(MSER)提取必要的特征。然后对得到的结果进行笔画宽度变换处理。文本的几何特征与区域匹配。最后,对所有处理过的区域进行合并,得到准确的文本,并用OCR(光学字符识别)进行提取。将这些属性分类为有意义的属性对提取的文本更有意义。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Multi-Oriented Text Recognition and Classification in Natural Images using MSER
Text recognition is a vast field of research and experimentation under image processing domain. It is a process by which the system locates the area whichever any kind of text is present and to extract them. The extracted text must be converted to human readable form after several processing and to classify them into meaningful classes based on the content. The platform used here is MATLAB R2018a. Firstly, Pre-processing is done on the ICDAR 2017 dataset in order to remove noise content. Then Segmentation is done to get a rough idea of the textual content present. Needful features are extracted using MSER (Maximally stable extremal regions). The obtained result is then processed with Stroke width transform. Geometrical features of text are matched with the regions. Finally, all of the processed regions are merged to obtain the exact text and extract them with OCR (Optical Character Recognition). Classifying these into meaningful attributes makes more sense to the extracted text.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信