Multi-Oriented Text Recognition and Classification in Natural Images using MSER

2020 International Conference for Emerging Technology (INCET). Pub Date: 2020-06-01. DOI: 10.1109/incet49848.2020.9154142

R. P, Shamjiith, R. K


Citations: 6

Abstract

Text recognition is a broad area of research and experimentation within the image-processing domain. It is the process by which a system locates the regions of an image in which text is present and extracts that text. The extracted text must be converted to human-readable form after several processing steps and then classified into meaningful classes based on its content. The platform used here is MATLAB R2018a. First, pre-processing is applied to the ICDAR 2017 dataset to remove noise. Segmentation is then performed to obtain a rough estimate of the textual content present. The required features are extracted using MSER (maximally stable extremal regions). The result is then processed with the stroke width transform. Geometric features of text are matched against the candidate regions. Finally, all of the processed regions are merged to obtain the exact text, which is extracted with OCR (optical character recognition). Classifying the extracted text into meaningful categories makes it more useful.
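The region-filtering stage described in the abstract (geometric matching plus a stroke-width check) can be sketched roughly as follows. This is a minimal illustration in plain Python, not the authors' MATLAB implementation: the `Region` type, the function names, and every threshold value are assumptions made for the example, since the abstract does not state them. Candidate regions are assumed to come from an MSER detector, with per-pixel stroke-width estimates already computed.

```python
from dataclasses import dataclass
from statistics import mean, pstdev
from typing import List, Sequence


@dataclass
class Region:
    """A candidate region, e.g. one produced by an MSER detector (hypothetical type)."""
    x: int                          # left edge of the bounding box
    y: int                          # top edge of the bounding box
    w: int                          # bounding-box width in pixels
    h: int                          # bounding-box height in pixels
    area: int                       # number of pixels belonging to the region
    stroke_widths: Sequence[float]  # per-pixel stroke-width estimates


def looks_like_text(r: Region,
                    max_aspect: float = 3.0,
                    min_extent: float = 0.2,
                    max_extent: float = 0.9,
                    max_stroke_cv: float = 0.4) -> bool:
    """Return True if the region passes the geometric and stroke-width tests.

    All thresholds are illustrative assumptions, not values from the paper.
    """
    if r.w <= 0 or r.h <= 0 or not r.stroke_widths:
        return False
    aspect = r.w / r.h              # very wide boxes are unlikely to be characters
    extent = r.area / (r.w * r.h)   # fraction of the box filled by the region
    if aspect > max_aspect or not (min_extent <= extent <= max_extent):
        return False
    # Characters tend to have a near-constant stroke width, so the coefficient
    # of variation (standard deviation / mean) should be small.
    m = mean(r.stroke_widths)
    return m > 0 and pstdev(r.stroke_widths) / m <= max_stroke_cv


def filter_regions(regions: List[Region]) -> List[Region]:
    """Keep only the regions that look like text."""
    return [r for r in regions if looks_like_text(r)]
```

For example, a compact region with uniform stroke widths is kept, while a long thin bar or a blob with highly variable stroke widths is rejected. The surviving regions would then be merged into text lines and passed to OCR.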
