实景图像文本区域分割软件的开发

IF 1.2 Q4 OPTICS

Computer Optics Pub Date : 2022-10-01 DOI:10.18287/2412-6179-co-1047

V. A. Lobanova, Yuliya Ivanova

{"title":"实景图像文本区域分割软件的开发","authors":"V. A. Lobanova, Yuliya Ivanova","doi":"10.18287/2412-6179-co-1047","DOIUrl":null,"url":null,"abstract":"This article discusses the design and development of a neural network algorithm for the segmentation of text areas in real-scene images. After reviewing the available neural network models, the U-net model was chosen as a basis. Then an algorithm for detecting text areas in real-scene images was proposed and implemented. The experimental training of the network allows one to define the neural network parameters such as the size of input images and the number and types of the network layers. Bilateral and low-pass filters were considered as a preprocessing stage. The number of images in the KAIST Scene Text Database was increased by applying rotations, compression, and splitting of the images. The results obtained were found to surpass competing methods in terms of the F-measure value.","PeriodicalId":46692,"journal":{"name":"Computer Optics","volume":"73 1","pages":""},"PeriodicalIF":1.2000,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Development of software for the segmentation of text areas in real-scene images\",\"authors\":\"V. A. Lobanova, Yuliya Ivanova\",\"doi\":\"10.18287/2412-6179-co-1047\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This article discusses the design and development of a neural network algorithm for the segmentation of text areas in real-scene images. After reviewing the available neural network models, the U-net model was chosen as a basis. Then an algorithm for detecting text areas in real-scene images was proposed and implemented. The experimental training of the network allows one to define the neural network parameters such as the size of input images and the number and types of the network layers. Bilateral and low-pass filters were considered as a preprocessing stage. The number of images in the KAIST Scene Text Database was increased by applying rotations, compression, and splitting of the images. The results obtained were found to surpass competing methods in terms of the F-measure value.\",\"PeriodicalId\":46692,\"journal\":{\"name\":\"Computer Optics\",\"volume\":\"73 1\",\"pages\":\"\"},\"PeriodicalIF\":1.2000,\"publicationDate\":\"2022-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer Optics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.18287/2412-6179-co-1047\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"OPTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Optics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18287/2412-6179-co-1047","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"OPTICS","Score":null,"Total":0}

引用次数: 0

摘要

本文讨论了一种用于真实场景图像文本区域分割的神经网络算法的设计和开发。在回顾了现有的神经网络模型后，选择U-net模型作为基础。在此基础上，提出并实现了一种实景图像文本区域检测算法。网络的实验训练允许人们定义神经网络参数，如输入图像的大小和网络层的数量和类型。双边滤波器和低通滤波器被认为是预处理阶段。通过对图像进行旋转、压缩和分割，增加了KAIST场景文本数据库中的图像数量。所获得的结果被发现在f测量值方面优于竞争方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Development of software for the segmentation of text areas in real-scene images

This article discusses the design and development of a neural network algorithm for the segmentation of text areas in real-scene images. After reviewing the available neural network models, the U-net model was chosen as a basis. Then an algorithm for detecting text areas in real-scene images was proposed and implemented. The experimental training of the network allows one to define the neural network parameters such as the size of input images and the number and types of the network layers. Bilateral and low-pass filters were considered as a preprocessing stage. The number of images in the KAIST Scene Text Database was increased by applying rotations, compression, and splitting of the images. The results obtained were found to surpass competing methods in terms of the F-measure value.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Computer Optics OPTICS-

CiteScore

4.20

自引率

10.00%

发文量

审稿时长

9 weeks

期刊介绍： The journal is intended for researchers and specialists active in the following research areas: Diffractive Optics; Information Optical Technology; Nanophotonics and Optics of Nanostructures; Image Analysis & Understanding; Information Coding & Security; Earth Remote Sensing Technologies; Hyperspectral Data Analysis; Numerical Methods for Optics and Image Processing; Intelligent Video Analysis. The journal "Computer Optics" has been published since 1987. Published 6 issues per year.