实景图像文本区域分割软件的开发

IF 1.1 Q4 OPTICS
V. A. Lobanova, Yuliya Ivanova
{"title":"实景图像文本区域分割软件的开发","authors":"V. A. Lobanova, Yuliya Ivanova","doi":"10.18287/2412-6179-co-1047","DOIUrl":null,"url":null,"abstract":"This article discusses the design and development of a neural network algorithm for the segmentation of text areas in real-scene images. After reviewing the available neural network models, the U-net model was chosen as a basis. Then an algorithm for detecting text areas in real-scene images was proposed and implemented. The experimental training of the network allows one to define the neural network parameters such as the size of input images and the number and types of the network layers. Bilateral and low-pass filters were considered as a preprocessing stage. The number of images in the KAIST Scene Text Database was increased by applying rotations, compression, and splitting of the images. The results obtained were found to surpass competing methods in terms of the F-measure value.","PeriodicalId":46692,"journal":{"name":"Computer Optics","volume":"73 1","pages":""},"PeriodicalIF":1.1000,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Development of software for the segmentation of text areas in real-scene images\",\"authors\":\"V. A. Lobanova, Yuliya Ivanova\",\"doi\":\"10.18287/2412-6179-co-1047\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This article discusses the design and development of a neural network algorithm for the segmentation of text areas in real-scene images. After reviewing the available neural network models, the U-net model was chosen as a basis. Then an algorithm for detecting text areas in real-scene images was proposed and implemented. The experimental training of the network allows one to define the neural network parameters such as the size of input images and the number and types of the network layers. Bilateral and low-pass filters were considered as a preprocessing stage. The number of images in the KAIST Scene Text Database was increased by applying rotations, compression, and splitting of the images. The results obtained were found to surpass competing methods in terms of the F-measure value.\",\"PeriodicalId\":46692,\"journal\":{\"name\":\"Computer Optics\",\"volume\":\"73 1\",\"pages\":\"\"},\"PeriodicalIF\":1.1000,\"publicationDate\":\"2022-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer Optics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.18287/2412-6179-co-1047\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"OPTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Optics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18287/2412-6179-co-1047","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"OPTICS","Score":null,"Total":0}
引用次数: 0

摘要

本文讨论了一种用于真实场景图像文本区域分割的神经网络算法的设计和开发。在回顾了现有的神经网络模型后,选择U-net模型作为基础。在此基础上,提出并实现了一种实景图像文本区域检测算法。网络的实验训练允许人们定义神经网络参数,如输入图像的大小和网络层的数量和类型。双边滤波器和低通滤波器被认为是预处理阶段。通过对图像进行旋转、压缩和分割,增加了KAIST场景文本数据库中的图像数量。所获得的结果被发现在f测量值方面优于竞争方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Development of software for the segmentation of text areas in real-scene images
This article discusses the design and development of a neural network algorithm for the segmentation of text areas in real-scene images. After reviewing the available neural network models, the U-net model was chosen as a basis. Then an algorithm for detecting text areas in real-scene images was proposed and implemented. The experimental training of the network allows one to define the neural network parameters such as the size of input images and the number and types of the network layers. Bilateral and low-pass filters were considered as a preprocessing stage. The number of images in the KAIST Scene Text Database was increased by applying rotations, compression, and splitting of the images. The results obtained were found to surpass competing methods in terms of the F-measure value.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Computer Optics
Computer Optics OPTICS-
CiteScore
4.20
自引率
10.00%
发文量
73
审稿时长
9 weeks
期刊介绍: The journal is intended for researchers and specialists active in the following research areas: Diffractive Optics; Information Optical Technology; Nanophotonics and Optics of Nanostructures; Image Analysis & Understanding; Information Coding & Security; Earth Remote Sensing Technologies; Hyperspectral Data Analysis; Numerical Methods for Optics and Image Processing; Intelligent Video Analysis. The journal "Computer Optics" has been published since 1987. Published 6 issues per year.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信