{"title":"基于卷积神经网络的特征级融合自然图像多语言合成字符识别","authors":"Asghar Ali, M. Pickering","doi":"10.1109/DICTA.2018.8615845","DOIUrl":null,"url":null,"abstract":"In this paper, a new Convolutional Neural Network (CNN) architecture is proposed for synthetic Urdu and English character recognition in natural scene images. The features are extracted using three separate sub-models of the CNN which are then fused in one feature vector. The network is purely trained on the synthetic character images of English and Urdu texts in natural images. For English text, the Chars74k-Font dataset is used and for Urdu text, the synthetic dataset is created by automatically cropping the image patches from four background image datasets and then putting characters at random positions within the image patch. The network is evaluated on a combined synthetic dataset of English and Urdu characters and the separate synthetic characters of Urdu and English datasets. The experimental results show that the network performs well on synthetic datasets.","PeriodicalId":130057,"journal":{"name":"2018 Digital Image Computing: Techniques and Applications (DICTA)","volume":"61 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Feature-Level Fusion using Convolutional Neural Network for Multi-Language Synthetic Character Recognition in Natual Images\",\"authors\":\"Asghar Ali, M. Pickering\",\"doi\":\"10.1109/DICTA.2018.8615845\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, a new Convolutional Neural Network (CNN) architecture is proposed for synthetic Urdu and English character recognition in natural scene images. The features are extracted using three separate sub-models of the CNN which are then fused in one feature vector. The network is purely trained on the synthetic character images of English and Urdu texts in natural images. For English text, the Chars74k-Font dataset is used and for Urdu text, the synthetic dataset is created by automatically cropping the image patches from four background image datasets and then putting characters at random positions within the image patch. The network is evaluated on a combined synthetic dataset of English and Urdu characters and the separate synthetic characters of Urdu and English datasets. The experimental results show that the network performs well on synthetic datasets.\",\"PeriodicalId\":130057,\"journal\":{\"name\":\"2018 Digital Image Computing: Techniques and Applications (DICTA)\",\"volume\":\"61 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 Digital Image Computing: Techniques and Applications (DICTA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DICTA.2018.8615845\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Digital Image Computing: Techniques and Applications (DICTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DICTA.2018.8615845","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Feature-Level Fusion using Convolutional Neural Network for Multi-Language Synthetic Character Recognition in Natual Images
In this paper, a new Convolutional Neural Network (CNN) architecture is proposed for synthetic Urdu and English character recognition in natural scene images. The features are extracted using three separate sub-models of the CNN which are then fused in one feature vector. The network is purely trained on the synthetic character images of English and Urdu texts in natural images. For English text, the Chars74k-Font dataset is used and for Urdu text, the synthetic dataset is created by automatically cropping the image patches from four background image datasets and then putting characters at random positions within the image patch. The network is evaluated on a combined synthetic dataset of English and Urdu characters and the separate synthetic characters of Urdu and English datasets. The experimental results show that the network performs well on synthetic datasets.