{"title":"Text detection in born-digital images by mass estimation","authors":"Jiamin Xu, P. Shivakumara, Tong Lu, C. Tan, M. Blumenstein","doi":"10.1109/ACPR.2015.7486591","DOIUrl":"https://doi.org/10.1109/ACPR.2015.7486591","url":null,"abstract":"There is a need for effective web-document understanding due to the explosive growth of Internet and network technologies. In this paper, we propose a new method for text detection in born-digital images by introducing a mass estimation concept. We explore super-pixel information from different color channels to identify text atoms in images. The proposed method uses similarity graphs and spectral clustering to identify candidate text regions. We propose a new idea of mapping the Gabor responses of a candidate text region to a spatial circle to study the spatial coherency of pixels. Mass estimation then identifies text candidates from the pixel distribution in the spatial circle. Linear linkage graphs help in grouping text candidates to obtain full text lines. The same Gabor responses are used as features to eliminate false positives with an SVM classifier. We evaluate the proposed method on standard datasets, such as ICDAR 2013 (challenge-1) and the Situ et al. dataset. Experimental results on both datasets show that the proposed method outperforms existing methods.","PeriodicalId":240902,"journal":{"name":"2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR)","volume":"70 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131737744","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
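The spectral-clustering step described in the abstract above can be sketched in a few lines of NumPy. This is a generic illustration, not the paper's implementation: the RBF similarity graph, the two-cluster setting, and the toy "super-pixel color features" are all assumptions on our part.

```python
import numpy as np

def spectral_cluster_2way(features, sigma=1.0):
    """Two-way spectral clustering via the normalized graph Laplacian."""
    # Pairwise RBF similarity graph over the feature vectors.
    d2 = ((features[:, None, :] - features[None, :, :]) ** 2).sum(-1)
    W = np.exp(-d2 / (2.0 * sigma ** 2))
    # Normalized Laplacian: L = I - D^{-1/2} W D^{-1/2}.
    d_inv_sqrt = 1.0 / np.sqrt(W.sum(axis=1))
    L = np.eye(len(W)) - d_inv_sqrt[:, None] * W * d_inv_sqrt[None, :]
    # The eigenvector of the second-smallest eigenvalue (the Fiedler
    # vector) encodes the bipartition; split on its sign.
    _, vecs = np.linalg.eigh(L)
    return (vecs[:, 1] > 0).astype(int)

# Toy "super-pixel color features": two nearby but distinct blobs.
rng = np.random.default_rng(0)
blob_a = rng.normal(0.0, 0.1, size=(10, 3))
blob_b = rng.normal(1.5, 0.1, size=(10, 3))
labels = spectral_cluster_2way(np.vstack([blob_a, blob_b]))
```

In a full pipeline the rows of `features` would be per-super-pixel color statistics and the number of clusters would be chosen per image; both are fixed here for brevity.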
{"title":"Beyond human recognition: A CNN-based framework for handwritten character recognition","authors":"Li Chen, Song Wang, Wei-liang Fan, Jun Sun, S. Naoi","doi":"10.1109/ACPR.2015.7486592","DOIUrl":"https://doi.org/10.1109/ACPR.2015.7486592","url":null,"abstract":"Because of its varied appearance (different writers, writing styles, noise, etc.), handwritten character recognition is one of the most challenging tasks in pattern recognition. Through decades of research, traditional methods have reached their limits, while the emergence of deep learning provides a new way to break through them. In this paper, a CNN-based handwritten character recognition framework is proposed. In this framework, proper sample generation, a suitable training scheme and a CNN network structure are employed according to the properties of handwritten characters. In the experiments, the proposed framework performed even better than humans on handwritten digit (MNIST) and Chinese character (CASIA) recognition. The advantage of this framework is confirmed by these experimental results.","PeriodicalId":240902,"journal":{"name":"2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR)","volume":"304 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131785692","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"New texture-spatial features for keyword spotting in video images","authors":"P. Shivakumara, Guozhu Liang, Sangheeta Roy, U. Pal, Tong Lu","doi":"10.1109/ACPR.2015.7486532","DOIUrl":"https://doi.org/10.1109/ACPR.2015.7486532","url":null,"abstract":"Keyword spotting in video document images is challenging due to the low resolution and complex backgrounds of video images. We propose a combination of Texture-Spatial-Features (TSF) for keyword spotting in video images without recognizing them. First, a segmentation method extracts words from text lines in each video image. Then we propose a set of texture features for identifying text candidates in the word image with the help of k-means clustering. The proposed method finds the proximity between text candidates to study the spatial arrangement of pixels, which results in feature vectors for spotting words in the input frame. The proposed method is evaluated on word images of different fonts, contrasts, backgrounds and font sizes, chosen from standard databases such as ICDAR 2013 video and our own video data. Experimental results show that the proposed method outperforms existing methods in terms of recall, precision and f-measure.","PeriodicalId":240902,"journal":{"name":"2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR)","volume":"100 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133916734","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Bayesian nonparametric inference of latent topic hierarchies for multimodal data","authors":"Takuji Shimamawari, K. Eguchi, A. Takasu","doi":"10.1109/ACPR.2015.7486501","DOIUrl":"https://doi.org/10.1109/ACPR.2015.7486501","url":null,"abstract":"Research on multimodal data analysis, such as annotated image analysis, is becoming more important than ever due to the increase in the amount of data. One approach to this problem is multimodal topic models as an extension of latent Dirichlet allocation (LDA). Symmetric correspondence topic models (SymCorrLDA) are state-of-the-art multimodal topic models that can appropriately model multimodal data considering inter-modal dependencies. Meanwhile, hierarchically structured categories can help users find relevant data in a large data collection. Hierarchical topic models such as hierarchical latent Dirichlet allocation (hLDA) can discover a tree-structured hierarchy of latent topics from a given unimodal data collection; however, no hierarchical topic models can appropriately handle multimodal data considering inter-modal dependencies. In this paper, we propose h-SymCorrLDA to discover latent topic hierarchies from multimodal data by combining the ideas of the two previously mentioned models: multimodal topic models and hierarchical topic models. We demonstrate the effectiveness of our model compared with several baseline models through experiments with two datasets of annotated images.","PeriodicalId":240902,"journal":{"name":"2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122880004","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Video-level violence rating with rank prediction","authors":"Yu Wang, Jien Kato","doi":"10.1109/ACPR.2015.7486468","DOIUrl":"https://doi.org/10.1109/ACPR.2015.7486468","url":null,"abstract":"Given a video as input, our objective is to estimate a rating that describes \"how violent it is\". Such an estimate can be used directly in many practical applications, such as shielding children from violent videos. However, due to the unique properties of the rating task, existing approaches to human action recognition and violent scene detection cannot be directly utilized. In this paper, we propose an approach that is specially developed for violence rating. The approach features: (1) a novel video descriptor called the Violent Attribute Activation (VAA) vector, which provides a high-level description of the properties of visual violence; and (2) a rank-prediction-based rating approach, which enforces order constraints in the learning phase. The performance of our approach has been confirmed on a novel dataset prepared for violence rating.","PeriodicalId":240902,"journal":{"name":"2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR)","volume":"97 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124438955","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Stereoscopic image warping for enhancing composition aesthetics","authors":"Md Baharul Islam, L. Wong, Chee-Onn Wong, Kok-Lim Low","doi":"10.1109/ACPR.2015.7486582","DOIUrl":"https://doi.org/10.1109/ACPR.2015.7486582","url":null,"abstract":"The increased popularity of stereo photography, due to the availability of stereoscopic lenses and cameras, has aroused research interest in stereo image editing. In this paper, we present an automatic, aesthetics-based warping approach to recompose the left and right images of a stereo pair simultaneously using a global optimization algorithm. To maximize image aesthetics, we minimize a set of aesthetics errors formulated from selected photographic composition rules during the warping process. In addition, our algorithm attempts to preserve the stereoscopic properties by minimizing disparity change and vertical drift in the resulting image. Experimental results show that our approach successfully relocates salient objects according to the selected photographic rules to enhance compositional aesthetics, and maintains disparity consistency to create a comfortable 3D viewing experience.","PeriodicalId":240902,"journal":{"name":"2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR)","volume":"182 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121058904","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Efficient graph spanning structures for large database image retrieval","authors":"B. Mocanu, Ruxandra Tapu, T. Zaharia","doi":"10.1109/ACPR.2015.7486572","DOIUrl":"https://doi.org/10.1109/ACPR.2015.7486572","url":null,"abstract":"In this paper we propose a novel method to improve the performance of image retrieval at the VLAD descriptor level. The system performs image re-ranking based on relational graphs and the neighborhood relations of the top-k candidate results. The technique is able to treat various parts of the graph spanning structures differently by adaptively modifying the similarity scores between images. Because most of the processing is performed offline, our algorithm does not affect the retrieval time. By dealing with the uneven distribution of images in the dataset, the method is effective and increases accuracy without relying on low-level information or on the geometric verification of the considered features.","PeriodicalId":240902,"journal":{"name":"2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR)","volume":"2015 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114445020","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
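The neighborhood-based re-ranking idea in the abstract above can be illustrated with a toy rule (entirely an assumption on our part, not the paper's actual scoring): blend each candidate's similarity to the query with the query similarities of its nearest database neighbors, so a high-scoring but isolated false positive loses rank.

```python
import numpy as np

def rerank(query_sim, db_sim, k=2, alpha=0.5):
    """Re-rank candidates by blending each item's query similarity with
    the query similarities of its k nearest database neighbors."""
    n = len(query_sim)
    new_scores = np.empty(n)
    for i in range(n):
        # k most similar database items to item i, excluding i itself.
        neigh = [j for j in np.argsort(db_sim[i])[::-1] if j != i][:k]
        support = np.mean([query_sim[j] for j in neigh])
        new_scores[i] = alpha * query_sim[i] + (1 - alpha) * support
    return np.argsort(new_scores)[::-1]  # best-first candidate indices

# Item 0 scores highest against the query, but its database neighbors
# (items 3 and 4) do not match the query; items 1 and 2 support each other.
db_sim = np.array([
    [1.0, 0.1, 0.1, 0.9, 0.9],
    [0.1, 1.0, 0.9, 0.1, 0.2],
    [0.1, 0.9, 1.0, 0.2, 0.1],
    [0.9, 0.1, 0.2, 1.0, 0.3],
    [0.9, 0.2, 0.1, 0.3, 1.0],
])
query_sim = np.array([0.9, 0.85, 0.8, 0.2, 0.2])
order = rerank(query_sim, db_sim)
```

After re-ranking, the mutually supporting items 1 and 2 move ahead of the isolated item 0.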
{"title":"Learning clustered sub-spaces for sketch-based image retrieval","authors":"Koustav Ghosal, Ameya Prabhu, Riddhiman Dasgupta, A. Namboodiri","doi":"10.1109/ACPR.2015.7486573","DOIUrl":"https://doi.org/10.1109/ACPR.2015.7486573","url":null,"abstract":"Most traditional sketch-based image retrieval systems compare sketches and images using morphological features. Since these features belong to two different modalities, they are compared either by reducing the image to a sparse, sketch-like form or by transforming the sketches into a denser, image-like representation. However, this cross-modal transformation leads to information loss or adds undesirable noise to the system. We propose a method in which, instead of comparing the two modalities directly, a cross-modal correspondence is established between the images and sketches. Using an extended version of Canonical Correlation Analysis (CCA), the samples are projected onto a lower-dimensional subspace, where the images and sketches of the same class are maximally correlated. We test the efficiency of our method on images from the Caltech and PASCAL datasets and sketches from the TU-BERLIN dataset. Our results show a significant improvement in retrieval performance with the cross-modal correspondence.","PeriodicalId":240902,"journal":{"name":"2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR)","volume":"258 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122084014","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
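The CCA projection that the abstract above builds on can be sketched with plain linear algebra. This is textbook two-view CCA, not the paper's extended variant, and the toy "image" and "sketch" features sharing one latent signal are invented for illustration.

```python
import numpy as np

def cca(X, Y, dim=1, reg=1e-6):
    """First `dim` canonical direction pairs for two views X and Y."""
    X = X - X.mean(axis=0)
    Y = Y - Y.mean(axis=0)
    n = len(X)
    Cxx = X.T @ X / n + reg * np.eye(X.shape[1])  # regularized covariances
    Cyy = Y.T @ Y / n + reg * np.eye(Y.shape[1])
    Cxy = X.T @ Y / n

    def inv_sqrt(C):
        vals, vecs = np.linalg.eigh(C)
        return vecs @ np.diag(1.0 / np.sqrt(vals)) @ vecs.T

    Wx, Wy = inv_sqrt(Cxx), inv_sqrt(Cyy)
    # SVD of the whitened cross-covariance yields the canonical pairs.
    U, _, Vt = np.linalg.svd(Wx @ Cxy @ Wy)
    return Wx @ U[:, :dim], Wy @ Vt[:dim].T

# Two toy "modalities" sharing one latent signal z.
rng = np.random.default_rng(1)
z = rng.normal(size=(200, 1))
X = np.hstack([z + 0.1 * rng.normal(size=(200, 1)),
               rng.normal(size=(200, 1))])
Y = np.hstack([rng.normal(size=(200, 1)),
               -z + 0.1 * rng.normal(size=(200, 1))])
A, B = cca(X, Y)
u, v = (X @ A).ravel(), (Y @ B).ravel()
```

The projections `u` and `v` recover the shared signal, so their correlation is near 1 in magnitude even though the raw feature spaces are not directly comparable.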
{"title":"Video-based object recognition with weakly supervised object localization","authors":"Yang Liu, R. Kouskouridas, Tae-Kyun Kim","doi":"10.1109/ACPR.2015.7486463","DOIUrl":"https://doi.org/10.1109/ACPR.2015.7486463","url":null,"abstract":"With the number of videos growing rapidly in modern society, automatically recognizing objects from video input becomes increasingly pressing. Videos contain abundant yet noisy information, with easily obtained video-level labels. This paper targets the problem of video-based object recognition, whilst keeping the advantages of videos. We propose a novel algorithm, which only utilizes the weak video-level label in training, iteratively updating the classifier and inferring the object location in each video frame. During testing we obtain more accurate recognition results by inferring the location of the object in the scene. The background and temporal information are also incorporated in the model to improve the discriminability and consistency of recognition in video. We introduce a novel and challenging YouTube dataset to demonstrate the benefits of our method over other baseline methods.","PeriodicalId":240902,"journal":{"name":"2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123229870","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Accent classification with phonetic vowel representation","authors":"Zhenhao Ge, Ying‑Ying Tan, A. Ganapathiraju","doi":"10.1109/ACPR.2015.7486559","DOIUrl":"https://doi.org/10.1109/ACPR.2015.7486559","url":null,"abstract":"Previous accent classification research focused mainly on detecting accents with pure acoustic information, without recognizing the accented speech. This work combines phonetic knowledge, such as vowels, with acoustic information to build a Gaussian Mixture Model (GMM) classifier with Perceptual Linear Predictive (PLP) features, optimized by Heteroscedastic Linear Discriminant Analysis (HLDA). With about 20 seconds of accented speech as input, this system achieves a classification rate of 51% on a 7-way classification task covering the major types of accents in English, which is competitive with state-of-the-art results in this field.","PeriodicalId":240902,"journal":{"name":"2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131324902","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
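A heavily simplified sketch of the GMM classification idea from the abstract above: one full-covariance Gaussian per accent class (i.e. a one-component "GMM"), with synthetic 2-D features standing in for the PLP/HLDA front-end, which is out of scope here. The class structure and data are assumptions for illustration only.

```python
import numpy as np

class GaussianAccentClassifier:
    """One full-covariance Gaussian per class (a one-component 'GMM')."""

    def fit(self, X, y):
        self.classes_ = np.unique(y)
        self.params_ = {}
        for c in self.classes_:
            Xc = X[y == c]
            cov = np.cov(Xc.T) + 1e-6 * np.eye(X.shape[1])  # regularized
            self.params_[c] = (Xc.mean(axis=0), np.linalg.inv(cov),
                               np.linalg.slogdet(cov)[1])
        return self

    def predict(self, X):
        scores = []
        for c in self.classes_:
            mu, prec, logdet = self.params_[c]
            d = X - mu
            # Gaussian log-likelihood up to a shared additive constant.
            scores.append(-0.5 * (np.einsum('ij,jk,ik->i', d, prec, d)
                                  + logdet))
        return self.classes_[np.argmax(scores, axis=0)]

# Synthetic "accent" features: two well-separated 2-D clusters.
rng = np.random.default_rng(2)
X = np.vstack([rng.normal(0.0, 0.3, size=(50, 2)),
               rng.normal(2.0, 0.3, size=(50, 2))])
y = np.array([0] * 50 + [1] * 50)
clf = GaussianAccentClassifier().fit(X, y)
```

A real GMM would mix several Gaussians per class and fit them with EM; the single-Gaussian version above keeps the maximum-likelihood decision rule while staying short.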