T. Mondal, Arundhati Tarafdar, N. Ragot, Jean-Yves Ramel, U. Pal
{"title":"改进的基于形状代码的多脚本文档单词匹配","authors":"T. Mondal, Arundhati Tarafdar, N. Ragot, Jean-Yves Ramel, U. Pal","doi":"10.1109/ACPR.2015.7486490","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a shape code based wordimage matching (word-spotting) technique for word retrieval in multilingual documents, written in Indian languages. Each query word image to be searched is represented by a sequence of shape codes that corresponds to primitives. Then an inexact string matching technique is applied for measuring the similarity between the codes generated from the query word image and each candidate word images, obtained from the document. Based on the similarity score, we retrieve the document where the query image is found. Experimental results on Bangla, Devanagari scripts document image databases confirms the feasibility and efficiency of our proposed approach.","PeriodicalId":240902,"journal":{"name":"2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improved shape code based word matching for multi-script documents\",\"authors\":\"T. Mondal, Arundhati Tarafdar, N. Ragot, Jean-Yves Ramel, U. Pal\",\"doi\":\"10.1109/ACPR.2015.7486490\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we propose a shape code based wordimage matching (word-spotting) technique for word retrieval in multilingual documents, written in Indian languages. Each query word image to be searched is represented by a sequence of shape codes that corresponds to primitives. Then an inexact string matching technique is applied for measuring the similarity between the codes generated from the query word image and each candidate word images, obtained from the document. Based on the similarity score, we retrieve the document where the query image is found. Experimental results on Bangla, Devanagari scripts document image databases confirms the feasibility and efficiency of our proposed approach.\",\"PeriodicalId\":240902,\"journal\":{\"name\":\"2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-11-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ACPR.2015.7486490\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACPR.2015.7486490","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Improved shape code based word matching for multi-script documents
In this paper, we propose a shape code based wordimage matching (word-spotting) technique for word retrieval in multilingual documents, written in Indian languages. Each query word image to be searched is represented by a sequence of shape codes that corresponds to primitives. Then an inexact string matching technique is applied for measuring the similarity between the codes generated from the query word image and each candidate word images, obtained from the document. Based on the similarity score, we retrieve the document where the query image is found. Experimental results on Bangla, Devanagari scripts document image databases confirms the feasibility and efficiency of our proposed approach.