{"title":"学习视觉对象和单词联想","authors":"Yie-Tarng Chen, Ting-Zhi Wang, Wen-Hsien Fang, Didik Purwanto","doi":"10.1109/ICSIPA.2017.8120577","DOIUrl":null,"url":null,"abstract":"This paper presents a new discriminative learning framework to associate the relationship between the objects and the words in an image and perform template matching scheme for complex association patterns. The problem is first formulated as a bipartite graph matching problem. Thereafter, structural support vector machine (SVM) is employed to obtain the optimal compatibility function to encode the association rules between the objects and the words. Moreover, an iterative inference procedure is developed to alternatively infer the association of visual objects and texts and the selection of the template model. Simulations show that the new method outperforms the existing competing counterparts.","PeriodicalId":268112,"journal":{"name":"2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"498 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Learning visual object and word association\",\"authors\":\"Yie-Tarng Chen, Ting-Zhi Wang, Wen-Hsien Fang, Didik Purwanto\",\"doi\":\"10.1109/ICSIPA.2017.8120577\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a new discriminative learning framework to associate the relationship between the objects and the words in an image and perform template matching scheme for complex association patterns. The problem is first formulated as a bipartite graph matching problem. Thereafter, structural support vector machine (SVM) is employed to obtain the optimal compatibility function to encode the association rules between the objects and the words. Moreover, an iterative inference procedure is developed to alternatively infer the association of visual objects and texts and the selection of the template model. Simulations show that the new method outperforms the existing competing counterparts.\",\"PeriodicalId\":268112,\"journal\":{\"name\":\"2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)\",\"volume\":\"498 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSIPA.2017.8120577\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSIPA.2017.8120577","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
本文提出了一种新的判别学习框架,用于关联图像中物体与单词之间的关系,并对复杂的关联模式执行模板匹配方案。该问题首先被表述为一个二部图匹配问题。然后,利用结构支持向量机(structural support vector machine, SVM)得到最优兼容函数,对对象与单词之间的关联规则进行编码。此外,还开发了一个迭代推理程序来交替地推断视觉对象和文本的关联以及模板模型的选择。仿真结果表明,该方法优于现有的同类方法。
This paper presents a new discriminative learning framework to associate the relationship between the objects and the words in an image and perform template matching scheme for complex association patterns. The problem is first formulated as a bipartite graph matching problem. Thereafter, structural support vector machine (SVM) is employed to obtain the optimal compatibility function to encode the association rules between the objects and the words. Moreover, an iterative inference procedure is developed to alternatively infer the association of visual objects and texts and the selection of the template model. Simulations show that the new method outperforms the existing competing counterparts.