{"title":"利用聚类和立体信息在人脸图像标签传播中的应用","authors":"O. Zoidi, N. Nikolaidis, I. Pitas","doi":"10.1109/CIBIM.2013.6607910","DOIUrl":null,"url":null,"abstract":"In this paper, a method for performing semiautomatic identity label annotation on facial images, obtained from monocular and stereoscopic videos is introduced. The proposed method exploits prior information for the data structure, obtained from the application of a clustering algorithm, for the selection of the facial images from which label inference should begin. Then, a sparse graph is constructed according to the Linear Neighborhood Propagation (LNP) method and, finally, label inference is performed according to an iterative update rule. In the case of stereoscopic videos, the classification decision is determined by the combined information of the left and right channels. The objective of the proposed framework is to be used by archivists for semi-automatic annotation of television content, in order to further enable journalists to directly access video shots/frames of interest.","PeriodicalId":286155,"journal":{"name":"2013 IEEE Symposium on Computational Intelligence in Biometrics and Identity Management (CIBIM)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Exploiting clustering and stereo information in label propagation on facial images\",\"authors\":\"O. Zoidi, N. Nikolaidis, I. Pitas\",\"doi\":\"10.1109/CIBIM.2013.6607910\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, a method for performing semiautomatic identity label annotation on facial images, obtained from monocular and stereoscopic videos is introduced. The proposed method exploits prior information for the data structure, obtained from the application of a clustering algorithm, for the selection of the facial images from which label inference should begin. Then, a sparse graph is constructed according to the Linear Neighborhood Propagation (LNP) method and, finally, label inference is performed according to an iterative update rule. In the case of stereoscopic videos, the classification decision is determined by the combined information of the left and right channels. The objective of the proposed framework is to be used by archivists for semi-automatic annotation of television content, in order to further enable journalists to directly access video shots/frames of interest.\",\"PeriodicalId\":286155,\"journal\":{\"name\":\"2013 IEEE Symposium on Computational Intelligence in Biometrics and Identity Management (CIBIM)\",\"volume\":\"45 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-04-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 IEEE Symposium on Computational Intelligence in Biometrics and Identity Management (CIBIM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CIBIM.2013.6607910\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE Symposium on Computational Intelligence in Biometrics and Identity Management (CIBIM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIBIM.2013.6607910","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Exploiting clustering and stereo information in label propagation on facial images
In this paper, a method for performing semiautomatic identity label annotation on facial images, obtained from monocular and stereoscopic videos is introduced. The proposed method exploits prior information for the data structure, obtained from the application of a clustering algorithm, for the selection of the facial images from which label inference should begin. Then, a sparse graph is constructed according to the Linear Neighborhood Propagation (LNP) method and, finally, label inference is performed according to an iterative update rule. In the case of stereoscopic videos, the classification decision is determined by the combined information of the left and right channels. The objective of the proposed framework is to be used by archivists for semi-automatic annotation of television content, in order to further enable journalists to directly access video shots/frames of interest.