Improving the convergence of co-training for audio-visual person identification

2016 First International Workshop on Sensing, Processing and Learning for Intelligent Machines (SPLINE) Pub Date : 2016-07-06 DOI:10.1109/SPLIM.2016.7528400

Nicolai Bæk Thomsen, Xiaodong Duan, Z. Tan, B. Lindberg, S. H. Jensen

{"title":"Improving the convergence of co-training for audio-visual person identification","authors":"Nicolai Bæk Thomsen, Xiaodong Duan, Z. Tan, B. Lindberg, S. H. Jensen","doi":"10.1109/SPLIM.2016.7528400","DOIUrl":null,"url":null,"abstract":"Person identification is a very important task for intelligent devices when communicating or interacting with humans. A potential problem in real applications is that the amount of enrollment data is insufficient. When multiple modalities are available, it is possible to re-train the system online by exploiting the conditional independence between the modalities and thus improving classification accuracy. This can be achieved by the well-known CO-training algorithm [1]. In this work we present a novel modification to the CO-training algorithm, which is concerned with how new observations/samples are chosen at each iteration to re-train the system in order to improve the classification accuracy faster, i.e., better convergence. In our method, the new data are chosen not only based on the score from the other modality but also the score from the self modality. We demonstrate our proposed method on a multimodal person identification task using the MOBIO database, and show that it outperforms the baseline method, in terms of convergency, by a large margin.","PeriodicalId":297318,"journal":{"name":"2016 First International Workshop on Sensing, Processing and Learning for Intelligent Machines (SPLINE)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 First International Workshop on Sensing, Processing and Learning for Intelligent Machines (SPLINE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPLIM.2016.7528400","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Person identification is a very important task for intelligent devices when communicating or interacting with humans. A potential problem in real applications is that the amount of enrollment data is insufficient. When multiple modalities are available, it is possible to re-train the system online by exploiting the conditional independence between the modalities and thus improving classification accuracy. This can be achieved by the well-known CO-training algorithm [1]. In this work we present a novel modification to the CO-training algorithm, which is concerned with how new observations/samples are chosen at each iteration to re-train the system in order to improve the classification accuracy faster, i.e., better convergence. In our method, the new data are chosen not only based on the score from the other modality but also the score from the self modality. We demonstrate our proposed method on a multimodal person identification task using the MOBIO database, and show that it outperforms the baseline method, in terms of convergency, by a large margin.

查看原文本刊更多论文

提高视听人识别协同训练的收敛性

人的识别是智能设备与人类交流或互动时的一项非常重要的任务。实际应用中的一个潜在问题是注册数据的数量不足。当有多个模态可用时，可以利用模态之间的条件独立性在线重新训练系统，从而提高分类精度。这可以通过著名的CO-training算法[1]来实现。在这项工作中，我们提出了对CO-training算法的一种新的修改，它涉及如何在每次迭代中选择新的观测值/样本来重新训练系统，以便更快地提高分类精度，即更好的收敛性。在我们的方法中，新数据的选择不仅基于其他模态的得分，而且基于自模态的得分。我们使用MOBIO数据库在多模态人识别任务上演示了我们提出的方法，并表明它在收敛性方面优于基线方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2016 First International Workshop on Sensing, Processing and Learning for Intelligent Machines (SPLINE)

自引率

0.00%

发文量