Regularized Multi-Concept MIL for weakly-supervised facial behavior categorization

Proceedings of the British Machine Vision Conference 2014 Pub Date : 1900-01-01 DOI:10.5244/C.28.13

Adria Ruiz, Joost van de Weijer, Xavier Binefa

{"title":"Regularized Multi-Concept MIL for weakly-supervised facial behavior categorization","authors":"Adria Ruiz, Joost van de Weijer, Xavier Binefa","doi":"10.5244/C.28.13","DOIUrl":null,"url":null,"abstract":"In this work, we address the problem of estimating high-level semantic labels for videos of recorded people by means of analysing their facial expressions. This problem, to which we refer as facial behavior categorization, is a weakly-supervised learning problem where we do not have access to frame-by-frame facial gesture annotations but only weak-labels at the video level are available. Therefore, the goal is to learn a set of discriminative expressions appearing during the training videos and how they determine these labels. Facial behavior categorization can be posed as a Multi-Instance-Learning (MIL) problem and we propose a novel MIL method called Regularized Multi-Concept MIL to solve it. In contrast to previous approaches applied in facial behavior analysis, RMC-MIL follows a Multi-Concept assumption which allows different facial expressions (concepts) to contribute differently to the video-label. Moreover, to handle with the high-dimensional nature of facial-descriptors, RMC-MIL uses a discriminative approach to model the concepts and structured sparsity regularization to discard non-informative features. RMC-MIL is posed as a convex-constrained optimization problem where all the parameters are jointly learned using the Projected-Quasi-Newton method. In our experiments, we use two public data-sets to show the advantages of the Regularized MultiConcept approach and its improvement compared to existing MIL methods. RMC-MIL outperforms state-of-the-art results in the UNBC data-set for pain detection.","PeriodicalId":278286,"journal":{"name":"Proceedings of the British Machine Vision Conference 2014","volume":"64 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"22","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the British Machine Vision Conference 2014","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5244/C.28.13","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 22

Abstract

In this work, we address the problem of estimating high-level semantic labels for videos of recorded people by means of analysing their facial expressions. This problem, to which we refer as facial behavior categorization, is a weakly-supervised learning problem where we do not have access to frame-by-frame facial gesture annotations but only weak-labels at the video level are available. Therefore, the goal is to learn a set of discriminative expressions appearing during the training videos and how they determine these labels. Facial behavior categorization can be posed as a Multi-Instance-Learning (MIL) problem and we propose a novel MIL method called Regularized Multi-Concept MIL to solve it. In contrast to previous approaches applied in facial behavior analysis, RMC-MIL follows a Multi-Concept assumption which allows different facial expressions (concepts) to contribute differently to the video-label. Moreover, to handle with the high-dimensional nature of facial-descriptors, RMC-MIL uses a discriminative approach to model the concepts and structured sparsity regularization to discard non-informative features. RMC-MIL is posed as a convex-constrained optimization problem where all the parameters are jointly learned using the Projected-Quasi-Newton method. In our experiments, we use two public data-sets to show the advantages of the Regularized MultiConcept approach and its improvement compared to existing MIL methods. RMC-MIL outperforms state-of-the-art results in the UNBC data-set for pain detection.

查看原文本刊更多论文

弱监督面部行为分类的正则化多概念MIL

在这项工作中，我们通过分析记录的人的面部表情来解决估计视频的高级语义标签的问题。这个问题，我们称之为面部行为分类，是一个弱监督学习问题，我们无法访问逐帧的面部手势注释，只有视频级别的弱标签可用。因此，我们的目标是学习一组在训练视频中出现的判别表达式，以及它们是如何确定这些标签的。面部行为分类是一个多实例学习(MIL)问题，我们提出了一种新的多实例学习方法——正则化多概念MIL来解决这个问题。与之前应用于面部行为分析的方法相比，rmmc - mil遵循多概念假设，允许不同的面部表情(概念)对视频标签做出不同的贡献。此外，为了处理面部描述符的高维特性，rmmc - mil使用判别方法对概念进行建模，并使用结构化稀疏性正则化来丢弃非信息特征。rmmc - mil是一个凸约束优化问题，其中所有参数都是用投影拟牛顿方法联合学习的。在我们的实验中，我们使用两个公共数据集来展示正则化多概念方法的优势及其与现有MIL方法相比的改进。rmmc - mil在疼痛检测的UNBC数据集中优于最先进的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the British Machine Vision Conference 2014

自引率

0.00%

发文量