基于视听特征和Fisher矢量编码的降噪估计

AVEC '14 Pub Date : 2014-11-07 DOI:10.1145/2661806.2661817

V. Jain, J. Crowley, A. Dey, A. Lux

{"title":"基于视听特征和Fisher矢量编码的降噪估计","authors":"V. Jain, J. Crowley, A. Dey, A. Lux","doi":"10.1145/2661806.2661817","DOIUrl":null,"url":null,"abstract":"We investigate the use of two visual descriptors: Local Binary Patterns-Three Orthogonal Planes(LBP-TOP) and Dense Trajectories for depression assessment on the AVEC 2014 challenge dataset. We encode the visual information generated by the two descriptors using Fisher Vector encoding which has been shown to be one of the best performing methods to encode visual data for image classification. We also incorporate audio features in the final system to introduce multiple input modalities. The results produced using Linear Support Vector regression outperform the baseline method.","PeriodicalId":318508,"journal":{"name":"AVEC '14","volume":"111 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"63","resultStr":"{\"title\":\"Depression Estimation Using Audiovisual Features and Fisher Vector Encoding\",\"authors\":\"V. Jain, J. Crowley, A. Dey, A. Lux\",\"doi\":\"10.1145/2661806.2661817\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We investigate the use of two visual descriptors: Local Binary Patterns-Three Orthogonal Planes(LBP-TOP) and Dense Trajectories for depression assessment on the AVEC 2014 challenge dataset. We encode the visual information generated by the two descriptors using Fisher Vector encoding which has been shown to be one of the best performing methods to encode visual data for image classification. We also incorporate audio features in the final system to introduce multiple input modalities. The results produced using Linear Support Vector regression outperform the baseline method.\",\"PeriodicalId\":318508,\"journal\":{\"name\":\"AVEC '14\",\"volume\":\"111 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-11-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"63\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"AVEC '14\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2661806.2661817\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"AVEC '14","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2661806.2661817","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 63

摘要

我们研究了在AVEC 2014挑战数据集上使用两种视觉描述符:局部二元模式-三个正交平面(LBP-TOP)和密集轨迹来评估抑郁。我们使用Fisher向量编码对两个描述符生成的视觉信息进行编码，Fisher向量编码已被证明是编码图像分类视觉数据的最佳方法之一。我们还在最终系统中加入了音频功能，以引入多种输入模式。使用线性支持向量回归产生的结果优于基线方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Depression Estimation Using Audiovisual Features and Fisher Vector Encoding

We investigate the use of two visual descriptors: Local Binary Patterns-Three Orthogonal Planes(LBP-TOP) and Dense Trajectories for depression assessment on the AVEC 2014 challenge dataset. We encode the visual information generated by the two descriptors using Fisher Vector encoding which has been shown to be one of the best performing methods to encode visual data for image classification. We also incorporate audio features in the final system to introduce multiple input modalities. The results produced using Linear Support Vector regression outperform the baseline method.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

AVEC '14

自引率

0.00%

发文量