Title: Cognitive Multimodal Processing: from Signal to Behavior
Author: A. Potamianos
Published in: RFMIR '14 (2014-11-16)
DOI: 10.1145/2666253.2666264 (https://doi.org/10.1145/2666253.2666264)
Citations: 5
Abstract
Affective computing and social and behavioral signal processing are emerging research disciplines that attempt to automatically label the emotional, social and cognitive state of humans using features extracted from audio-visual streams. I argue that this monumental task cannot succeed unless the particularities of human cognitive processing are incorporated into our models, especially given that the quantities we are called to model are often either biased cognitive abstractions of the real world or altogether fictional creations of our cognition. A variety of cognitive processes that make computational modeling especially challenging are outlined herein, notably: 1) (joint) attention and saliency, 2) common ground, conceptual semantic spaces and representation learning, 3) fusion across time, modalities and cognitive representation layers, and 4) dual-system processing (system one vs. system two) and cognitive decision non-linearities. The grand challenges are outlined, and examples are given illustrating how to design models that are both high-performing and respect basic cognitive organization principles. It is shown that such models can achieve good generalization and representation power, as well as model cognitive biases, a prerequisite for modeling and predicting human behavior.
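The fusion challenge named in point 3 can be made concrete with a minimal sketch (not taken from the paper; all function names, weights, and scores below are hypothetical) contrasting the two standard fusion levels: feature-level (early) fusion, which combines modality features before classification, and decision-level (late) fusion, which combines per-modality scores afterwards.

```python
# Illustrative sketch of early vs. late multimodal fusion.
# All names and numeric values are hypothetical, for exposition only.

def early_fusion(audio_feats, visual_feats, weights):
    """Feature-level (early) fusion: concatenate the modality feature
    vectors and score the joint vector with one linear model."""
    joint = audio_feats + visual_feats  # list concatenation
    return sum(w * f for w, f in zip(weights, joint))

def late_fusion(audio_score, visual_score, alpha=0.5):
    """Decision-level (late) fusion: combine per-modality classifier
    scores; alpha weights the audio stream against the visual one."""
    return alpha * audio_score + (1 - alpha) * visual_score
```

Late fusion is simpler and robust to a missing modality, while early fusion can capture cross-modal feature interactions; fusing across cognitive representation layers, as the abstract argues, would require going beyond either of these linear schemes.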