基于音频和情感线索的面试表现自动评估系统

K. Priya, S. M. Mansoor Roomi, P. Shanmugavadivu, M. Sethuraman, P. Kalaivani
{"title":"基于音频和情感线索的面试表现自动评估系统","authors":"K. Priya, S. M. Mansoor Roomi, P. Shanmugavadivu, M. Sethuraman, P. Kalaivani","doi":"10.1109/ICACCS.2019.8728458","DOIUrl":null,"url":null,"abstract":"Automatic analysis and performance evaluation of interviewees is a largely unexplored and challenging problem. The proposed work provides a computational structure to enumerate the performance of the interviewee in the context of communication to give their performance feedback from the analysis of multimodal signals such as facial images and voice. A video captured during the interview is split into audio and visual frames. From the visual frames, the face is detected and their emotions are analyzed with the help of Histogram of Oriented Gradients. The facial expressions are classified by using Support Vector Machine. The classified facial expressions are happy, fear, sad, neutral, surprise, disgust and angry. From the audio cues,, the Mel Frequency Cepstral Coefficient features are extracted and this is categorized as fluent, non-fluent-pause and non-fluent-stammer. Both emotion and fluency of the candidate are fused to find performance score. This automated analysis provides the ratings for interview behavior such as poor, medium, high performance.","PeriodicalId":249139,"journal":{"name":"2019 5th International Conference on Advanced Computing & Communication Systems (ICACCS)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"An Automated System for the Assesment of Interview Performance through Audio & Emotion Cues\",\"authors\":\"K. Priya, S. M. Mansoor Roomi, P. Shanmugavadivu, M. Sethuraman, P. Kalaivani\",\"doi\":\"10.1109/ICACCS.2019.8728458\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Automatic analysis and performance evaluation of interviewees is a largely unexplored and challenging problem. The proposed work provides a computational structure to enumerate the performance of the interviewee in the context of communication to give their performance feedback from the analysis of multimodal signals such as facial images and voice. A video captured during the interview is split into audio and visual frames. From the visual frames, the face is detected and their emotions are analyzed with the help of Histogram of Oriented Gradients. The facial expressions are classified by using Support Vector Machine. The classified facial expressions are happy, fear, sad, neutral, surprise, disgust and angry. From the audio cues,, the Mel Frequency Cepstral Coefficient features are extracted and this is categorized as fluent, non-fluent-pause and non-fluent-stammer. Both emotion and fluency of the candidate are fused to find performance score. This automated analysis provides the ratings for interview behavior such as poor, medium, high performance.\",\"PeriodicalId\":249139,\"journal\":{\"name\":\"2019 5th International Conference on Advanced Computing & Communication Systems (ICACCS)\",\"volume\":\"15 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 5th International Conference on Advanced Computing & Communication Systems (ICACCS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICACCS.2019.8728458\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 5th International Conference on Advanced Computing & Communication Systems (ICACCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICACCS.2019.8728458","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

受访者的自动分析和绩效评估在很大程度上是一个未被探索和具有挑战性的问题。提出的工作提供了一个计算结构,以列举受访者在交流背景下的表现,并从面部图像和语音等多模态信号的分析中给予他们的表现反馈。在采访过程中拍摄的视频分为音频和视觉帧。从视觉帧中检测人脸,并利用梯度直方图分析人脸的情绪。使用支持向量机对面部表情进行分类。分类的面部表情有快乐、恐惧、悲伤、中性、惊讶、厌恶和愤怒。从音频线索中提取梅尔频率倒谱系数特征,并将其分类为流利、非流利停顿和非流利结巴。候选人的情感和流利度被融合在一起,以找到表现分数。这种自动分析提供了面试行为的等级,比如差、中、高的表现。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
An Automated System for the Assesment of Interview Performance through Audio & Emotion Cues
Automatic analysis and performance evaluation of interviewees is a largely unexplored and challenging problem. The proposed work provides a computational structure to enumerate the performance of the interviewee in the context of communication to give their performance feedback from the analysis of multimodal signals such as facial images and voice. A video captured during the interview is split into audio and visual frames. From the visual frames, the face is detected and their emotions are analyzed with the help of Histogram of Oriented Gradients. The facial expressions are classified by using Support Vector Machine. The classified facial expressions are happy, fear, sad, neutral, surprise, disgust and angry. From the audio cues,, the Mel Frequency Cepstral Coefficient features are extracted and this is categorized as fluent, non-fluent-pause and non-fluent-stammer. Both emotion and fluency of the candidate are fused to find performance score. This automated analysis provides the ratings for interview behavior such as poor, medium, high performance.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信