面对不平衡的数据,关于使用绩效指标的建议。

László A Jeni, Jeffrey F Cohn, Fernando De La Torre
{"title":"面对不平衡的数据,关于使用绩效指标的建议。","authors":"László A Jeni, Jeffrey F Cohn, Fernando De La Torre","doi":"10.1109/ACII.2013.47","DOIUrl":null,"url":null,"abstract":"<p><p>Recognizing facial action units (AUs) is important for situation analysis and automated video annotation. Previous work has emphasized face tracking and registration and the choice of features classifiers. Relatively neglected is the effect of imbalanced data for action unit detection. While the machine learning community has become aware of the problem of skewed data for training classifiers, little attention has been paid to how skew may bias performance metrics. To address this question, we conducted experiments using both simulated classifiers and three major databases that differ in size, type of FACS coding, and degree of skew. We evaluated influence of skew on both threshold metrics (Accuracy, F-score, Cohen's kappa, and Krippendorf's alpha) and rank metrics (area under the receiver operating characteristic (ROC) curve and precision-recall curve). With exception of area under the ROC curve, all were attenuated by skewed distributions, in many cases, dramatically so. While ROC was unaffected by skew, precision-recall curves suggest that ROC may mask poor performance. Our findings suggest that skew is a critical factor in evaluating performance metrics. To avoid or minimize skew-biased estimates of performance, we recommend reporting skew-normalized scores along with the obtained ones.</p>","PeriodicalId":89154,"journal":{"name":"International Conference on Affective Computing and Intelligent Interaction and workshops : [proceedings]. ACII (Conference)","volume":"2013 ","pages":"245-251"},"PeriodicalIF":0.0000,"publicationDate":"2013-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4285355/pdf/nihms-554962.pdf","citationCount":"0","resultStr":"{\"title\":\"Facing Imbalanced Data Recommendations for the Use of Performance Metrics.\",\"authors\":\"László A Jeni, Jeffrey F Cohn, Fernando De La Torre\",\"doi\":\"10.1109/ACII.2013.47\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Recognizing facial action units (AUs) is important for situation analysis and automated video annotation. Previous work has emphasized face tracking and registration and the choice of features classifiers. Relatively neglected is the effect of imbalanced data for action unit detection. While the machine learning community has become aware of the problem of skewed data for training classifiers, little attention has been paid to how skew may bias performance metrics. To address this question, we conducted experiments using both simulated classifiers and three major databases that differ in size, type of FACS coding, and degree of skew. We evaluated influence of skew on both threshold metrics (Accuracy, F-score, Cohen's kappa, and Krippendorf's alpha) and rank metrics (area under the receiver operating characteristic (ROC) curve and precision-recall curve). With exception of area under the ROC curve, all were attenuated by skewed distributions, in many cases, dramatically so. While ROC was unaffected by skew, precision-recall curves suggest that ROC may mask poor performance. Our findings suggest that skew is a critical factor in evaluating performance metrics. To avoid or minimize skew-biased estimates of performance, we recommend reporting skew-normalized scores along with the obtained ones.</p>\",\"PeriodicalId\":89154,\"journal\":{\"name\":\"International Conference on Affective Computing and Intelligent Interaction and workshops : [proceedings]. ACII (Conference)\",\"volume\":\"2013 \",\"pages\":\"245-251\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4285355/pdf/nihms-554962.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference on Affective Computing and Intelligent Interaction and workshops : [proceedings]. ACII (Conference)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ACII.2013.47\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Affective Computing and Intelligent Interaction and workshops : [proceedings]. ACII (Conference)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACII.2013.47","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

识别面部动作单元(AU)对于情境分析和自动视频标注非常重要。以往的工作强调人脸跟踪和注册以及特征分类器的选择。相对被忽视的是不平衡数据对动作单元检测的影响。虽然机器学习界已经意识到训练分类器时数据偏斜的问题,但很少有人关注偏斜会如何影响性能指标。为了解决这个问题,我们使用模拟分类器和三个主要数据库进行了实验,这三个数据库在大小、FACS 编码类型和倾斜程度上各不相同。我们评估了偏斜对阈值指标(准确度、F-分数、科恩卡帕和克里彭多夫α)和等级指标(接收者操作特征曲线(ROC)下面积和精确度-召回曲线)的影响。除 ROC 曲线下面积外,其他指标均受偏斜分布的影响,在许多情况下,偏斜分布的影响非常明显。虽然 ROC 不受偏斜的影响,但精确再现曲线表明 ROC 可能会掩盖较差的表现。我们的研究结果表明,偏斜是评估性能指标的一个关键因素。为了避免或尽量减少对性能的偏斜估计,我们建议在报告所得分数的同时报告偏斜归一化分数。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Facing Imbalanced Data Recommendations for the Use of Performance Metrics.

Recognizing facial action units (AUs) is important for situation analysis and automated video annotation. Previous work has emphasized face tracking and registration and the choice of features classifiers. Relatively neglected is the effect of imbalanced data for action unit detection. While the machine learning community has become aware of the problem of skewed data for training classifiers, little attention has been paid to how skew may bias performance metrics. To address this question, we conducted experiments using both simulated classifiers and three major databases that differ in size, type of FACS coding, and degree of skew. We evaluated influence of skew on both threshold metrics (Accuracy, F-score, Cohen's kappa, and Krippendorf's alpha) and rank metrics (area under the receiver operating characteristic (ROC) curve and precision-recall curve). With exception of area under the ROC curve, all were attenuated by skewed distributions, in many cases, dramatically so. While ROC was unaffected by skew, precision-recall curves suggest that ROC may mask poor performance. Our findings suggest that skew is a critical factor in evaluating performance metrics. To avoid or minimize skew-biased estimates of performance, we recommend reporting skew-normalized scores along with the obtained ones.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信