{"title":"Limitations with activity recognition methodology & data sets","authors":"J. W. Lockhart, Gary M. Weiss","doi":"10.1145/2638728.2641306","DOIUrl":null,"url":null,"abstract":"Human activity recognition (AR) has begun to mature as a field, but for AR research to thrive, large, diverse, high quality, AR data sets must be publically available and AR methodology must be clearly documented and standardized. In the process of comparing our AR research to other efforts, however, we found that most AR data sets are sufficiently limited as to impact the reliability of existing research results, and that many AR research papers do not clearly document their experimental methodology and often make unrealistic assumptions. In this paper we outline problems and limitations with AR data sets and describe the methodology problems we noticed, in the hope that this will lead to the creation of improved and better documented data sets and improved AR experimental methodology. Although we cover a broad array of methodological issues, our primary focus is on an often overlooked factor, model type, which determines how AR training and test data are partitioned---and how AR models are evaluated. Our prior research indicates that personal, hybrid, and impersonal/universal models yield dramatically different performance [30], yet many research studies do not highlight or even identify this factor. We make concrete recommendations to address these issues and also describe our own publically available AR data sets.","PeriodicalId":20496,"journal":{"name":"Proceedings of the 2014 ACM International Joint Conference on Pervasive and Ubiquitous Computing: Adjunct Publication","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2014-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"63","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2014 ACM International Joint Conference on Pervasive and Ubiquitous Computing: Adjunct Publication","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2638728.2641306","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 63
Abstract
Human activity recognition (AR) has begun to mature as a field, but for AR research to thrive, large, diverse, high-quality AR data sets must be publicly available and AR methodology must be clearly documented and standardized. In the process of comparing our AR research to other efforts, however, we found that most AR data sets are limited enough to undermine the reliability of existing research results, and that many AR research papers do not clearly document their experimental methodology and often make unrealistic assumptions. In this paper we outline problems and limitations with AR data sets and describe the methodology problems we noticed, in the hope that this will lead to the creation of improved and better documented data sets and improved AR experimental methodology. Although we cover a broad array of methodological issues, our primary focus is on an often overlooked factor, model type, which determines how AR training and test data are partitioned and how AR models are evaluated. Our prior research indicates that personal, hybrid, and impersonal/universal models yield dramatically different performance [30], yet many research studies do not highlight or even identify this factor. We make concrete recommendations to address these issues and also describe our own publicly available AR data sets.
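To make the model-type distinction concrete, the following is a minimal Python sketch of how the three model types partition AR data for a given target user. This is our own illustration, not code from the paper: the `records` structure, function name, and split fraction are all hypothetical.

```python
# Sketch of personal / hybrid / impersonal train-test partitioning.
# Assumes `records` is a list of (user_id, features, activity_label)
# tuples; all names here are illustrative, not from the paper.
import random

def split_by_model_type(records, target_user, model_type,
                        test_frac=0.3, seed=0):
    """Return (train, test) partitions for one target user."""
    rng = random.Random(seed)
    own = [r for r in records if r[0] == target_user]
    others = [r for r in records if r[0] != target_user]

    # Hold out a disjoint slice of the target user's data for testing.
    rng.shuffle(own)
    n_test = int(len(own) * test_frac)
    test, own_train = own[:n_test], own[n_test:]

    if model_type == "personal":
        # Train and test on (disjoint) data from the same user.
        return own_train, test
    if model_type == "impersonal":
        # Train only on other users; the target user is unseen.
        return others, test
    if model_type == "hybrid":
        # Train on other users plus some of the target user's data.
        return others + own_train, test
    raise ValueError(f"unknown model type: {model_type}")
```

Under this scheme, an impersonal model is always evaluated on a user it never saw during training, while a personal model is trained and tested on the same individual, which is consistent with the large performance gaps between model types that the abstract attributes to [30].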