Towards scalable activity recognition: adapting zero-effort crowdsourced acoustic models

Proceedings of the 12th International Conference on Mobile and Ubiquitous Multimedia Pub Date : 2013-12-02 DOI:10.1145/2541831.2541832

Long-Van Nguyen-Dinh, Ulf Blanke, G. Tröster

{"title":"Towards scalable activity recognition: adapting zero-effort crowdsourced acoustic models","authors":"Long-Van Nguyen-Dinh, Ulf Blanke, G. Tröster","doi":"10.1145/2541831.2541832","DOIUrl":null,"url":null,"abstract":"Human activity recognition systems traditionally require a manual annotation of massive training data, which is laborious and non-scalable. An alternative approach is mining existing online crowd-sourced repositories for open-ended, free annotated training data. However, differences across data sources or in observed contexts prevent a crowd-sourced based model reaching user-dependent recognition rates. To enhance the use of crowd-sourced data in activity recognition, we take an essential step forward by adapting a generic model based on crowd-sourced data to a personalized model. In this work, we investigate two adapting approaches: 1) a semi-supervised learning to combine crowd-sourced data and unlabeled user data, and 2) an active-learning to query the user for labeling samples where the crowd-sourced based model fails to recognize. We test our proposed approaches on 7 users using auditory modality on mobile phones with a total data of 14 days and up to 9 daily context classes. Experimental results indicate that the semi-supervised model can indeed improve the recognition accuracy up to 21% but is still significantly outperformed by a supervised model on user data. In the active learning scheme, the crowd-sourced model can reach the performance of the supervised model by requesting labels of 0.7% of user data only. Our work illustrates a promising first step towards an unobtrusive, efficient and open-ended context recognition system by adapting free online crowd-sourced data into a personalized model.","PeriodicalId":286368,"journal":{"name":"Proceedings of the 12th International Conference on Mobile and Ubiquitous Multimedia","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 12th International Conference on Mobile and Ubiquitous Multimedia","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2541831.2541832","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 7

Abstract

Human activity recognition systems traditionally require a manual annotation of massive training data, which is laborious and non-scalable. An alternative approach is mining existing online crowd-sourced repositories for open-ended, free annotated training data. However, differences across data sources or in observed contexts prevent a crowd-sourced based model reaching user-dependent recognition rates. To enhance the use of crowd-sourced data in activity recognition, we take an essential step forward by adapting a generic model based on crowd-sourced data to a personalized model. In this work, we investigate two adapting approaches: 1) a semi-supervised learning to combine crowd-sourced data and unlabeled user data, and 2) an active-learning to query the user for labeling samples where the crowd-sourced based model fails to recognize. We test our proposed approaches on 7 users using auditory modality on mobile phones with a total data of 14 days and up to 9 daily context classes. Experimental results indicate that the semi-supervised model can indeed improve the recognition accuracy up to 21% but is still significantly outperformed by a supervised model on user data. In the active learning scheme, the crowd-sourced model can reach the performance of the supervised model by requesting labels of 0.7% of user data only. Our work illustrates a promising first step towards an unobtrusive, efficient and open-ended context recognition system by adapting free online crowd-sourced data into a personalized model.

查看原文本刊更多论文

面向可扩展的活动识别:适应零努力众包声学模型

传统的人类活动识别系统需要对大量的训练数据进行手工标注，这是一项费力且不可扩展的工作。另一种方法是挖掘现有的在线众包存储库，以获取开放的、免费的带注释的训练数据。然而，数据源之间的差异或观察到的上下文阻碍了基于众包的模型达到依赖用户的识别率。为了加强在活动识别中使用众包数据，我们迈出了重要的一步，将基于众包数据的通用模型调整为个性化模型。在这项工作中，我们研究了两种自适应方法:1)半监督学习，将众包数据和未标记的用户数据结合起来;2)主动学习，向用户查询基于众包模型无法识别的标记样本。我们在7个使用手机听觉模式的用户身上测试了我们提出的方法，这些用户的总数据为14天，每天最多有9个上下文课程。实验结果表明，半监督模型确实可以将识别精度提高到21%，但在用户数据上仍然明显优于监督模型。在主动学习方案中，众包模型只需请求0.7%的用户数据的标签就可以达到监督模型的性能。我们的工作表明，通过将免费的在线众包数据转化为个性化模型，朝着不引人注目、高效和开放式的上下文识别系统迈出了有希望的第一步。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 12th International Conference on Mobile and Ubiquitous Multimedia

自引率

0.00%

发文量