Structural Learning of Activities from Sparse Datasets

Fifth Annual IEEE International Conference on Pervasive Computing and Communications (PerCom'07) Pub Date : 2007-03-19 DOI:10.1109/PERCOM.2007.33

F. Albinali, N. Davies, A. Friday

{"title":"Structural Learning of Activities from Sparse Datasets","authors":"F. Albinali, N. Davies, A. Friday","doi":"10.1109/PERCOM.2007.33","DOIUrl":null,"url":null,"abstract":"A major challenge in pervasive computing is to develop systems that can reliably recognize human activity patterns, such as bathing from sensor data. Typical sensor deployments generate sparse datasets with thousands of sensor readings and few instances of activities. The imbalance between the number of features (i.e. sensors firing) and the classification targets (i.e. activities) complicates the learning process. In this paper, we propose a novel framework for discovering relationships between sensor signals and observed human activities from sparse datasets. The framework builds on the use of Bayesian networks for modeling activities by representing statistical dependencies between sensors. This allows us to solve two key problems: firstly, how to automatically determine an effective structure for a Bayesian network that recognizes a particular activity without human intervention; and, secondly, we address the pragmatic problem of sparse training data, where the data available to train the activity recognizers is limited. In our approach, we `learn' the structure of the Bayesian networks automatically from the sensor data. We optimize this process in 3 ways: firstly, we perform multicollinearity analysis to focus on orthogonal sensor data with minimal redundancy. Secondly, we propose Efron's bootstrapping to generate large training sets that capture important features of an activity. Finally, we find the best Bayesian network that explains our data using a heuristic search that is unbiased to the ordering between consecutive variables. We evaluate our approach using a data set gathered from MIT's PlaceLab. The inferred networks correctly identify activities for 85% of the time","PeriodicalId":314022,"journal":{"name":"Fifth Annual IEEE International Conference on Pervasive Computing and Communications (PerCom'07)","volume":"66 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"27","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fifth Annual IEEE International Conference on Pervasive Computing and Communications (PerCom'07)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PERCOM.2007.33","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 27

Abstract

A major challenge in pervasive computing is to develop systems that can reliably recognize human activity patterns, such as bathing from sensor data. Typical sensor deployments generate sparse datasets with thousands of sensor readings and few instances of activities. The imbalance between the number of features (i.e. sensors firing) and the classification targets (i.e. activities) complicates the learning process. In this paper, we propose a novel framework for discovering relationships between sensor signals and observed human activities from sparse datasets. The framework builds on the use of Bayesian networks for modeling activities by representing statistical dependencies between sensors. This allows us to solve two key problems: firstly, how to automatically determine an effective structure for a Bayesian network that recognizes a particular activity without human intervention; and, secondly, we address the pragmatic problem of sparse training data, where the data available to train the activity recognizers is limited. In our approach, we `learn' the structure of the Bayesian networks automatically from the sensor data. We optimize this process in 3 ways: firstly, we perform multicollinearity analysis to focus on orthogonal sensor data with minimal redundancy. Secondly, we propose Efron's bootstrapping to generate large training sets that capture important features of an activity. Finally, we find the best Bayesian network that explains our data using a heuristic search that is unbiased to the ordering between consecutive variables. We evaluate our approach using a data set gathered from MIT's PlaceLab. The inferred networks correctly identify activities for 85% of the time

查看原文本刊更多论文

稀疏数据集中活动的结构学习

普适计算的一个主要挑战是开发能够可靠地识别人类活动模式的系统，例如从传感器数据中洗澡。典型的传感器部署会生成具有数千个传感器读数和少量活动实例的稀疏数据集。特征数量(即传感器发射)和分类目标(即活动)之间的不平衡使学习过程复杂化。在本文中，我们提出了一种新的框架，用于从稀疏数据集中发现传感器信号与观测到的人类活动之间的关系。该框架建立在贝叶斯网络的基础上，通过表示传感器之间的统计依赖关系来对活动进行建模。这允许我们解决两个关键问题:首先，如何自动确定贝叶斯网络的有效结构，使其在没有人为干预的情况下识别特定的活动;其次，我们解决了稀疏训练数据的实用问题，其中可用于训练活动识别器的数据是有限的。在我们的方法中，我们从传感器数据中自动“学习”贝叶斯网络的结构。我们通过3种方式优化这一过程:首先，我们进行多重共线性分析，以最小化冗余的正交传感器数据为重点。其次，我们提出了Efron的自举来生成捕获活动重要特征的大型训练集。最后，我们找到了最好的贝叶斯网络，它使用启发式搜索来解释我们的数据，该搜索对连续变量之间的顺序是无偏的。我们使用从麻省理工学院PlaceLab收集的数据集来评估我们的方法。推断出的网络在85%的时间里正确地识别活动

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Fifth Annual IEEE International Conference on Pervasive Computing and Communications (PerCom'07)

自引率

0.00%

发文量