Missing sensor value estimation method for participatory sensing environment

2014 IEEE International Conference on Pervasive Computing and Communications (PerCom) Pub Date : 2014-03-24 DOI:10.1109/PerCom.2014.6813950

H. Kurasawa, Hiroshi Sato, Atsushi Yamamoto, Hitoshi Kawasaki, Motonori Nakamura, Yohei Fujii, Hajime Matsumura

{"title":"Missing sensor value estimation method for participatory sensing environment","authors":"H. Kurasawa, Hiroshi Sato, Atsushi Yamamoto, Hitoshi Kawasaki, Motonori Nakamura, Yohei Fujii, Hajime Matsumura","doi":"10.1109/PerCom.2014.6813950","DOIUrl":null,"url":null,"abstract":"Participatory sensing produces incomplete sensor data. Thus, we have to fill in the gaps of any missing values in the sensor data in order to provide sensor-based services. We propose a method to estimate a missing value of incomplete sensor data. It accurately estimates a missing value by repeating two processes: selecting sensors locally correlated with the sensor that includes the missing value and then updating the training sensor dataset that consist of data from the selected sensors available for multiple regression. This procedure effectively helps to find more suitable neighbor records of a query record from the training sensor dataset and to refine the regression model using the records. It overcomes three problems that other estimation methods have: a decrease in the amount of available training sensor dataset due to missing values, the difficulty in finding similar records of a query due to the “curse of dimensionality,” and the complexity in formalizing the estimation model due to “overfitting.” The main feature of our method is the way it repeatedly prunes inessential sensors while exploiting the anti-monotone property in which the training sensor dataset R' that consist of the sensors V' ⊂ V is larger than the data R that consist of V. Empirical evaluations done using public datasets in which we appended missing values show that our method increases the training sensor dataset for estimation and improves estimation accuracy through repeated sensor selections. Furthermore, we confirmed through a field trial and a life-log enrichment trial, that our method was effective for estimating missing sensor values in a participatory sensing environment.","PeriodicalId":263520,"journal":{"name":"2014 IEEE International Conference on Pervasive Computing and Communications (PerCom)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE International Conference on Pervasive Computing and Communications (PerCom)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PerCom.2014.6813950","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 19

Abstract

Participatory sensing produces incomplete sensor data. Thus, we have to fill in the gaps of any missing values in the sensor data in order to provide sensor-based services. We propose a method to estimate a missing value of incomplete sensor data. It accurately estimates a missing value by repeating two processes: selecting sensors locally correlated with the sensor that includes the missing value and then updating the training sensor dataset that consist of data from the selected sensors available for multiple regression. This procedure effectively helps to find more suitable neighbor records of a query record from the training sensor dataset and to refine the regression model using the records. It overcomes three problems that other estimation methods have: a decrease in the amount of available training sensor dataset due to missing values, the difficulty in finding similar records of a query due to the “curse of dimensionality,” and the complexity in formalizing the estimation model due to “overfitting.” The main feature of our method is the way it repeatedly prunes inessential sensors while exploiting the anti-monotone property in which the training sensor dataset R' that consist of the sensors V' ⊂ V is larger than the data R that consist of V. Empirical evaluations done using public datasets in which we appended missing values show that our method increases the training sensor dataset for estimation and improves estimation accuracy through repeated sensor selections. Furthermore, we confirmed through a field trial and a life-log enrichment trial, that our method was effective for estimating missing sensor values in a participatory sensing environment.

查看原文本刊更多论文

参与式传感环境下缺失传感器值估计方法

参与式传感产生不完整的传感器数据。因此，为了提供基于传感器的服务，我们必须填补传感器数据中任何缺失值的空白。我们提出了一种估计不完整传感器数据缺失值的方法。它通过重复两个过程来准确地估计缺失值:选择与包含缺失值的传感器局部相关的传感器，然后更新由可用于多元回归的选定传感器数据组成的训练传感器数据集。该过程有效地帮助从训练传感器数据集中找到查询记录的更合适的邻居记录，并利用这些记录来改进回归模型。它克服了其他估计方法存在的三个问题:由于缺失值而导致可用训练传感器数据集数量减少，由于“维度诅咒”而难以找到类似查询记录，以及由于“过拟合”而形式化估计模型的复杂性。我们方法的主要特征是，它在利用反单调特性的同时重复修剪不必要的传感器，其中由传感器V´∧V组成的训练传感器数据集R′大于由V组成的数据R。使用我们附加缺失值的公共数据集进行的经验评估表明，我们的方法增加了用于估计的训练传感器数据集，并通过重复的传感器选择提高了估计精度。此外，我们通过现场试验和生命日志浓缩试验证实，我们的方法对于在参与式传感环境中估计缺失的传感器值是有效的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2014 IEEE International Conference on Pervasive Computing and Communications (PerCom)

自引率

0.00%

发文量