{"title":"挖掘鲁棒邻域用于传感器数据的质量控制","authors":"D. Galarus, R. Angryk","doi":"10.1145/2534303.2534309","DOIUrl":null,"url":null,"abstract":"Neighborhoods, as used for spatial and spatial-temporal data mining, define areas of similarity in data. Unless defined to account for outliers, missing data and spatial-temporal variation, the robustness of methods utilizing neighborhoods will suffer. The focus of this paper is to demonstrate that neighborhoods can be defined and used in a robust manner that is resistant to such challenges. Our approach employs robust methods in both neighborhood construction and neighborhood application to estimate observations. These methods were tested with a large weather sensor data set from the National Weather Service that includes quality control indicators. Results were compared to a popular method used in the weather community, evaluated by root-mean-squared error and grouped by quality control indicator. Our first time published results show that our methods are robust in the presence of outliers, missing data and spatial-temporal variation, yielding results consistent with quality control labels assigned to the data by the provider by way of an extensive rule-based system, indicating that our approaches show promise for use in quality control assessment.","PeriodicalId":190366,"journal":{"name":"International Workshop on GeoStreaming","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"Mining robust neighborhoods for quality control of sensor data\",\"authors\":\"D. Galarus, R. Angryk\",\"doi\":\"10.1145/2534303.2534309\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Neighborhoods, as used for spatial and spatial-temporal data mining, define areas of similarity in data. Unless defined to account for outliers, missing data and spatial-temporal variation, the robustness of methods utilizing neighborhoods will suffer. The focus of this paper is to demonstrate that neighborhoods can be defined and used in a robust manner that is resistant to such challenges. Our approach employs robust methods in both neighborhood construction and neighborhood application to estimate observations. These methods were tested with a large weather sensor data set from the National Weather Service that includes quality control indicators. Results were compared to a popular method used in the weather community, evaluated by root-mean-squared error and grouped by quality control indicator. Our first time published results show that our methods are robust in the presence of outliers, missing data and spatial-temporal variation, yielding results consistent with quality control labels assigned to the data by the provider by way of an extensive rule-based system, indicating that our approaches show promise for use in quality control assessment.\",\"PeriodicalId\":190366,\"journal\":{\"name\":\"International Workshop on GeoStreaming\",\"volume\":\"6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-11-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Workshop on GeoStreaming\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2534303.2534309\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Workshop on GeoStreaming","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2534303.2534309","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Mining robust neighborhoods for quality control of sensor data
Neighborhoods, as used for spatial and spatial-temporal data mining, define areas of similarity in data. Unless defined to account for outliers, missing data and spatial-temporal variation, the robustness of methods utilizing neighborhoods will suffer. The focus of this paper is to demonstrate that neighborhoods can be defined and used in a robust manner that is resistant to such challenges. Our approach employs robust methods in both neighborhood construction and neighborhood application to estimate observations. These methods were tested with a large weather sensor data set from the National Weather Service that includes quality control indicators. Results were compared to a popular method used in the weather community, evaluated by root-mean-squared error and grouped by quality control indicator. Our first time published results show that our methods are robust in the presence of outliers, missing data and spatial-temporal variation, yielding results consistent with quality control labels assigned to the data by the provider by way of an extensive rule-based system, indicating that our approaches show promise for use in quality control assessment.