Sameera Ghayyur, Primal Pappachan, Guoxi Wang, S. Mehrotra, N. Venkatasubramanian
{"title":"Designing privacy preserving data sharing middleware for internet of things","authors":"Sameera Ghayyur, Primal Pappachan, Guoxi Wang, S. Mehrotra, N. Venkatasubramanian","doi":"10.1145/3419016.3431484","DOIUrl":"https://doi.org/10.1145/3419016.3431484","url":null,"abstract":"The rise of low-cost Internet of Things (IoT) sensing and communication capabilities has given rise to a range of new smart services that rely on heterogeneous data from devices embedded in our everyday lives. The provision of such IoT services relies on environmental or user data from other data controllers (e.g. network provider, water agency, building management). Recent privacy regulations such as the European General Data Protection Requirement (GDPR) and California Consumer Privacy Act (CCPA) have made it mandatory for data controllers to perform enhanced processing of the shared data with appropriate privacy-preserving mechanisms before release to service providers. To facilitate this, we propose PE-IoT, a system for orchestrating privacy-enhanced data flows that (a) provides users (data subjects) with capabilities to opt-in/opt-out in the data that is shared with the service providers and (b) enable data controllers to invoke a range of Privacy Enhancing Technologies (PETs) such as anonymization, randomization, and perturbation to transform data streams into their privacy preserving counterparts. PE-IoT is based on a new model for privacy compliant data sharing and we describe the design and architecture of the PE-IoT system based on this model.","PeriodicalId":177625,"journal":{"name":"Proceedings of the Third Workshop on Data: Acquisition To Analysis","volume":"37 8","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132360350","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Person tracking and identification using cameras and wi-fi channel state information (CSI) from smartphones: dataset","authors":"Shiwei Fang, Sirajum Munir, S. Nirjon","doi":"10.1145/3419016.3431488","DOIUrl":"https://doi.org/10.1145/3419016.3431488","url":null,"abstract":"Human sensing, motion trajectory estimation, and identification are crucial to applications such as customer analysis, public safety, smart homes and cities, and access control. In the wake of the global COVID-19 pandemic, the ability to perform contact tracing effectively is vital to limit the spread of infectious diseases. Although vision-based solutions such as facial recognition can potentially scale to millions of people for identification, the privacy implications and laws to banning such a technology limit its applicability in the real world. Other techniques may require installations and maintenance of multiple units, and/or lack long-term re-identification capability. We present a dataset to fuse WiFi Channel State Information (CSI) and camera-based location information for person identification and tracking. While previous works focused on collecting WiFi CSI from stationary transmitters and receivers (laptop, desktop, or router), our WiFi CSI data are generated from a smartphone that is carried while someone is moving. In addition, we collect camera-generated real-world coordinate for each WiFi packet that can serve as ground truth location. The dataset is collected in different environments and with various numbers of persons in the scene at several days to capture real-world variations.","PeriodicalId":177625,"journal":{"name":"Proceedings of the Third Workshop on Data: Acquisition To Analysis","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115413517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The quest for raw signals: a quality review of publicly available photoplethysmography datasets","authors":"Florian Wolling, Kristof Van Laerhoven","doi":"10.1145/3419016.3431485","DOIUrl":"https://doi.org/10.1145/3419016.3431485","url":null,"abstract":"Photoplethysmography is an optical measurement principle which is present in most modern wearable devices such as fitness trackers and smartwatches. As the analysis of physiological signals requires reliable but energy-efficient algorithms, suitable datasets are essential for their development, evaluation, and benchmark. A broad variety of clinical datasets is available with recordings from medical pulse oximeters which traditionally apply transmission mode photoplethysmography at the fingertip or earlobe. However, only few publicly available datasets utilize recent reflective mode sensors which are typically worn at the wrist and whose signals show different characteristics. Moreover, the recordings are often advertised as raw, but then turn out to be preprocessed and filtered while the applied parameters are not stated. In this way, the heart rate and its variability can be extracted, but interesting secondary information from the non-stationary signal is often lost. Consequently, the test of novel signal processing approaches for wearable devices usually implies the gathering of own or the use of inappropriate data. In this paper, we present a multi-varied method to analyze the suitability and applicability of presumably raw photoplethysmography signals. We present an analytical tool which applies 7 decision metrics to characterize 10 publicly available datasets with a focus on less or ideally unfiltered, raw signals. Besides the review, we finally provide a guideline for future datasets, to suit to and to be applicable in digital signal processing, to support the development and evaluation of algorithms for resource-limited wearable devices.","PeriodicalId":177625,"journal":{"name":"Proceedings of the Third Workshop on Data: Acquisition To Analysis","volume":"111 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124747712","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multi-city street-sidewalk imagery from pedestrian mobile cameras: dataset","authors":"Shubham Jain","doi":"10.1145/3419016.3431486","DOIUrl":"https://doi.org/10.1145/3419016.3431486","url":null,"abstract":"This paper presents TerraFirma, a multi-city dataset which captures street and sidewalk imagery from the pedestrians' perspective. Motivated by challenges in the realm of pedestrian safety, we present a diverse and extensive dataset that provides a foundation for the design and validation of pedestrian safety systems that rely on street-sidewalk imagery. The data was collected by 9 volunteers in 4 metropolitan cities across the world. Volunteers carried mobile cameras or smartphones in a texting position, such that the rear camera was directed to the ground in front of them. TerraFirma classifies images by the material used for street/sidewalk construction in each city. The detailed description of dataset accrual is accompanied by the public release of the dataset.","PeriodicalId":177625,"journal":{"name":"Proceedings of the Third Workshop on Data: Acquisition To Analysis","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127852891","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Lessons from large scale campus deployment","authors":"Rishiraj Adhikary, Soham Pachpande, Nipun Batra","doi":"10.1145/3419016.3431490","DOIUrl":"https://doi.org/10.1145/3419016.3431490","url":null,"abstract":"Large scale campus deployments in the past have resulted in energy conservation measures, data validation, and software architectures. Inspired by the success and learnings from such previous deployments, we present our work on deployment involving sensing various aspect of campus sustainability like water, electricity, solar produce, air quality, and parking lot occupancy. Our full deployment spanned more 171 days. We used 469 sensors, collecting a maximum of 190 MB of data daily. We discuss the deployment challenges and the learnings obtained from them. We address the data collection challenges by providing best practices measures and provide insights from the installation of wireless radio communication modules. Our deployment can act as a reconnaissance guide for campus deployment, especially in developing countries.","PeriodicalId":177625,"journal":{"name":"Proceedings of the Third Workshop on Data: Acquisition To Analysis","volume":"35 3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123256694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Pollen video library for benchmarking detection, classification, tracking and novelty detection tasks: dataset","authors":"Nam Cao, Matthias Meyer, L. Thiele, O. Saukh","doi":"10.1145/3419016.3431487","DOIUrl":"https://doi.org/10.1145/3419016.3431487","url":null,"abstract":"Automatic pollen sensing is important to understand the local distribution of pollen in urban environments and to give personalized advice to the citizens suffering from seasonal pollen allergies to help milder the symptoms. We present a challenging data set of labeled sequential pollen images recorded with an off-the-shelf microscope to test and improve on a variety of tasks, such as pollen detection, classification, tracking, and novelty detection. Pollen samples were gathered using a novel cyclone-based particle collector. The data set contains 16 pollen types with around 35'000 microscopic images per type and covers pollen samples from trees and grasses gathered in Graz, Austria between February and August 2020. In addition, we share microscopic videos taken in the wild over 3 days in February and March 2020 with an automated pollen measurement system based on the same microscope technology to test and compare model performance in a natural environment. The data is available on Zenodo (https://zenodo.org/record/4120033).","PeriodicalId":177625,"journal":{"name":"Proceedings of the Third Workshop on Data: Acquisition To Analysis","volume":"74 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128165915","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A video dataset of a wooden box assembly process: dataset","authors":"Jiayun Zhang, Petr Byvshev, Yu Xiao","doi":"10.1145/3419016.3431492","DOIUrl":"https://doi.org/10.1145/3419016.3431492","url":null,"abstract":"This paper presents a video dataset of a 9-step wooden box assembly process including 17 subjects. The main strength of this dataset is the design of a standard and uniform workflow and the use of multiple cameras capturing videos from two different viewpoints. 62 video files were collected with the total size of 20 GB and the total duration of 13 hours. Each of the video is complemented with temporal annotations that indicate the starting and ending timestamps of each work step in the assembly process. We also provide statistical descriptive analyses of the recorded processes. Our dataset can be utilized for developing solutions to human activity recognition, process documentation, and many others that involve human-object interaction.","PeriodicalId":177625,"journal":{"name":"Proceedings of the Third Workshop on Data: Acquisition To Analysis","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121726474","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Z. Hussain, D. Waterworth, Murtadha M. N. Aldeer, W. Zhang, Quan Z. Sheng
{"title":"Toothbrushing data and analysis of its potential use in human activity recognition applications: dataset","authors":"Z. Hussain, D. Waterworth, Murtadha M. N. Aldeer, W. Zhang, Quan Z. Sheng","doi":"10.1145/3419016.3431489","DOIUrl":"https://doi.org/10.1145/3419016.3431489","url":null,"abstract":"In this paper, we describe and analyze a time-series dataset from toothbrushing activity using brush-attached and wearable sensors. The data was collected from 17 participants when they brushed their teeth over one week in 5 different locations. The dataset consists of 62 toothbrushing sessions for each of the brush-attached and wearable sensor approaches, using both electric and manual brushes. The average duration of each session is 2 minutes. One sensor device was attached to the handle of the brush while the other was worn by the participants as a wrist-watch. We collected the data from a 3-axis accelerometer and a 3-axis gyroscope at a 200 Hz sampling rate. Most of the data has been labelled. We investigated the characteristics of the data using spectral analysis and performed a pre-processing pipeline in order to generate features used to train a Support Vector Machine Classifier. We were able to identify which part of the jaw was being brushed with 98.6% accuracy.","PeriodicalId":177625,"journal":{"name":"Proceedings of the Third Workshop on Data: Acquisition To Analysis","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121582016","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Laksh Bhatia, Michael J. Breza, Ramona Marfievici, J. Mccann
{"title":"LoED: The LoRaWAN at the edge dataset: dataset","authors":"Laksh Bhatia, Michael J. Breza, Ramona Marfievici, J. Mccann","doi":"10.1145/3419016.3431491","DOIUrl":"https://doi.org/10.1145/3419016.3431491","url":null,"abstract":"This paper presents the LoRaWAN at the Edge Dataset (LoED), an open LoRaWAN packet dataset collected at gateways. Real-world LoRaWAN datasets are important for repeatable sensor-network and communications research and evaluation as, if carefully collected, they provide realistic working assumptions. LoED data is collected from nine gateways over a four month period in a dense urban environment. The dataset contains packet header information and all physical layer properties reported by gateways such as the CRC, RSSI, SNR and spreading factor. Files are provided to analyse the data and get aggregated statistics. The dataset is available at: doi.org/10.5281/zenodo.4121430","PeriodicalId":177625,"journal":{"name":"Proceedings of the Third Workshop on Data: Acquisition To Analysis","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124321164","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}