Stefan Winterberger, Dmitriy An, Martin Biallas, Andrew Paice
{"title":"Smart home environment data across 4 European countries","authors":"Stefan Winterberger, Dmitriy An, Martin Biallas, Andrew Paice","doi":"10.1016/j.dib.2025.111636","DOIUrl":null,"url":null,"abstract":"<div><div>This paper describes a dataset of anonymised smart home environment data that was collected during a project over 359 days (16.05.2023-08.05.2024). The dataset contains information about temperature (°C), humidity (%), ambient light (lux), CO<sub>2</sub> (ppm), VOC (ppm), sound pressure level (dB) in a time interval of 2–5 min in addition to event based data from PIR-Sensors and door contact sensors. Additionally, time and location information for each data point is available in the form of a time stamp, the user ID, the room and the country. The dataset was collected in 4 different European countries from a total of 62 users at their residential settings. Different installations had different sets of sensors, meaning not all parameters were measured at every location. The target group for the field trials was elderly people 65+.</div><div>During the project it could be shown that the data can be used to estimate presence in a room, based on the environmental data only, where the output of the PIR-Sensors were used as proxy labels. The weakness of the dataset is the lack of validated ground truth, which makes supervised learning approaches difficult. The strength of the dataset lies in the variety of sensors including sound pressure and the long period (nearly 1 year) of high frequency measurements in different countries.</div><div>Collecting data in real-world residential settings is challenging, but by making this dataset publicly available, we provide researchers with a valuable resource to explore smart home applications, presence detection, and environmental monitoring in everyday life.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"60 ","pages":"Article 111636"},"PeriodicalIF":1.0000,"publicationDate":"2025-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data in Brief","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352340925003671","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
This paper describes a dataset of anonymised smart home environment data that was collected during a project over 359 days (16.05.2023-08.05.2024). The dataset contains information about temperature (°C), humidity (%), ambient light (lux), CO2 (ppm), VOC (ppm), sound pressure level (dB) in a time interval of 2–5 min in addition to event based data from PIR-Sensors and door contact sensors. Additionally, time and location information for each data point is available in the form of a time stamp, the user ID, the room and the country. The dataset was collected in 4 different European countries from a total of 62 users at their residential settings. Different installations had different sets of sensors, meaning not all parameters were measured at every location. The target group for the field trials was elderly people 65+.
During the project it could be shown that the data can be used to estimate presence in a room, based on the environmental data only, where the output of the PIR-Sensors were used as proxy labels. The weakness of the dataset is the lack of validated ground truth, which makes supervised learning approaches difficult. The strength of the dataset lies in the variety of sensors including sound pressure and the long period (nearly 1 year) of high frequency measurements in different countries.
Collecting data in real-world residential settings is challenging, but by making this dataset publicly available, we provide researchers with a valuable resource to explore smart home applications, presence detection, and environmental monitoring in everyday life.
期刊介绍:
Data in Brief provides a way for researchers to easily share and reuse each other''s datasets by publishing data articles that: -Thoroughly describe your data, facilitating reproducibility. -Make your data, which is often buried in supplementary material, easier to find. -Increase traffic towards associated research articles and data, leading to more citations. -Open up doors for new collaborations. Because you never know what data will be useful to someone else, Data in Brief welcomes submissions that describe data from all research areas.