A semi-labelled dataset for fault detection in air handling units from a large-scale office

IF 1 Q3 MULTIDISCIPLINARY SCIENCES

Data in Brief Pub Date : 2024-09-21 DOI:10.1016/j.dib.2024.110956

Seunghyeon Wang, Ikchul Eum, Sangkyun Park, Jaejun Kim

{"title":"A semi-labelled dataset for fault detection in air handling units from a large-scale office","authors":"Seunghyeon Wang, Ikchul Eum, Sangkyun Park, Jaejun Kim","doi":"10.1016/j.dib.2024.110956","DOIUrl":null,"url":null,"abstract":"<div><div>Fault detection and diagnosis (FDD) in Air Handling Units (AHUs) ensure building functions such as energy efficiency and occupant comfort by quickly identifying and diagnosing faults. Combining deep learning with FDD has demonstrated high generalization ability in this field. To develop deep learning models, this research constructed a dataset sourced from real data collected from a large-scale office in South Korea. The raw AHU data were extracted from the Building Management System (BMS) at 1-h intervals, spanning from November 2023 to May 2024. The dataset was partially labeled by annotation experts, categorizing the data into six types: normal condition, supply fan fault, total heating pump fault, return air temperature sensor fault, supply air Temperature sensor fault, and valve position fault. Additionally, semi-supervised learning methods were applied as an application example using this constructed dataset. The main contributions of this dataset to the field are twofold. First, it represents a unique dataset sourced from the real operational data of a large-scale office, which is currently non-existent in this domain. Second, the dataset's expert labeling adds significant value by ensuring accurate fault classification. Therefore, we hope that this dataset will encourage the development of robust FDD techniques that are more suitable for real-world applications.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"57 ","pages":"Article 110956"},"PeriodicalIF":1.0000,"publicationDate":"2024-09-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data in Brief","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352340924009181","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}

引用次数: 0

Abstract

Fault detection and diagnosis (FDD) in Air Handling Units (AHUs) ensure building functions such as energy efficiency and occupant comfort by quickly identifying and diagnosing faults. Combining deep learning with FDD has demonstrated high generalization ability in this field. To develop deep learning models, this research constructed a dataset sourced from real data collected from a large-scale office in South Korea. The raw AHU data were extracted from the Building Management System (BMS) at 1-h intervals, spanning from November 2023 to May 2024. The dataset was partially labeled by annotation experts, categorizing the data into six types: normal condition, supply fan fault, total heating pump fault, return air temperature sensor fault, supply air Temperature sensor fault, and valve position fault. Additionally, semi-supervised learning methods were applied as an application example using this constructed dataset. The main contributions of this dataset to the field are twofold. First, it represents a unique dataset sourced from the real operational data of a large-scale office, which is currently non-existent in this domain. Second, the dataset's expert labeling adds significant value by ensuring accurate fault classification. Therefore, we hope that this dataset will encourage the development of robust FDD techniques that are more suitable for real-world applications.

查看原文本刊更多论文

用于检测大型办公室空气处理装置故障的半标签数据集

空气处理机组（AHU）中的故障检测与诊断（FDD）可通过快速识别和诊断故障，确保建筑物的能效和居住舒适度等功能。将深度学习与 FDD 相结合已在该领域展现出很高的泛化能力。为了开发深度学习模型，本研究构建了一个数据集，该数据集来源于从韩国大型办公室收集的真实数据。AHU 原始数据是从楼宇管理系统（BMS）中以 1 小时为间隔提取的，时间跨度为 2023 年 11 月至 2024 年 5 月。数据集由标注专家进行了部分标注，将数据分为六种类型：正常状态、送风机故障、总加热泵故障、回风温度传感器故障、送风温度传感器故障和阀门位置故障。此外，还利用所构建的数据集作为应用实例，应用了半监督学习方法。该数据集对该领域的主要贡献有两个方面。首先，它代表了一个独特的数据集，该数据集来源于大型办公室的真实运行数据，目前在该领域尚不存在。其次，数据集的专家标注确保了故障分类的准确性，从而增加了数据集的重要价值。因此，我们希望该数据集能鼓励开发更适合实际应用的稳健 FDD 技术。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Data in Brief MULTIDISCIPLINARY SCIENCES-

CiteScore

3.10

自引率

0.00%

发文量

996

审稿时长

70 days

期刊介绍： Data in Brief provides a way for researchers to easily share and reuse each other''s datasets by publishing data articles that: -Thoroughly describe your data, facilitating reproducibility. -Make your data, which is often buried in supplementary material, easier to find. -Increase traffic towards associated research articles and data, leading to more citations. -Open up doors for new collaborations. Because you never know what data will be useful to someone else, Data in Brief welcomes submissions that describe data from all research areas.