{"title":"A hybrid SMOTE and Trans-CWGAN for data imbalance in real operational AHU AFDD: A case study of an auditorium building","authors":"Seunghyeon Wang","doi":"10.1016/j.enbuild.2025.116447","DOIUrl":null,"url":null,"abstract":"<div><div>Class imbalance remains a significant challenge in Automated Fault Detection and Diagnosis (AFDD) for Air Handling Units (AHUs), as normal operating conditions significantly outnumber rare fault events. Prior studies mainly relied on simulated or laboratory-generated datasets, limiting their applicability to real-world scenarios due to insufficient operational complexity. This study introduces a novel hybrid data augmentation method combining the Synthetic Minority Over-sampling Technique (SMOTE) with Transfer Conditional Wasserstein Generative Adversarial Network (Trans-CWGAN), applied to real operational data collected over one year from an auditorium building equipped with 13 AHUs. Through hyperparameter optimization, a total of 1212 distinct datasets were generated across augmentation strategies. Among these strategies, the SMOTE-based Trans-CWGAN approach consistently delivered superior results. Specifically, TabNet achieved the highest performance, with a mean F1 score of 98.68 % and accuracy of 98.98 %, followed by RNN-LSTM (F1: 96.56 %, accuracy: 95.84 %). Even the DT model significantly improved from its initial baseline F1 score of 73.53 %. These findings underscore the effectiveness of integrating SMOTE and Trans-CWGAN to mitigate class imbalance, highlighting its strong potential for practical deployment in real-world HVAC monitoring systems.</div></div>","PeriodicalId":11641,"journal":{"name":"Energy and Buildings","volume":"348 ","pages":"Article 116447"},"PeriodicalIF":7.1000,"publicationDate":"2025-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Energy and Buildings","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0378778825011776","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Class imbalance remains a significant challenge in Automated Fault Detection and Diagnosis (AFDD) for Air Handling Units (AHUs), as normal operating conditions significantly outnumber rare fault events. Prior studies mainly relied on simulated or laboratory-generated datasets, limiting their applicability to real-world scenarios due to insufficient operational complexity. This study introduces a novel hybrid data augmentation method combining the Synthetic Minority Over-sampling Technique (SMOTE) with Transfer Conditional Wasserstein Generative Adversarial Network (Trans-CWGAN), applied to real operational data collected over one year from an auditorium building equipped with 13 AHUs. Through hyperparameter optimization, a total of 1212 distinct datasets were generated across augmentation strategies. Among these strategies, the SMOTE-based Trans-CWGAN approach consistently delivered superior results. Specifically, TabNet achieved the highest performance, with a mean F1 score of 98.68 % and accuracy of 98.98 %, followed by RNN-LSTM (F1: 96.56 %, accuracy: 95.84 %). Even the DT model significantly improved from its initial baseline F1 score of 73.53 %. These findings underscore the effectiveness of integrating SMOTE and Trans-CWGAN to mitigate class imbalance, highlighting its strong potential for practical deployment in real-world HVAC monitoring systems.
期刊介绍:
An international journal devoted to investigations of energy use and efficiency in buildings
Energy and Buildings is an international journal publishing articles with explicit links to energy use in buildings. The aim is to present new research results, and new proven practice aimed at reducing the energy needs of a building and improving indoor environment quality.