{"title":"MFFTD: A Multiscale Feature Fusion Transformer Detector for Electricity Theft Based on Semi-Supervised Learning","authors":"Yufeng Wang;Zhijie Wu;Jianhua Ma;Qun Jin","doi":"10.1109/TIM.2025.3552857","DOIUrl":null,"url":null,"abstract":"The wide deployment of advanced metering infrastructure (AMI) in power systems allows utility companies to automatically and accurately collect and process the time-series load profiles of households, but meanwhile incurs the severe electricity theft (ET) that some illegal residential users may manipulate their electricity consumptions to reduce their billings. Although, due to the powerful ability of modeling long-range dependencies in sequential data, transformer has been widely used for time-series modeling including ET detection (ETD), the significant weakpoint lies in that it only considers the attention weights between either points or prepatched subsequences (i.e., patches) of fixed size within the input sequence, which cannot fully characterize the relationships among multiscale temporal patches and lead to suboptimal detection performance. To address the above issue, based on self-supervised feature extraction and supervised fine-tuning, our work proposes a novel multiscale feature fusion transformer encoder (TE)-based ETD framework, MFFTD. Specifically, our work’s contributions are following. First, in MFFTD, a hierarchical patching enhanced TE (HPTE) is explicitly designed, in which each layer patches the input sequence with variable patch size. Then, through hierarchically stacked multiple HPTE layers, the feature combining multiscale patches can be effectively extracted. Second, considering the constraint that only a small percentage of labeled theft samples are practically available, our work first pretrains the structure parameters of MFFTD through a self-supervised pretext task of forecasting the randomly masked segments in time series. Then, the small percentage of labeled anomalous samples is used to fine-tune the MFFTD model. Extensive experiments on multiple real datasets demonstrate our proposed MFFTD scheme outperforms the state-of-the-art (SOTA) transformer-based supervised and semi-supervised ETD methods.","PeriodicalId":13341,"journal":{"name":"IEEE Transactions on Instrumentation and Measurement","volume":"74 ","pages":"1-10"},"PeriodicalIF":5.6000,"publicationDate":"2025-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Instrumentation and Measurement","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10934042/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
The wide deployment of advanced metering infrastructure (AMI) in power systems allows utility companies to automatically and accurately collect and process the time-series load profiles of households, but meanwhile incurs the severe electricity theft (ET) that some illegal residential users may manipulate their electricity consumptions to reduce their billings. Although, due to the powerful ability of modeling long-range dependencies in sequential data, transformer has been widely used for time-series modeling including ET detection (ETD), the significant weakpoint lies in that it only considers the attention weights between either points or prepatched subsequences (i.e., patches) of fixed size within the input sequence, which cannot fully characterize the relationships among multiscale temporal patches and lead to suboptimal detection performance. To address the above issue, based on self-supervised feature extraction and supervised fine-tuning, our work proposes a novel multiscale feature fusion transformer encoder (TE)-based ETD framework, MFFTD. Specifically, our work’s contributions are following. First, in MFFTD, a hierarchical patching enhanced TE (HPTE) is explicitly designed, in which each layer patches the input sequence with variable patch size. Then, through hierarchically stacked multiple HPTE layers, the feature combining multiscale patches can be effectively extracted. Second, considering the constraint that only a small percentage of labeled theft samples are practically available, our work first pretrains the structure parameters of MFFTD through a self-supervised pretext task of forecasting the randomly masked segments in time series. Then, the small percentage of labeled anomalous samples is used to fine-tune the MFFTD model. Extensive experiments on multiple real datasets demonstrate our proposed MFFTD scheme outperforms the state-of-the-art (SOTA) transformer-based supervised and semi-supervised ETD methods.
期刊介绍:
Papers are sought that address innovative solutions to the development and use of electrical and electronic instruments and equipment to measure, monitor and/or record physical phenomena for the purpose of advancing measurement science, methods, functionality and applications. The scope of these papers may encompass: (1) theory, methodology, and practice of measurement; (2) design, development and evaluation of instrumentation and measurement systems and components used in generating, acquiring, conditioning and processing signals; (3) analysis, representation, display, and preservation of the information obtained from a set of measurements; and (4) scientific and technical support to establishment and maintenance of technical standards in the field of Instrumentation and Measurement.