{"title":"Intensive Class Imbalance Learning in Drifting Data Streams","authors":"Muhammad Usman;Huanhuan Chen","doi":"10.1109/TETCI.2024.3399657","DOIUrl":null,"url":null,"abstract":"Streaming data analysis faces two primary challenges: concept drifts and class imbalance. The co-occurrence of virtual drifts and class imbalance is a common real-world scenario requiring dedicated solutions. This paper presents Intensive Class Imbalance Learning (ICIL), a novel supervised classification method for virtually drifting data streams. ICIL facilitates the detection of virtual drifts through a feature-sensitive change detection method. It calibrates the data over time to resolve within-class imbalance, overlaps, and small sample size problems. A weighted voting ensemble is proposed for enhanced performance, wherein weights are constantly updated based on the recent performance of the member classifiers. Experiments are conducted on 14 synthetic and real-world data streams to demonstrate the efficacy of the proposed method. The comparative analysis against 11 state-of-the-art methods shows that the proposed method outperforms the other methods in 9/14 data streams on the G-mean metric.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"8 5","pages":"3503-3517"},"PeriodicalIF":5.3000,"publicationDate":"2024-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Emerging Topics in Computational Intelligence","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10533716/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Streaming data analysis faces two primary challenges: concept drifts and class imbalance. The co-occurrence of virtual drifts and class imbalance is a common real-world scenario requiring dedicated solutions. This paper presents Intensive Class Imbalance Learning (ICIL), a novel supervised classification method for virtually drifting data streams. ICIL facilitates the detection of virtual drifts through a feature-sensitive change detection method. It calibrates the data over time to resolve within-class imbalance, overlaps, and small sample size problems. A weighted voting ensemble is proposed for enhanced performance, wherein weights are constantly updated based on the recent performance of the member classifiers. Experiments are conducted on 14 synthetic and real-world data streams to demonstrate the efficacy of the proposed method. The comparative analysis against 11 state-of-the-art methods shows that the proposed method outperforms the other methods in 9/14 data streams on the G-mean metric.
期刊介绍:
The IEEE Transactions on Emerging Topics in Computational Intelligence (TETCI) publishes original articles on emerging aspects of computational intelligence, including theory, applications, and surveys.
TETCI is an electronics only publication. TETCI publishes six issues per year.
Authors are encouraged to submit manuscripts in any emerging topic in computational intelligence, especially nature-inspired computing topics not covered by other IEEE Computational Intelligence Society journals. A few such illustrative examples are glial cell networks, computational neuroscience, Brain Computer Interface, ambient intelligence, non-fuzzy computing with words, artificial life, cultural learning, artificial endocrine networks, social reasoning, artificial hormone networks, computational intelligence for the IoT and Smart-X technologies.