Seungwan Park , Taewoong Ryu , Doyoon Kim , Doyoung Kim , Hanju Kim , Myungha Cho , Unil Yun
{"title":"基于滑动窗口的高效用占用模式数据流挖掘","authors":"Seungwan Park , Taewoong Ryu , Doyoon Kim , Doyoung Kim , Hanju Kim , Myungha Cho , Unil Yun","doi":"10.1016/j.ins.2025.122243","DOIUrl":null,"url":null,"abstract":"<div><div>High utility-based pattern mining has been proposed to analyze information by considering not only the frequency of items but also their quantity and profit. Among these, studies on high utility occupancy-based patterns have emerged, which consider the occupancy measure reflecting the share of a pattern belonging to transactions. Furthermore, as the necessity to process real-time stream data has become more critical, a method to discover high utility occupancy-based patterns in stream information has been presented recently. However, this recent method handles all accumulated data on data stream environments. Since all previously accumulated data are processed, the volume of data to be processed steadily increases over time, leading to a decline in efficiency over time. In addition, it becomes difficult to give emphasis on recent data. Consequently, these methods become less suitable for practical applications. To surmount the drawbacks, we introduce a novel approach for mining high utility occupancy patterns, employing a sliding window technique to efficiently process stream data. By focusing on fixed-size, most recent data within the window, our method effectively reflects the trends in the latest data while exhibiting improved efficiency compared to previous approaches. Extensive performance evaluations demonstrate the efficacy of the proposed method against prior methods regarding runtime, memory usage, scalability, and sensitivity. Moreover, statistical tests confirm that our approach accurately extracts the exact number of patterns without pattern loss or duplication.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"716 ","pages":"Article 122243"},"PeriodicalIF":8.1000,"publicationDate":"2025-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Sliding window-based high utility occupancy pattern mining for data streams\",\"authors\":\"Seungwan Park , Taewoong Ryu , Doyoon Kim , Doyoung Kim , Hanju Kim , Myungha Cho , Unil Yun\",\"doi\":\"10.1016/j.ins.2025.122243\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>High utility-based pattern mining has been proposed to analyze information by considering not only the frequency of items but also their quantity and profit. Among these, studies on high utility occupancy-based patterns have emerged, which consider the occupancy measure reflecting the share of a pattern belonging to transactions. Furthermore, as the necessity to process real-time stream data has become more critical, a method to discover high utility occupancy-based patterns in stream information has been presented recently. However, this recent method handles all accumulated data on data stream environments. Since all previously accumulated data are processed, the volume of data to be processed steadily increases over time, leading to a decline in efficiency over time. In addition, it becomes difficult to give emphasis on recent data. Consequently, these methods become less suitable for practical applications. To surmount the drawbacks, we introduce a novel approach for mining high utility occupancy patterns, employing a sliding window technique to efficiently process stream data. By focusing on fixed-size, most recent data within the window, our method effectively reflects the trends in the latest data while exhibiting improved efficiency compared to previous approaches. Extensive performance evaluations demonstrate the efficacy of the proposed method against prior methods regarding runtime, memory usage, scalability, and sensitivity. Moreover, statistical tests confirm that our approach accurately extracts the exact number of patterns without pattern loss or duplication.</div></div>\",\"PeriodicalId\":51063,\"journal\":{\"name\":\"Information Sciences\",\"volume\":\"716 \",\"pages\":\"Article 122243\"},\"PeriodicalIF\":8.1000,\"publicationDate\":\"2025-04-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Information Sciences\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0020025525003755\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"0\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Sciences","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0020025525003755","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
Sliding window-based high utility occupancy pattern mining for data streams
High utility-based pattern mining has been proposed to analyze information by considering not only the frequency of items but also their quantity and profit. Among these, studies on high utility occupancy-based patterns have emerged, which consider the occupancy measure reflecting the share of a pattern belonging to transactions. Furthermore, as the necessity to process real-time stream data has become more critical, a method to discover high utility occupancy-based patterns in stream information has been presented recently. However, this recent method handles all accumulated data on data stream environments. Since all previously accumulated data are processed, the volume of data to be processed steadily increases over time, leading to a decline in efficiency over time. In addition, it becomes difficult to give emphasis on recent data. Consequently, these methods become less suitable for practical applications. To surmount the drawbacks, we introduce a novel approach for mining high utility occupancy patterns, employing a sliding window technique to efficiently process stream data. By focusing on fixed-size, most recent data within the window, our method effectively reflects the trends in the latest data while exhibiting improved efficiency compared to previous approaches. Extensive performance evaluations demonstrate the efficacy of the proposed method against prior methods regarding runtime, memory usage, scalability, and sensitivity. Moreover, statistical tests confirm that our approach accurately extracts the exact number of patterns without pattern loss or duplication.
期刊介绍:
Informatics and Computer Science Intelligent Systems Applications is an esteemed international journal that focuses on publishing original and creative research findings in the field of information sciences. We also feature a limited number of timely tutorial and surveying contributions.
Our journal aims to cater to a diverse audience, including researchers, developers, managers, strategic planners, graduate students, and anyone interested in staying up-to-date with cutting-edge research in information science, knowledge engineering, and intelligent systems. While readers are expected to share a common interest in information science, they come from varying backgrounds such as engineering, mathematics, statistics, physics, computer science, cell biology, molecular biology, management science, cognitive science, neurobiology, behavioral sciences, and biochemistry.