Rapid attribute and scale selection with adaptive three-way sampling and neighborhood rough set

IF 6.6 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Shujiao Liao , Nan Zhou , Ling Wei , Weiping Ding , Yidong Lin
{"title":"Rapid attribute and scale selection with adaptive three-way sampling and neighborhood rough set","authors":"Shujiao Liao ,&nbsp;Nan Zhou ,&nbsp;Ling Wei ,&nbsp;Weiping Ding ,&nbsp;Yidong Lin","doi":"10.1016/j.asoc.2025.113966","DOIUrl":null,"url":null,"abstract":"<div><div>As a key issue in knowledge reduction for multi-scale data, attribute and scale selection has attracted increasing attention in recent years. However, with the rapid growth of data volumes, most existing methods are inefficient for large-scale multi-scale data and struggle to effectively handle heterogeneous multi-scale data, significantly limiting their practical applications. To address this situation, this paper proposes a rapid attribute and scale selection method to deal with large nominal-and-numerical mixed multi-scale decision systems (NN-MDSs). First, the theory and algorithm of adaptive incremental support vector data description (AISVDD) are presented. The AISVDD algorithm overcomes the limitations of traditional support vector data description methods and efficiently processes both class-balanced and class-imbalanced data. By using the algorithm, support vectors can be quickly obtained from large data. Next, an adaptive three-way sampling technique is derived by combining AISVDD and three-way decision. With this technique, support vectors are extracted as sampling results and put into the boundary region, and outliers are seen as noise and put into the negative region. This significantly reduces the data size and improves the data quality. Then, a neighborhood rough set model is built to describe NN-MDSs. Multiple concepts and properties are discussed in the model. Finally, a heuristic attribute and scale selection algorithm is designed to simultaneously choose attributes and scales from the sampled NN-MDS. Detailed experiments demonstrate the effectiveness and superiority of the proposed method. The method performs better than state-of-the-art attribute and scale selection methods on both computational efficiency and classification performance under six benchmark classifiers. It is powerful in handling large NN-MDSs with complex characteristics. This work provides new insights into the complex multi-scale data processing.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"185 ","pages":"Article 113966"},"PeriodicalIF":6.6000,"publicationDate":"2025-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1568494625012797","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

As a key issue in knowledge reduction for multi-scale data, attribute and scale selection has attracted increasing attention in recent years. However, with the rapid growth of data volumes, most existing methods are inefficient for large-scale multi-scale data and struggle to effectively handle heterogeneous multi-scale data, significantly limiting their practical applications. To address this situation, this paper proposes a rapid attribute and scale selection method to deal with large nominal-and-numerical mixed multi-scale decision systems (NN-MDSs). First, the theory and algorithm of adaptive incremental support vector data description (AISVDD) are presented. The AISVDD algorithm overcomes the limitations of traditional support vector data description methods and efficiently processes both class-balanced and class-imbalanced data. By using the algorithm, support vectors can be quickly obtained from large data. Next, an adaptive three-way sampling technique is derived by combining AISVDD and three-way decision. With this technique, support vectors are extracted as sampling results and put into the boundary region, and outliers are seen as noise and put into the negative region. This significantly reduces the data size and improves the data quality. Then, a neighborhood rough set model is built to describe NN-MDSs. Multiple concepts and properties are discussed in the model. Finally, a heuristic attribute and scale selection algorithm is designed to simultaneously choose attributes and scales from the sampled NN-MDS. Detailed experiments demonstrate the effectiveness and superiority of the proposed method. The method performs better than state-of-the-art attribute and scale selection methods on both computational efficiency and classification performance under six benchmark classifiers. It is powerful in handling large NN-MDSs with complex characteristics. This work provides new insights into the complex multi-scale data processing.
基于自适应三向采样和邻域粗糙集的快速属性和尺度选择
属性和尺度选择作为多尺度数据知识约简的关键问题,近年来受到越来越多的关注。然而,随着数据量的快速增长,大多数现有方法对大规模多尺度数据的处理效率低下,难以有效处理异构多尺度数据,极大地限制了它们的实际应用。针对这种情况,本文提出了一种快速属性和尺度选择方法来处理大型标称和数值混合多尺度决策系统(nn - mds)。首先,介绍了自适应增量支持向量数据描述(AISVDD)的理论和算法。AISVDD算法克服了传统支持向量数据描述方法的局限性,能够有效地处理类平衡和类不平衡数据。该算法可以快速地从大数据中获取支持向量。然后,将AISVDD和三向决策相结合,推导出一种自适应三向采样技术。该技术将提取支持向量作为采样结果放入边界区域,将离群值视为噪声放入负区域。这大大减少了数据大小并提高了数据质量。然后,建立邻域粗糙集模型来描述nn - mds。模型中讨论了多个概念和属性。最后,设计了一种启发式属性和尺度选择算法,从采样的NN-MDS中同时选择属性和尺度。详细的实验证明了该方法的有效性和优越性。在6个基准分类器下,该方法在计算效率和分类性能上都优于最先进的属性和尺度选择方法。它在处理具有复杂特征的大型神经网络mds方面功能强大。这项工作为复杂的多尺度数据处理提供了新的见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Applied Soft Computing
Applied Soft Computing 工程技术-计算机:跨学科应用
CiteScore
15.80
自引率
6.90%
发文量
874
审稿时长
10.9 months
期刊介绍: Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities. Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信