基于自适应三向采样和邻域粗糙集的快速属性和尺度选择

IF 6.6 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Applied Soft Computing Pub Date : 2025-09-27 DOI:10.1016/j.asoc.2025.113966

Shujiao Liao , Nan Zhou , Ling Wei , Weiping Ding , Yidong Lin

{"title":"基于自适应三向采样和邻域粗糙集的快速属性和尺度选择","authors":"Shujiao Liao , Nan Zhou , Ling Wei , Weiping Ding , Yidong Lin","doi":"10.1016/j.asoc.2025.113966","DOIUrl":null,"url":null,"abstract":"<div><div>As a key issue in knowledge reduction for multi-scale data, attribute and scale selection has attracted increasing attention in recent years. However, with the rapid growth of data volumes, most existing methods are inefficient for large-scale multi-scale data and struggle to effectively handle heterogeneous multi-scale data, significantly limiting their practical applications. To address this situation, this paper proposes a rapid attribute and scale selection method to deal with large nominal-and-numerical mixed multi-scale decision systems (NN-MDSs). First, the theory and algorithm of adaptive incremental support vector data description (AISVDD) are presented. The AISVDD algorithm overcomes the limitations of traditional support vector data description methods and efficiently processes both class-balanced and class-imbalanced data. By using the algorithm, support vectors can be quickly obtained from large data. Next, an adaptive three-way sampling technique is derived by combining AISVDD and three-way decision. With this technique, support vectors are extracted as sampling results and put into the boundary region, and outliers are seen as noise and put into the negative region. This significantly reduces the data size and improves the data quality. Then, a neighborhood rough set model is built to describe NN-MDSs. Multiple concepts and properties are discussed in the model. Finally, a heuristic attribute and scale selection algorithm is designed to simultaneously choose attributes and scales from the sampled NN-MDS. Detailed experiments demonstrate the effectiveness and superiority of the proposed method. The method performs better than state-of-the-art attribute and scale selection methods on both computational efficiency and classification performance under six benchmark classifiers. It is powerful in handling large NN-MDSs with complex characteristics. This work provides new insights into the complex multi-scale data processing.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"185 ","pages":"Article 113966"},"PeriodicalIF":6.6000,"publicationDate":"2025-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Rapid attribute and scale selection with adaptive three-way sampling and neighborhood rough set\",\"authors\":\"Shujiao Liao , Nan Zhou , Ling Wei , Weiping Ding , Yidong Lin\",\"doi\":\"10.1016/j.asoc.2025.113966\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>As a key issue in knowledge reduction for multi-scale data, attribute and scale selection has attracted increasing attention in recent years. However, with the rapid growth of data volumes, most existing methods are inefficient for large-scale multi-scale data and struggle to effectively handle heterogeneous multi-scale data, significantly limiting their practical applications. To address this situation, this paper proposes a rapid attribute and scale selection method to deal with large nominal-and-numerical mixed multi-scale decision systems (NN-MDSs). First, the theory and algorithm of adaptive incremental support vector data description (AISVDD) are presented. The AISVDD algorithm overcomes the limitations of traditional support vector data description methods and efficiently processes both class-balanced and class-imbalanced data. By using the algorithm, support vectors can be quickly obtained from large data. Next, an adaptive three-way sampling technique is derived by combining AISVDD and three-way decision. With this technique, support vectors are extracted as sampling results and put into the boundary region, and outliers are seen as noise and put into the negative region. This significantly reduces the data size and improves the data quality. Then, a neighborhood rough set model is built to describe NN-MDSs. Multiple concepts and properties are discussed in the model. Finally, a heuristic attribute and scale selection algorithm is designed to simultaneously choose attributes and scales from the sampled NN-MDS. Detailed experiments demonstrate the effectiveness and superiority of the proposed method. The method performs better than state-of-the-art attribute and scale selection methods on both computational efficiency and classification performance under six benchmark classifiers. It is powerful in handling large NN-MDSs with complex characteristics. This work provides new insights into the complex multi-scale data processing.</div></div>\",\"PeriodicalId\":50737,\"journal\":{\"name\":\"Applied Soft Computing\",\"volume\":\"185 \",\"pages\":\"Article 113966\"},\"PeriodicalIF\":6.6000,\"publicationDate\":\"2025-09-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Soft Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1568494625012797\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1568494625012797","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

属性和尺度选择作为多尺度数据知识约简的关键问题，近年来受到越来越多的关注。然而，随着数据量的快速增长，大多数现有方法对大规模多尺度数据的处理效率低下，难以有效处理异构多尺度数据，极大地限制了它们的实际应用。针对这种情况，本文提出了一种快速属性和尺度选择方法来处理大型标称和数值混合多尺度决策系统（nn - mds）。首先，介绍了自适应增量支持向量数据描述（AISVDD）的理论和算法。AISVDD算法克服了传统支持向量数据描述方法的局限性，能够有效地处理类平衡和类不平衡数据。该算法可以快速地从大数据中获取支持向量。然后，将AISVDD和三向决策相结合，推导出一种自适应三向采样技术。该技术将提取支持向量作为采样结果放入边界区域，将离群值视为噪声放入负区域。这大大减少了数据大小并提高了数据质量。然后，建立邻域粗糙集模型来描述nn - mds。模型中讨论了多个概念和属性。最后，设计了一种启发式属性和尺度选择算法，从采样的NN-MDS中同时选择属性和尺度。详细的实验证明了该方法的有效性和优越性。在6个基准分类器下，该方法在计算效率和分类性能上都优于最先进的属性和尺度选择方法。它在处理具有复杂特征的大型神经网络mds方面功能强大。这项工作为复杂的多尺度数据处理提供了新的见解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Rapid attribute and scale selection with adaptive three-way sampling and neighborhood rough set

As a key issue in knowledge reduction for multi-scale data, attribute and scale selection has attracted increasing attention in recent years. However, with the rapid growth of data volumes, most existing methods are inefficient for large-scale multi-scale data and struggle to effectively handle heterogeneous multi-scale data, significantly limiting their practical applications. To address this situation, this paper proposes a rapid attribute and scale selection method to deal with large nominal-and-numerical mixed multi-scale decision systems (NN-MDSs). First, the theory and algorithm of adaptive incremental support vector data description (AISVDD) are presented. The AISVDD algorithm overcomes the limitations of traditional support vector data description methods and efficiently processes both class-balanced and class-imbalanced data. By using the algorithm, support vectors can be quickly obtained from large data. Next, an adaptive three-way sampling technique is derived by combining AISVDD and three-way decision. With this technique, support vectors are extracted as sampling results and put into the boundary region, and outliers are seen as noise and put into the negative region. This significantly reduces the data size and improves the data quality. Then, a neighborhood rough set model is built to describe NN-MDSs. Multiple concepts and properties are discussed in the model. Finally, a heuristic attribute and scale selection algorithm is designed to simultaneously choose attributes and scales from the sampled NN-MDS. Detailed experiments demonstrate the effectiveness and superiority of the proposed method. The method performs better than state-of-the-art attribute and scale selection methods on both computational efficiency and classification performance under six benchmark classifiers. It is powerful in handling large NN-MDSs with complex characteristics. This work provides new insights into the complex multi-scale data processing.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Applied Soft Computing 工程技术-计算机：跨学科应用

CiteScore

15.80

自引率

6.90%

发文量

874

审稿时长

10.9 months

期刊介绍： Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities. Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.