ARFIS：用于重尾分布回归的自适应稳健模型

IF 8.1 1区计算机科学 N/A COMPUTER SCIENCE, INFORMATION SYSTEMS

Information Sciences Pub Date : 2024-08-20 DOI:10.1016/j.ins.2024.121344

{"title":"ARFIS：用于重尾分布回归的自适应稳健模型","authors":"","doi":"10.1016/j.ins.2024.121344","DOIUrl":null,"url":null,"abstract":"<div><p>As heavy-tailed distributions are ubiquitous in many real applications, robust regression has been extensively applied in machine learning and exhibits the superiority in deal with heavy-tailed distribution. Current existing robust methods for regression are based on independence assumption of features and ignore interaction between them, which may lead to poor generalization due to most datasets unsatisfying the assumption. Actually, interaction features formed by the composition of two or more features are particularly helpful to obtain the high-order information in many applications. In this paper, we propose a novel adaptive robust feature interaction selection model for regression with heavy-tailed distributions, termed Adaptive Robust Feature Interaction Selection (ARFIS). Firstly, we consider pairwise feature interaction by augmenting a feature vector with product of features for regression with heavy tailed distribution. Secondly, we propose feature interaction selection models based on quantile loss with different regularizers to learn parameters. The consistency of the proposed model ARFIS is theoretically proven, and an efficient algorithm is presented for solving proposed model. Finally, experimental results on simulation data, UCI datasets and a real-world dataset validate good accuracy, interpretability and robustness of our proposed models.</p></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":null,"pages":null},"PeriodicalIF":8.1000,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"ARFIS: An adaptive robust model for regression with heavy-tailed distribution\",\"authors\":\"\",\"doi\":\"10.1016/j.ins.2024.121344\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>As heavy-tailed distributions are ubiquitous in many real applications, robust regression has been extensively applied in machine learning and exhibits the superiority in deal with heavy-tailed distribution. Current existing robust methods for regression are based on independence assumption of features and ignore interaction between them, which may lead to poor generalization due to most datasets unsatisfying the assumption. Actually, interaction features formed by the composition of two or more features are particularly helpful to obtain the high-order information in many applications. In this paper, we propose a novel adaptive robust feature interaction selection model for regression with heavy-tailed distributions, termed Adaptive Robust Feature Interaction Selection (ARFIS). Firstly, we consider pairwise feature interaction by augmenting a feature vector with product of features for regression with heavy tailed distribution. Secondly, we propose feature interaction selection models based on quantile loss with different regularizers to learn parameters. The consistency of the proposed model ARFIS is theoretically proven, and an efficient algorithm is presented for solving proposed model. Finally, experimental results on simulation data, UCI datasets and a real-world dataset validate good accuracy, interpretability and robustness of our proposed models.</p></div>\",\"PeriodicalId\":51063,\"journal\":{\"name\":\"Information Sciences\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":8.1000,\"publicationDate\":\"2024-08-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Information Sciences\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0020025524012581\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"N/A\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Sciences","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0020025524012581","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"N/A","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

由于重尾分布在许多实际应用中无处不在，稳健回归已被广泛应用于机器学习中，并在处理重尾分布时表现出优越性。目前现有的鲁棒回归方法都是基于特征的独立性假设，忽略了特征之间的交互性，这可能会导致大多数数据集无法满足该假设，从而导致泛化效果不佳。实际上，由两个或多个特征组成的交互特征在很多应用中特别有助于获取高阶信息。本文提出了一种用于重尾分布回归的新型自适应鲁棒特征交互选择模型，称为自适应鲁棒特征交互选择（ARFIS）。首先，我们考虑了成对特征交互，即在重尾分布回归中用特征乘积增强特征向量。其次，我们提出了基于量化损失的特征交互选择模型，并使用不同的正则化器来学习参数。从理论上证明了所提模型 ARFIS 的一致性，并提出了求解所提模型的高效算法。最后，在仿真数据、UCI 数据集和现实世界数据集上的实验结果验证了我们提出的模型具有良好的准确性、可解释性和鲁棒性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

ARFIS: An adaptive robust model for regression with heavy-tailed distribution

As heavy-tailed distributions are ubiquitous in many real applications, robust regression has been extensively applied in machine learning and exhibits the superiority in deal with heavy-tailed distribution. Current existing robust methods for regression are based on independence assumption of features and ignore interaction between them, which may lead to poor generalization due to most datasets unsatisfying the assumption. Actually, interaction features formed by the composition of two or more features are particularly helpful to obtain the high-order information in many applications. In this paper, we propose a novel adaptive robust feature interaction selection model for regression with heavy-tailed distributions, termed Adaptive Robust Feature Interaction Selection (ARFIS). Firstly, we consider pairwise feature interaction by augmenting a feature vector with product of features for regression with heavy tailed distribution. Secondly, we propose feature interaction selection models based on quantile loss with different regularizers to learn parameters. The consistency of the proposed model ARFIS is theoretically proven, and an efficient algorithm is presented for solving proposed model. Finally, experimental results on simulation data, UCI datasets and a real-world dataset validate good accuracy, interpretability and robustness of our proposed models.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Information Sciences 工程技术-计算机：信息系统

CiteScore

14.00

自引率

17.30%

发文量

1322

审稿时长

10.4 months

期刊介绍： Informatics and Computer Science Intelligent Systems Applications is an esteemed international journal that focuses on publishing original and creative research findings in the field of information sciences. We also feature a limited number of timely tutorial and surveying contributions. Our journal aims to cater to a diverse audience, including researchers, developers, managers, strategic planners, graduate students, and anyone interested in staying up-to-date with cutting-edge research in information science, knowledge engineering, and intelligent systems. While readers are expected to share a common interest in information science, they come from varying backgrounds such as engineering, mathematics, statistics, physics, computer science, cell biology, molecular biology, management science, cognitive science, neurobiology, behavioral sciences, and biochemistry.