Soft ComputingPub Date : 2024-07-25DOI: 10.1007/s00500-024-09931-5
Salim Rezvani, Farhad Pourpanah, Chee Peng Lim, Q. M. Jonathan Wu
{"title":"Methods for class-imbalanced learning with support vector machines: a review and an empirical evaluation","authors":"Salim Rezvani, Farhad Pourpanah, Chee Peng Lim, Q. M. Jonathan Wu","doi":"10.1007/s00500-024-09931-5","DOIUrl":"https://doi.org/10.1007/s00500-024-09931-5","url":null,"abstract":"<p>This paper presents a review on methods for class-imbalanced learning with the Support Vector Machine (SVM) and its variants. We first explain the structure of SVM and its variants and discuss their inefficiency in learning with class-imbalanced data sets. We introduce a hierarchical categorization of SVM-based models with respect to class-imbalanced learning. Specifically, we categorize SVM-based models into re-sampling, algorithmic, and fusion methods, and discuss the principles of the representative models in each category. In addition, we conduct a series of empirical evaluations to compare the performances of various representative SVM-based models in each category using benchmark imbalanced data sets, ranging from low to high imbalanced ratios. Our findings reveal that while algorithmic methods are less time-consuming owing to no data pre-processing requirements, fusion methods, which combine both re-sampling and algorithmic approaches, generally perform the best, but with a higher computational load. A discussion on research gaps and future research directions is provided.</p>","PeriodicalId":22039,"journal":{"name":"Soft Computing","volume":"16 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141785936","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Soft ComputingPub Date : 2024-07-24DOI: 10.1007/s00500-024-09910-w
Tran Thanh Dai, Nguyen Long Giang, Vu Duc Thi, Tran Thi Ngan, Hoang Thi Minh Chau, Le Hoang Son
{"title":"A new approach for attribute reduction from decision table based on intuitionistic fuzzy topology","authors":"Tran Thanh Dai, Nguyen Long Giang, Vu Duc Thi, Tran Thi Ngan, Hoang Thi Minh Chau, Le Hoang Son","doi":"10.1007/s00500-024-09910-w","DOIUrl":"https://doi.org/10.1007/s00500-024-09910-w","url":null,"abstract":"<p>Most of the current attribute reduction methods use the measure to define the reduct, such as the positive region of rough set theory (RS), information entropy, and distance. However, the size of the reduct based on the measures is still limited. To cope with this problem, we propose a new approach of attribute reduction based on using the intuitionistic fuzzy topology (IFT). Firstly, a new IFT structure based on the pre-order relation and the intuitionistic fuzzy base (IF-base) structure is introduced. Secondly, a new measure is proposed to evaluate the significance of the attribute based on the IF subbase. Finally, the new reduction algorithms based on the IF-base filter and filter-wrapper methods are presented. The theoretical and experimental results show that the proposed method is efficient in terms of size and accuracy of the reduct. Specifically, the reduct of the F_IFT algorithm has an average size of 50% smaller, and the FW_IFT algorithm has an average accuracy of 10% greater than those of the related algorithms. Significantly, the algorithm FW_IFT can very remove noisy attributes. The classification accuracy of the reduct is 15% higher than that of the original set of attributes.</p>","PeriodicalId":22039,"journal":{"name":"Soft Computing","volume":"63 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141779221","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Soft ComputingPub Date : 2024-07-24DOI: 10.1007/s00500-024-09847-0
Banghee So, Emiliano A. Valdez
{"title":"SAMME.C2 algorithm for imbalanced multi-class classification","authors":"Banghee So, Emiliano A. Valdez","doi":"10.1007/s00500-024-09847-0","DOIUrl":"https://doi.org/10.1007/s00500-024-09847-0","url":null,"abstract":"<p>Classification predictive modeling involves the accurate assignment of observations in a dataset to target classes or categories. Real-world classification problems with severely imbalanced class distributions have increased substantially in recent years. In such cases, significantly fewer observations are available for minority classes to learn from than for majority classes. Despite this sparsity, the minority class is often considered as the more interesting class, yet the development of a scientific learning algorithm that is suitable for these observations presents numerous challenges. In this study, we further explore the merits of an effective multi-class classification algorithm known as <span>SAMME.C2</span> that is specialized for handling severely imbalanced classes. This innovative method blends the flexible mechanics of the boosting techniques from the <span>SAMME</span> algorithm, which is a multi-class classifier, and the <span>Ada.C2</span> algorithm, which is a cost-sensitive binary classifier that is designed to address highly imbalanced classes. We establish a scientific and statistical formulation of the <span>SAMME.C2</span> algorithm, together with providing and explaining the resulting procedure. We demonstrate the consistently superior performance of this algorithm through numerical experiments as well as empirical studies.</p>","PeriodicalId":22039,"journal":{"name":"Soft Computing","volume":"51 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141785950","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A multi-objective optimization approach for overlapping dynamic community detection","authors":"Sondos Bahadori, Mansooreh Mirzaie, Maryam Nooraei Abadeh","doi":"10.1007/s00500-024-09895-6","DOIUrl":"https://doi.org/10.1007/s00500-024-09895-6","url":null,"abstract":"<p>Community detection is a valuable tool for studying the function and dynamic structure of most real-world networks. Existing techniques either concentrate on the network's topological structure or node properties without adequately addressing the dynamic aspect. As a result, in this research, we present a unique technique called Multi-Objective Optimization Overlapping Dynamic Community Detection (MOOODCD) that leverages both the topological structure and node attributes of dynamic networks. By incorporating the Dirichlet distribution to control network dynamics, we formulate dynamic community detection as a non-negative matrix factorization problem. The block coordinate ascent method is used to estimate the latent elements of the model. Our experiments on artificial and real networks indicate that MOOODCD detects overlapping communities in dynamic networks with acceptable precision and scalability.</p>","PeriodicalId":22039,"journal":{"name":"Soft Computing","volume":"73 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141785954","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Soft ComputingPub Date : 2024-07-24DOI: 10.1007/s00500-024-09892-9
Kadir Karakaya
{"title":"Inference on process capability index $$S_{pmk}$$ for a new lifetime distribution","authors":"Kadir Karakaya","doi":"10.1007/s00500-024-09892-9","DOIUrl":"https://doi.org/10.1007/s00500-024-09892-9","url":null,"abstract":"<p>In various applied disciplines, the modeling of continuous data often requires the use of flexible continuous distributions. Meeting this demand calls for the introduction of new continuous distributions that possess desirable characteristics. This paper introduces a new continuous distribution. Several estimators for estimating the unknown parameters of the new distribution are discussed and their efficiency is assessed through Monte Carlo simulations. Furthermore, the process capability index <span>(S_{pmk})</span> is examined when the underlying distribution is the proposed distribution. The maximum likelihood estimation of the <span>(S_{pmk})</span> is also studied. The asymptotic confidence interval is also constructed for <span>(S_{pmk})</span>. The simulation results indicate that estimators for both the unknown parameters of the new distribution and the <span>(S_{pmk})</span> provide reasonable results. Some practical analyses are also performed on both the new distribution and the <span>(S_{pmk})</span>. The results of the conducted data analysis indicate that the new distribution yields effective outcomes in modeling lifetime data in the literature. Similarly, the data analyses performed for <span>(S_{pmk})</span> illustrate that the new distribution can be utilized for process capability indices by quality controllers.</p>","PeriodicalId":22039,"journal":{"name":"Soft Computing","volume":"821 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141779090","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Soft ComputingPub Date : 2024-07-24DOI: 10.1007/s00500-024-09950-2
Francisco Javier Maldonado Carrascosa, Doraid Seddiki, Antonio Jiménez Sánchez, Sebastián García Galán, Manuel Valverde Ibáñez, Adam Marchewka
{"title":"Multi-objective optimization of virtual machine migration among cloud data centers","authors":"Francisco Javier Maldonado Carrascosa, Doraid Seddiki, Antonio Jiménez Sánchez, Sebastián García Galán, Manuel Valverde Ibáñez, Adam Marchewka","doi":"10.1007/s00500-024-09950-2","DOIUrl":"https://doi.org/10.1007/s00500-024-09950-2","url":null,"abstract":"<p>Workload migration among cloud data centers is currently an evolving task that requires substantial advancements. The incorporation of fuzzy systems holds potential for enhancing performance and efficiency within cloud computing. This study addresses a multi-objective problem wherein the goal is to maximize the interpretability and the percentage of renewable energy consumed by a fuzzy meta-scheduler system in cloud scenarios. To accomplish this objective, the present research proposes a novel approach utilizing a multi-objective Knowledge Acquisition with a Swarm Intelligence Approach algorithm. Additionally, it takes advantage of a framework built on CloudSim, which includes virtual machine migration capabilities based on an expert system. Furthermore, a hierarchical fuzzy system is employed to assess rule base interpretability, along with another multi-objective algorithm, named Non-dominated Sorting Genetic Algorithm II. The framework and hierarchical system are employed to perform various simulation results concerning renewable energy and interpretability, while the algorithms aim to enhance the system’s performance and interpretability. Empirical results demonstrate that it is possible to improve the performance of cloud data centers while improving the interpretability of the corresponding fuzzy rule-based system. The proposed multi-objective algorithm shows comparable or superior performance to the genetic algorithm across diverse scenarios. The simulation results indicate that improvements in cloud data center performance can be achieved while enhancing system interpretability. The average improvement in the interpretability index ranges from 0.6 to 6%, with a corresponding increase in renewable energy utilization ranging from 5 to 6%.</p>","PeriodicalId":22039,"journal":{"name":"Soft Computing","volume":"43 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141779223","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Eno classification and regression neural networks for numerical approximation of discontinuous flow problems","authors":"Vikas Kumar Jayswal, Prashant Kumar Pandey, Ritesh Kumar Dubey","doi":"10.1007/s00500-024-09944-0","DOIUrl":"https://doi.org/10.1007/s00500-024-09944-0","url":null,"abstract":"<p>Learning high order non-oscillatory polynomial approximation procedures which form the backbone of high order numerical solution of partial differential equations is challenging. The major issue is to pose these procedures as a learning problem and generate suitable synthetic data set which suffice learning it with small neural networks. In this work, we pose an arc-length based essentially non-oscillatory (ENOL) reconstruction algorithm as machine learning problem. A novel way to construct the synthetic data using ENOL algorithm along with basic smooth and piece-wise continuous functions is given. Small vanilla regression and classification neural networks are trained to learn third order (ENOL) polynomial approximation procedure. The metric of trained ENO classification and regression networks is presented and commented. These trained models are implemented in numerical solver to compute the solution of test problems of hyperbolic conservation laws. The presented numerical results show that ENO classification network gives results comparable to the exact ENOL reconstruction whereas ENO regression network performs poorly both in terms of convergence and resolving the discontinuities.</p>","PeriodicalId":22039,"journal":{"name":"Soft Computing","volume":"43 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141785948","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Soft ComputingPub Date : 2024-07-24DOI: 10.1007/s00500-024-09925-3
Saumya Ranjan Jena, Archana Senapati
{"title":"Explicit and implicit numerical investigations of one-dimensional heat equation based on spline collocation and Thomas algorithm","authors":"Saumya Ranjan Jena, Archana Senapati","doi":"10.1007/s00500-024-09925-3","DOIUrl":"https://doi.org/10.1007/s00500-024-09925-3","url":null,"abstract":"<p>This study uses the cubic spline method to solve the one-dimensional (1D) (one spatial and one temporal dimension) heat problem (a parametric linear partial differential equation) numerically using both explicit and implicit strategies. The set of simultaneous equations acquired in both the explicit and implicit method may be solved using the Thomas algorithm from the tridiagonal dominating matrix, and the spline offers a continuous solution. The results are implemented with very fine meshes and with relatively small-time steps. Using mesh refinement, it was possible to find better temperature distribution in the thin bar. Five numerical examples are used to support the efficiency and accuracy of the current scheme. The findings are also compared with analytical results and other results in terms of error and error norms <span>({L}_{2})</span> and <span>({L}_{infty })</span>. The Von-Neuman technique is used to analyse stability. The truncation error of both systems is calculated and determined to have a convergence of order <span>(Oleft( {h + Delta t^{2} } right).)</span></p>","PeriodicalId":22039,"journal":{"name":"Soft Computing","volume":"178 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141785949","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Soft ComputingPub Date : 2024-07-24DOI: 10.1007/s00500-024-09893-8
Peixin Huang, Guo Zhou, Yongquan Zhou, Qifang Luo
{"title":"Interval-based multi-objective metaheuristic honey badger algorithm","authors":"Peixin Huang, Guo Zhou, Yongquan Zhou, Qifang Luo","doi":"10.1007/s00500-024-09893-8","DOIUrl":"https://doi.org/10.1007/s00500-024-09893-8","url":null,"abstract":"<p>Optimization problem involving interval parameters and multiple conflicting objectives are called multi-objective optimization problems with interval parameters (IMOPs), which are common and hard to be solved effectively in practical applications. An interval multi-objective honey badger algorithm (IMOHBA) is proposed to address the IMOPs in this paper. Firstly, the <span>(mu)</span> metric is employed to assess the Pareto dominance relationship among interval individuals, which reflects the quality of the optimal solutions. Secondly, the crowding distance suitable for the interval objective is utilized to reflect the distribution of the optimal solution. Finally, the candidate solutions are ranked and selected by the non-dominated sorting method. To validate the performance of IMOHBA, it is tested on 19 benchmark IMOPs as well as an interval multi-objective scheduling problem for underwater wireless sensor networks and compared with three state-of-the-art algorithms. The experimental results demonstrate the superiority and strong competitiveness of IMOHBA in addressing IMOPs, exhibiting improved convergence and broader exploration capabilities of the solution space. These findings further validate the effectiveness and feasibility of IMOHBA, highlighting its unique advantage in solving IMOPs.</p>","PeriodicalId":22039,"journal":{"name":"Soft Computing","volume":"1 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141785952","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Soft ComputingPub Date : 2024-07-24DOI: 10.1007/s00500-024-09813-w
Xin Tian, Jianhua Hu, Yan Song, Guoliang Wei
{"title":"An improved particle swarm optimization algorithm with distributed time-delays of evolved acceleration coefficients and adaptive weights","authors":"Xin Tian, Jianhua Hu, Yan Song, Guoliang Wei","doi":"10.1007/s00500-024-09813-w","DOIUrl":"https://doi.org/10.1007/s00500-024-09813-w","url":null,"abstract":"<p>Particle swarm optimization (PSO) is a classical computational method that optimizes a problem by iteratively trying to find the optimal solution. It still suffers somes defects such as poor local search ability, low search accuracy and premature convergence, especially in high-dimensional complex problems. In order to address these issues, this paper has proposed a novel PSO algorithm with distributed delays of adaptive weights and evolved acceleration coefficients (PSO-DWC). The main idea of the proposed improved PSO algorithm is three-fold: (1) a mechanism is introduced to evaluate the current evolutionary state by evolutionary factors of the swarm and to predict the next state by a probability transition matrix; (2) distributed time-varying time-delays are added into the velocity updated model; (3) adaptive inertia weight varies according to evolutionary factors, which describes the population distribution information; and newly-introduced evolved acceleration coefficients are determined by the predict next evolutionary state of the swarm. Owing to the promising issues mentioned above, the PSO-DWC algorithm has the advantages of keeping the diversity of particles, balancing the local and global search abilities and reaching to an acceptable solution. Experiments on twenty well-known benchmark functions have demonstrated that the proposed PSO-DWC algorithm has a superior performance over other five well-known PSO algorithms in high dimensional search space. Statistical significance tests verify the superiority of the new algorithm. Therefore it can be concluded that the novel PSO-DWC algorithm is able to solve the optimization problems with powerful global search and efficient convergence.</p>","PeriodicalId":22039,"journal":{"name":"Soft Computing","volume":"42 1","pages":""},"PeriodicalIF":4.1,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141779087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}