Abdullah Mohammadi , Jalal A. Nasiri , Sohrab Effati
{"title":"Gravitational least squares twin support vector machine based on optimal angle for class imbalance learning","authors":"Abdullah Mohammadi , Jalal A. Nasiri , Sohrab Effati","doi":"10.1016/j.amc.2025.129705","DOIUrl":null,"url":null,"abstract":"<div><div>This paper introduces the Gravitational Least Squares Twin Support Vector Machine for Class Imbalance Learning (GLSTSVM-CIL), a novel binary classification method designed to address critical limitations in existing approaches for imbalanced large-scale datasets. Traditional methods like Fuzzy TSVM and KNN-based weighting fail to simultaneously capture both global positional relationships and local density characteristics of data points. Our proposed gravitational weighting function innovatively models data samples as masses influenced by their distance from class centroids and neighborhood density, effectively prioritizing representative points while suppressing outliers. The optimization framework uniquely incorporates angular constraints between hyperplanes to enhance structural risk control and generalization capability. For scalability, we reformulate the solution into a linear system solvable via conjugate gradient methods, avoiding computationally expensive matrix inversions. Comprehensive evaluations on 92 datasets (including synthetic, noisy, medical, text, and large-scale NDC benchmarks) demonstrate GLSTSVM-CIL’s superior performance, particularly in minority-class recognition where it achieves average F1-Score improvements over baseline methods. The model maintains robust Accuracy under high noise (20 %) and extreme class imbalance (ratio 20:1) while ables to process datasets up to 50,000 samples.</div></div>","PeriodicalId":55496,"journal":{"name":"Applied Mathematics and Computation","volume":"510 ","pages":"Article 129705"},"PeriodicalIF":3.4000,"publicationDate":"2025-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Mathematics and Computation","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S009630032500431X","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATHEMATICS, APPLIED","Score":null,"Total":0}
引用次数: 0
Abstract
This paper introduces the Gravitational Least Squares Twin Support Vector Machine for Class Imbalance Learning (GLSTSVM-CIL), a novel binary classification method designed to address critical limitations in existing approaches for imbalanced large-scale datasets. Traditional methods like Fuzzy TSVM and KNN-based weighting fail to simultaneously capture both global positional relationships and local density characteristics of data points. Our proposed gravitational weighting function innovatively models data samples as masses influenced by their distance from class centroids and neighborhood density, effectively prioritizing representative points while suppressing outliers. The optimization framework uniquely incorporates angular constraints between hyperplanes to enhance structural risk control and generalization capability. For scalability, we reformulate the solution into a linear system solvable via conjugate gradient methods, avoiding computationally expensive matrix inversions. Comprehensive evaluations on 92 datasets (including synthetic, noisy, medical, text, and large-scale NDC benchmarks) demonstrate GLSTSVM-CIL’s superior performance, particularly in minority-class recognition where it achieves average F1-Score improvements over baseline methods. The model maintains robust Accuracy under high noise (20 %) and extreme class imbalance (ratio 20:1) while ables to process datasets up to 50,000 samples.
期刊介绍:
Applied Mathematics and Computation addresses work at the interface between applied mathematics, numerical computation, and applications of systems – oriented ideas to the physical, biological, social, and behavioral sciences, and emphasizes papers of a computational nature focusing on new algorithms, their analysis and numerical results.
In addition to presenting research papers, Applied Mathematics and Computation publishes review articles and single–topics issues.