{"title":"Mamba-DDPM-BSA: Diffusion model based boundary sampling algorithm for imbalanced classification","authors":"Fan Zhang , Quan Yuan , Xinhong Zhang","doi":"10.1016/j.eswa.2025.126926","DOIUrl":null,"url":null,"abstract":"<div><div>Data category imbalance is one of the major challenges in the field of medical image classification. This imbalance seriously affects the accuracy and reliability of the classification model, posing potential risks to doctors’ diagnosis and treatment. This paper proposes a Mamba-DDPM-BSA method to address the imbalanced classification issue of medical image. Firstly, the generative model Mamba-DDPM is designed for the synthesis of medical image samples. It utilizes Mamba’s global modeling capability and linear computational efficiency to improve the quality of generated samples by improving DDPM (Denoising Diffusion Probabilistic Model). Secondly, by oversampling training samples in boundary regions, the proposed Boundary Sampling Algorithm (BSA) enables synthesizer focuses more on decision boundary areas when fitting sample distributions. This approach generates more samples near the decision boundary, pushing the decision boundary that originally intrudes into the minority class distribution towards the true distribution. Finally, a Mamba-DDPM-BSA method is proposed, which adopts an interactive synthesis method and makes full use of diffusion generation model and Boundary Sampling Algorithm to interact with the classification model, aiming to synthesize images that target the defects of the classification model to improve the discriminative ability and robustness of the classifier. Experiments based on HAM10000 data set show that Mamba-DDPM-BSA reaches 81.03%, 82.14%, and 82.71% on Matthew’s correlation coefficient, Balanced Accuracy, and Macro F1, respectively. The proposed method is superior to the traditional imbalanced classification method.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"274 ","pages":"Article 126926"},"PeriodicalIF":7.5000,"publicationDate":"2025-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425005482","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Data category imbalance is one of the major challenges in the field of medical image classification. This imbalance seriously affects the accuracy and reliability of the classification model, posing potential risks to doctors’ diagnosis and treatment. This paper proposes a Mamba-DDPM-BSA method to address the imbalanced classification issue of medical image. Firstly, the generative model Mamba-DDPM is designed for the synthesis of medical image samples. It utilizes Mamba’s global modeling capability and linear computational efficiency to improve the quality of generated samples by improving DDPM (Denoising Diffusion Probabilistic Model). Secondly, by oversampling training samples in boundary regions, the proposed Boundary Sampling Algorithm (BSA) enables synthesizer focuses more on decision boundary areas when fitting sample distributions. This approach generates more samples near the decision boundary, pushing the decision boundary that originally intrudes into the minority class distribution towards the true distribution. Finally, a Mamba-DDPM-BSA method is proposed, which adopts an interactive synthesis method and makes full use of diffusion generation model and Boundary Sampling Algorithm to interact with the classification model, aiming to synthesize images that target the defects of the classification model to improve the discriminative ability and robustness of the classifier. Experiments based on HAM10000 data set show that Mamba-DDPM-BSA reaches 81.03%, 82.14%, and 82.71% on Matthew’s correlation coefficient, Balanced Accuracy, and Macro F1, respectively. The proposed method is superior to the traditional imbalanced classification method.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.