Samir Brahim Belhaouari , Ashhadul Islam , Khelil Kassoul , Ala Al-Fuqaha , Abdesselam Bouzerdoum
{"title":"KNNOR-Reg: A python package for oversampling in imbalanced regression","authors":"Samir Brahim Belhaouari , Ashhadul Islam , Khelil Kassoul , Ala Al-Fuqaha , Abdesselam Bouzerdoum","doi":"10.1016/j.simpa.2024.100740","DOIUrl":null,"url":null,"abstract":"<div><div>KNNOR-Reg is a Python package designed to address the challenge of imbalanced regression. While popular Python packages exist for tackling imbalanced classification, support for imbalanced regression remains limited. Imbalanced regression involves the underrepresentation of important ranges within a continuous target variable. KNNOR-Reg implements an oversampling technique that generates synthetic samples through interpolation between minority class samples and their nearest neighbors. The labels for synthetic samples are computed based on the inverse distance-weighted average of the nearest neighbors’ labels. KNNOR-Reg offers a user-friendly and extensible Python implementation for oversampling imbalanced regression data, aiming to reduce regressor bias and enhance model outcomes.</div></div>","PeriodicalId":29771,"journal":{"name":"Software Impacts","volume":"23 ","pages":"Article 100740"},"PeriodicalIF":1.3000,"publicationDate":"2025-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Software Impacts","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2665963824001283","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0
Abstract
KNNOR-Reg is a Python package designed to address the challenge of imbalanced regression. While popular Python packages exist for tackling imbalanced classification, support for imbalanced regression remains limited. Imbalanced regression involves the underrepresentation of important ranges within a continuous target variable. KNNOR-Reg implements an oversampling technique that generates synthetic samples through interpolation between minority class samples and their nearest neighbors. The labels for synthetic samples are computed based on the inverse distance-weighted average of the nearest neighbors’ labels. KNNOR-Reg offers a user-friendly and extensible Python implementation for oversampling imbalanced regression data, aiming to reduce regressor bias and enhance model outcomes.