Yong Wang , Huadong Zhou , Gui-Fu Lu , Cuiyun Gao , Shuai Meng
{"title":"Improving user-oriented fairness in recommendation via data augmentation: Don’t worry about inactive users","authors":"Yong Wang , Huadong Zhou , Gui-Fu Lu , Cuiyun Gao , Shuai Meng","doi":"10.1016/j.jss.2025.112387","DOIUrl":null,"url":null,"abstract":"<div><div>A recommendation system is considered unfair when it does not perform equally well for different user groups according to users’ specific attributes. In recent research, the user groups are divided into active user group and inactive user group according to the number of interaction records in a recommendation system. Intuitively, increasing the number of inactive users’ interaction records would improve the fairness of the recommendation system. Existing data augmentation techniques can increase interaction records, however they usually fail to deeply mine user interaction patterns and fail to generate context-related feedback, which cannot effectively improve the quality of recommendations for inactive users. To resolve the problem, we use the Large Language Models (LLMs) to mine user historical interaction records to achieve data augmentation, which improve the quality of recommendations for inactive user groups. Experimental results on four classic baseline recommendation algorithms show that our data augmentation method for the inactive user group can effectively alleviate the poor recommendation quality caused by the low interaction with the recommendation system, reduce the recommendation quality gap with active user group, and further improve the user group fairness of the recommendation system.</div></div>","PeriodicalId":51099,"journal":{"name":"Journal of Systems and Software","volume":"225 ","pages":"Article 112387"},"PeriodicalIF":3.7000,"publicationDate":"2025-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Systems and Software","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S016412122500055X","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0
Abstract
A recommendation system is considered unfair when it does not perform equally well for different user groups according to users’ specific attributes. In recent research, the user groups are divided into active user group and inactive user group according to the number of interaction records in a recommendation system. Intuitively, increasing the number of inactive users’ interaction records would improve the fairness of the recommendation system. Existing data augmentation techniques can increase interaction records, however they usually fail to deeply mine user interaction patterns and fail to generate context-related feedback, which cannot effectively improve the quality of recommendations for inactive users. To resolve the problem, we use the Large Language Models (LLMs) to mine user historical interaction records to achieve data augmentation, which improve the quality of recommendations for inactive user groups. Experimental results on four classic baseline recommendation algorithms show that our data augmentation method for the inactive user group can effectively alleviate the poor recommendation quality caused by the low interaction with the recommendation system, reduce the recommendation quality gap with active user group, and further improve the user group fairness of the recommendation system.
期刊介绍:
The Journal of Systems and Software publishes papers covering all aspects of software engineering and related hardware-software-systems issues. All articles should include a validation of the idea presented, e.g. through case studies, experiments, or systematic comparisons with other approaches already in practice. Topics of interest include, but are not limited to:
•Methods and tools for, and empirical studies on, software requirements, design, architecture, verification and validation, maintenance and evolution
•Agile, model-driven, service-oriented, open source and global software development
•Approaches for mobile, multiprocessing, real-time, distributed, cloud-based, dependable and virtualized systems
•Human factors and management concerns of software development
•Data management and big data issues of software systems
•Metrics and evaluation, data mining of software development resources
•Business and economic aspects of software development processes
The journal welcomes state-of-the-art surveys and reports of practical experience for all of these topics.