Hong-Kyun Bae , Jiyeon Kim , Jongwuk Lee , Sang-Wook Kim
{"title":"DUET: Dually guided knowledge distillation from explicit feedback","authors":"Hong-Kyun Bae , Jiyeon Kim , Jongwuk Lee , Sang-Wook Kim","doi":"10.1016/j.inffus.2025.103098","DOIUrl":null,"url":null,"abstract":"<div><div>Various <em>knowledge distillation</em> (KD) methods for recommender systems have been recently introduced to achieve two goals: (i) obtaining an inference time shorter than the cumbersome model (<em>i.e.</em>, <em>teacher</em>) and (ii) providing accuracy higher than the compact model (<em>i.e.</em>, <em>student</em>). Despite their success, they solely focus on developing KD methods with <em>implicit feedback</em>. We argue that handling CF with <em>explicit feedback</em> is also crucial, representing the different degrees of user preferences. Towards this goal, we propose a novel KD framework for recommender systems, namely <em>Dually gUided knowlEdge disTillation</em> (<span>DUET</span>). We first observe that explicit feedback is interpreted as two types of user preferences, <em>i.e.</em>, <em>pre-use preference</em> and <em>post-use preference</em>. Motivated by such characteristics of explicit feedback, we aim to <em>fuse knowledge</em> from the teacher’s pre- and post-use preferences by employing <em>two teachers</em> (<em>i.e.</em>, teacher #1 and teacher #2). <em>Teacher #1</em>, trained with <em>pre-use preferences</em>, selects some items among unrated ones. <em>Teacher #2</em>, trained with <em>post-use preferences</em>, determines the <em>soft labels</em> (<em>i.e.</em>, predicted post-use preferences) of those items chosen by teacher #1. Finally, the student is trained with both the hard labels (<em>i.e.</em>, observed post-use preferences) of rated items and the soft labels (<em>i.e.</em>, predicted post-use preferences by teacher #2) of the items selected by teacher #1. Extensive experimental results show that our <span>DUET</span> framework consistently outperforms state-of-the-art KD methods on three benchmark datasets. Notably, it beats RD, CD, DE-RRD, BD, and TD up to 13.6%, 18.6%, 16.8%, 9.6%, and 18.6% in terms of NDCG@10, respectively.</div></div>","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"120 ","pages":"Article 103098"},"PeriodicalIF":14.7000,"publicationDate":"2025-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Fusion","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S156625352500171X","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Various knowledge distillation (KD) methods for recommender systems have been recently introduced to achieve two goals: (i) obtaining an inference time shorter than the cumbersome model (i.e., teacher) and (ii) providing accuracy higher than the compact model (i.e., student). Despite their success, they solely focus on developing KD methods with implicit feedback. We argue that handling CF with explicit feedback is also crucial, representing the different degrees of user preferences. Towards this goal, we propose a novel KD framework for recommender systems, namely Dually gUided knowlEdge disTillation (DUET). We first observe that explicit feedback is interpreted as two types of user preferences, i.e., pre-use preference and post-use preference. Motivated by such characteristics of explicit feedback, we aim to fuse knowledge from the teacher’s pre- and post-use preferences by employing two teachers (i.e., teacher #1 and teacher #2). Teacher #1, trained with pre-use preferences, selects some items among unrated ones. Teacher #2, trained with post-use preferences, determines the soft labels (i.e., predicted post-use preferences) of those items chosen by teacher #1. Finally, the student is trained with both the hard labels (i.e., observed post-use preferences) of rated items and the soft labels (i.e., predicted post-use preferences by teacher #2) of the items selected by teacher #1. Extensive experimental results show that our DUET framework consistently outperforms state-of-the-art KD methods on three benchmark datasets. Notably, it beats RD, CD, DE-RRD, BD, and TD up to 13.6%, 18.6%, 16.8%, 9.6%, and 18.6% in terms of NDCG@10, respectively.
期刊介绍:
Information Fusion serves as a central platform for showcasing advancements in multi-sensor, multi-source, multi-process information fusion, fostering collaboration among diverse disciplines driving its progress. It is the leading outlet for sharing research and development in this field, focusing on architectures, algorithms, and applications. Papers dealing with fundamental theoretical analyses as well as those demonstrating their application to real-world problems will be welcome.