{"title":"Learning to Augment Imbalanced Data for Re-ranking Models","authors":"Zimeng Qiu, Yingchun Jian, Qingguo Chen, Lijun Zhang","doi":"10.1145/3459637.3482364","DOIUrl":null,"url":null,"abstract":"The conventional solution to learning to rank problems ranks individual documents by prediction scores greedily. Recent emerged re-ranking models, which take as input initial lists, aim to capture document interdependencies and directly generate the optimal ordered lists. Typically, a re-ranking model is learned from a set of labeled data, which can achieve favorable performance on average. However, it can be suboptimal for individual queries because the available training data is usually highly imbalanced. This problem is challenging due to the absence of informative data for some queries and furthermore, the lack of a good data augmentation policy. In this paper, we propose a novel method named Learning to Augment (LTA), which mitigates the imbalance issue through learning to augment the initial lists for re-ranking models. Specifically, we first design a data generation model based on Gaussian Mixture Variational Autoencoder (GMVAE) for generating informative data. GMVAE imposes a mixture of Gaussians on the latent space, which allows it to cluster queries in an unsupervised manner and then generate new data with different query types using the learned components. Then, to obtain a good augmentation strategy (instead of heuristics), we design a teacher model that consists of two intelligent agents to determine how to generate new data for a given list and how to rank both the raw data and generated data to produce augmented lists, respectively. The teacher model leverages the feedback from the re-ranking model to optimize its augmentation policy by means of reinforcement learning. Our method offers a general learning paradigm that is applicable to both supervised and reinforced re-ranking models. Experimental results on benchmark learning to rank datasets show that our proposed method can significantly improve the performance of re-ranking models.","PeriodicalId":405296,"journal":{"name":"Proceedings of the 30th ACM International Conference on Information & Knowledge Management","volume":"58 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 30th ACM International Conference on Information & Knowledge Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3459637.3482364","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 1
Abstract
The conventional solution to learning-to-rank problems ranks individual documents greedily by their prediction scores. Recently emerged re-ranking models, which take initial ranked lists as input, aim to capture document interdependencies and directly generate optimal ordered lists. Typically, a re-ranking model is learned from a set of labeled data and achieves favorable performance on average. However, it can be suboptimal for individual queries because the available training data is usually highly imbalanced. This problem is challenging due to the absence of informative data for some queries and, furthermore, the lack of a good data augmentation policy. In this paper, we propose a novel method named Learning to Augment (LTA), which mitigates the imbalance issue by learning to augment the initial lists fed to re-ranking models. Specifically, we first design a data generation model based on the Gaussian Mixture Variational Autoencoder (GMVAE) for generating informative data. The GMVAE imposes a mixture of Gaussians on the latent space, which allows it to cluster queries in an unsupervised manner and then generate new data for different query types from the learned mixture components. Then, to obtain a good augmentation strategy (rather than relying on heuristics), we design a teacher model consisting of two intelligent agents: one determines how to generate new data for a given list, and the other determines how to rank the raw and generated data together to produce the augmented list. The teacher model leverages feedback from the re-ranking model to optimize its augmentation policy via reinforcement learning. Our method offers a general learning paradigm applicable to both supervised and reinforced re-ranking models. Experimental results on benchmark learning-to-rank datasets show that the proposed method significantly improves the performance of re-ranking models.
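To make the generative component concrete, below is a minimal PyTorch sketch of a GMVAE-style generator in the spirit the abstract describes: a mixture-of-Gaussians prior over the latent space, with one learnable Gaussian per component so that each component can specialize to one unsupervised query cluster and later be sampled to synthesize data of that query type. This is an illustrative assumption, not the paper's implementation; all names (GMVAE, generate), dimensions (x_dim=136 as a typical learning-to-rank feature size; z_dim, k, h_dim), and architectural choices (Gumbel-softmax cluster assignments, MLP encoder/decoder) are hypothetical.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class GMVAE(nn.Module):
    """Sketch of a Gaussian Mixture VAE: the prior over the latent space is
    a mixture of K Gaussians, so each component can specialize to one
    (unsupervised) query cluster."""
    def __init__(self, x_dim=136, z_dim=32, k=10, h_dim=128):
        super().__init__()
        self.k = k
        # q(c|x): soft cluster assignment for a document feature vector
        self.cluster = nn.Sequential(nn.Linear(x_dim, h_dim), nn.ReLU(),
                                     nn.Linear(h_dim, k))
        # q(z|x,c): encoder conditioned on the cluster assignment
        self.encoder = nn.Sequential(nn.Linear(x_dim + k, h_dim), nn.ReLU())
        self.mu = nn.Linear(h_dim, z_dim)
        self.logvar = nn.Linear(h_dim, z_dim)
        # p(z|c): one learnable Gaussian per mixture component
        self.prior_mu = nn.Parameter(torch.randn(k, z_dim))
        self.prior_logvar = nn.Parameter(torch.zeros(k, z_dim))
        # p(x|z): decoder that reconstructs document feature vectors
        self.decoder = nn.Sequential(nn.Linear(z_dim, h_dim), nn.ReLU(),
                                     nn.Linear(h_dim, x_dim))

    def forward(self, x):
        c_logits = self.cluster(x)
        # Differentiable (soft) cluster assignment via Gumbel-softmax
        c = F.gumbel_softmax(c_logits, tau=0.5, hard=False)
        h = self.encoder(torch.cat([x, c], dim=-1))
        mu, logvar = self.mu(h), self.logvar(h)
        # Reparameterization trick: z = mu + sigma * eps
        z = mu + torch.randn_like(mu) * (0.5 * logvar).exp()
        return self.decoder(z), mu, logvar, c_logits

    @torch.no_grad()
    def generate(self, component, n):
        """Sample n synthetic feature vectors from one mixture component,
        i.e. new documents of a particular query type."""
        mu = self.prior_mu[component]
        std = (0.5 * self.prior_logvar[component]).exp()
        z = mu + torch.randn(n, mu.shape[-1]) * std
        return self.decoder(z)

# Usage sketch: encode a batch, then sample new data from cluster 3.
# x = torch.randn(64, 136); model = GMVAE()
# recon, mu, logvar, c_logits = model(x)
# synthetic = model.generate(component=3, n=16)
```

Training such a model would maximize an ELBO combining a reconstruction term with KL terms against the mixture prior; the teacher model described above would then sit on top of generate(), using reinforcement-learning feedback from the re-ranker to decide which components to sample and how to merge the synthetic documents into the initial list.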