Slice: Scalable Linear Extreme Classifiers Trained on 100 Million Labels for Related Searches

Himanshu Jain, V. Balasubramanian, Bhanu Chunduri, M. Varma
{"title":"Slice: Scalable Linear Extreme Classifiers Trained on 100 Million Labels for Related Searches","authors":"Himanshu Jain, V. Balasubramanian, Bhanu Chunduri, M. Varma","doi":"10.1145/3289600.3290979","DOIUrl":null,"url":null,"abstract":"This paper reformulates the problem of recommending related queries on a search engine as an extreme multi-label learning task. Extreme multi-label learning aims to annotate each data point with the most relevant subset of labels from an extremely large label set. Each of the top 100 million queries on Bing was treated as a separate label in the proposed reformulation and an extreme classifier was learnt which took the user's query as input and predicted the relevant subset of 100 million queries as output. Unfortunately, state-of-the-art extreme classifiers have not been shown to scale beyond 10 million labels and have poor prediction accuracies for queries. This paper therefore develops the Slice algorithm which can be accurately trained on low-dimensional, dense deep learning features popularly used to represent queries and which efficiently scales to 100 million labels and 240 million training points. Slice achieves this by reducing the training and prediction times from linear to logarithmic in the number of labels based on a novel negative sampling technique. This allows the proposed reformulation to address some of the limitations of traditional related searches approaches in terms of coverage, density and quality. Experiments on publically available extreme classification datasets with low-dimensional dense features as well as related searches datasets mined from the Bing logs revealed that slice could be more accurate than leading extreme classifiers while also scaling to 100 million labels. Furthermore, slice was found to improve the accuracy of recommendations by 10% as compared to state-of-the-art related searches techniques. Finally, when added to the ensemble in production in Bing, slice was found to increase the trigger coverage by 52%, the suggestion density by 33%, the overall success rate by 2.6% and the success rate for tail queries by 12.6%. Slice's source code can be downloaded from [21].","PeriodicalId":143253,"journal":{"name":"Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining","volume":"424 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"118","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3289600.3290979","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 118

Abstract

This paper reformulates the problem of recommending related queries on a search engine as an extreme multi-label learning task. Extreme multi-label learning aims to annotate each data point with the most relevant subset of labels from an extremely large label set. Each of the top 100 million queries on Bing was treated as a separate label in the proposed reformulation, and an extreme classifier was learnt which took the user's query as input and predicted the relevant subset of the 100 million queries as output. Unfortunately, state-of-the-art extreme classifiers have not been shown to scale beyond 10 million labels and have poor prediction accuracies for queries. This paper therefore develops the Slice algorithm, which can be accurately trained on the low-dimensional, dense deep learning features popularly used to represent queries and which efficiently scales to 100 million labels and 240 million training points. Slice achieves this by reducing the training and prediction times from linear to logarithmic in the number of labels based on a novel negative sampling technique. This allows the proposed reformulation to address some of the limitations of traditional related searches approaches in terms of coverage, density and quality. Experiments on publicly available extreme classification datasets with low-dimensional dense features, as well as related searches datasets mined from the Bing logs, revealed that Slice could be more accurate than leading extreme classifiers while also scaling to 100 million labels. Furthermore, Slice was found to improve the accuracy of recommendations by 10% as compared to state-of-the-art related searches techniques. Finally, when added to the ensemble in production in Bing, Slice was found to increase trigger coverage by 52%, suggestion density by 33%, the overall success rate by 2.6% and the success rate for tail queries by 12.6%. Slice's source code can be downloaded from [21].
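The abstract states that Slice keeps one-vs-all training and prediction costs logarithmic in the number of labels via negative sampling over dense query features, but it does not spell out the sampling mechanism. The sketch below is a minimal, hypothetical illustration of that general idea only, not the authors' implementation: it assumes negatives are shortlisted by nearest-neighbour search around per-label centroids, uses a brute-force scikit-learn index as a stand-in for a true approximate-nearest-neighbour structure, and invents all function names and parameters for illustration.

```python
# Minimal sketch (not the paper's implementation): one-vs-all linear classifiers
# trained on sampled "hard" negatives instead of all non-positive points.
# Assumptions: every label id in 0..n_labels-1 has at least one positive point,
# and edge cases (e.g. a shortlist containing only positives) are ignored.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.neighbors import NearestNeighbors  # stand-in for a real ANN index


def train_ova_with_negative_sampling(X, Y, n_negatives=50):
    """X: (n_points, dim) dense query features; Y: list of label-id sets, one per point."""
    n_labels = 1 + max(l for labels in Y for l in labels)
    positives = [[] for _ in range(n_labels)]
    for i, labels in enumerate(Y):
        for l in labels:
            positives[l].append(i)

    # One centroid per label, used to find points that resemble the label's
    # positives but are not tagged with it (the "hard" negatives).
    centroids = np.vstack([X[idx].mean(axis=0) for idx in positives])
    point_index = NearestNeighbors(n_neighbors=n_negatives).fit(X)

    classifiers = []
    for l in range(n_labels):
        pos = positives[l]
        _, nbrs = point_index.kneighbors(centroids[l][None, :])
        neg = [int(i) for i in nbrs[0] if i not in set(pos)]
        rows = pos + neg
        y = np.array([1] * len(pos) + [0] * len(neg))
        # Each label sees only its positives plus a small sampled negative set,
        # rather than all 240M training points.
        classifiers.append(LogisticRegression(max_iter=1000).fit(X[rows], y))
    return classifiers, centroids


def predict_scores(x, classifiers, centroids, shortlist_size=10):
    """Score only the labels whose centroids are nearest to the query feature x."""
    label_index = NearestNeighbors(n_neighbors=shortlist_size).fit(centroids)
    _, cand = label_index.kneighbors(x[None, :])
    return {int(l): classifiers[l].predict_proba(x[None, :])[0, 1] for l in cand[0]}
```

In this sketch the same shortlist idea is reused at prediction time: only a handful of candidate labels are scored per query, which is what keeps per-query cost sub-linear when the label space holds 100 million queries.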