Improving Bounce Rate Prediction for Rare Queries by Leveraging Landing Page Signals

Yeshi Dolma, Raunak Kalani, Astha Agrawal, Saurav Basu
{"title":"Improving Bounce Rate Prediction for Rare Queries by Leveraging Landing Page Signals","authors":"Yeshi Dolma, Raunak Kalani, Astha Agrawal, Saurav Basu","doi":"10.1145/3442442.3453540","DOIUrl":null,"url":null,"abstract":"Bounce rate prediction for clicked ads in sponsored search advertising is crucial for improving the quality of ads shown to the user. Bounce rate represents the proportion of landing pages for clicked ads on which users spend less than a specified time signifying that the user did not find a possible match of their query intent with the landing page content. In the pay-per-click revenue model for search engines, higher bounce rates mean advertisers get charged without meaningful user engagement, which impacts user and advertiser retention in long term. In real-time search engine settings complex ML models are prohibitive due to stringent latency requirements. Also historical logs are ineffective for rare queries (tail) where the data is sparse, as well as for matching user intent to adcopy when the query and bidded keywords don’t exactly overlap (smart match). In this paper, we propose a real-time bounce rate prediction system that leverages lightweight features like modified tf, positional and proximity features computed from ad landing pages and improves prediction for rare queries. The model preserves privacy and uses no user based feature. The entire ensemble is trained on millions of examples from the offline user log of the Bing commercial search engine and improves the ranking metrics for tail queries and smart match by more than 2x compared to a model that only uses ad-copy-advertiser features.","PeriodicalId":129420,"journal":{"name":"Companion Proceedings of the Web Conference 2021","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Companion Proceedings of the Web Conference 2021","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3442442.3453540","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Bounce rate prediction for clicked ads in sponsored search advertising is crucial for improving the quality of ads shown to the user. Bounce rate represents the proportion of landing pages for clicked ads on which users spend less than a specified time signifying that the user did not find a possible match of their query intent with the landing page content. In the pay-per-click revenue model for search engines, higher bounce rates mean advertisers get charged without meaningful user engagement, which impacts user and advertiser retention in long term. In real-time search engine settings complex ML models are prohibitive due to stringent latency requirements. Also historical logs are ineffective for rare queries (tail) where the data is sparse, as well as for matching user intent to adcopy when the query and bidded keywords don’t exactly overlap (smart match). In this paper, we propose a real-time bounce rate prediction system that leverages lightweight features like modified tf, positional and proximity features computed from ad landing pages and improves prediction for rare queries. The model preserves privacy and uses no user based feature. The entire ensemble is trained on millions of examples from the offline user log of the Bing commercial search engine and improves the ranking metrics for tail queries and smart match by more than 2x compared to a model that only uses ad-copy-advertiser features.
利用登陆页信号改进罕见查询的跳出率预测
赞助搜索广告中点击广告的跳出率预测对于提高向用户展示的广告质量至关重要。跳出率表示用户在点击广告的登陆页面上花费的时间少于指定时间的比例,这表明用户没有发现他们的查询意图与登陆页面内容可能相匹配。在搜索引擎的按点击付费收入模式中,较高的跳出率意味着广告商在没有用户粘性的情况下收取费用,这将影响用户和广告商的长期留存率。在实时搜索引擎设置中,由于严格的延迟要求,复杂的ML模型是令人望而却步的。此外,对于数据稀疏的罕见查询(tail),以及当查询和出价关键字不完全重叠时(智能匹配)匹配用户意图时,历史日志是无效的。在本文中,我们提出了一个实时跳出率预测系统,该系统利用了从广告登陆页面计算的修改tf、位置和邻近特征等轻量级特征,并改进了对罕见查询的预测。该模型保护隐私,不使用基于用户的特性。整个集合是在必应商业搜索引擎的数百万个离线用户日志样本上进行训练的,与只使用广告复制广告主特征的模型相比,它将尾部查询和智能匹配的排名指标提高了2倍以上。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信