GBDT4CTRVis:用于广告点击率预测的梯度提升决策树可视化分析技术

IF 1.7 4区 计算机科学 Q3 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
Wenwen Gao, Shangsong Liu, Yi Zhou, Fengjie Wang, Feng Zhou, Min Zhu
{"title":"GBDT4CTRVis:用于广告点击率预测的梯度提升决策树可视化分析技术","authors":"Wenwen Gao, Shangsong Liu, Yi Zhou, Fengjie Wang, Feng Zhou, Min Zhu","doi":"10.1007/s12650-024-00984-0","DOIUrl":null,"url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">Abstract</h3><p>Gradient boosting decision tree (GBDT) is a mainstream model for advertisement click-through rate (CTR) prediction. Since the complex working mechanism of GBDT, advertising analysts often fail to analyze the decision-making and the iterative evolution process of a large number of decision trees, as well as to understand the impact of different features on the prediction results, which makes the model tuning quite challenging. To address these challenges, we propose a visual analytics system, GBDT4CTRVis, which helps advertising analysts understand the working mechanism of GBDT and facilitate model tuning through intuitive and interactive views. Specifically, we propose instance-level views to hierarchically explore the prediction results of advertising data, feature-level views to analyze the importance of features and their correlations from various perspectives, and model-level views to investigate the structure of representative decision trees and the temporal evolution of information gain during model prediction. We also provide multi-view interactions and panel control for flexible exploration. Finally, we evaluate GBDT4CTRVis through three case studies and expert evaluations. Feedback from experts indicated the usefulness and effectiveness of GBDT4CTRVis in helping to understand the model mechanism and tune the model.</p><h3 data-test=\"abstract-sub-heading\">Graphical abstract</h3>","PeriodicalId":54756,"journal":{"name":"Journal of Visualization","volume":"45 1","pages":""},"PeriodicalIF":1.7000,"publicationDate":"2024-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"GBDT4CTRVis: visual analytics of gradient boosting decision tree for advertisement click-through rate prediction\",\"authors\":\"Wenwen Gao, Shangsong Liu, Yi Zhou, Fengjie Wang, Feng Zhou, Min Zhu\",\"doi\":\"10.1007/s12650-024-00984-0\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<h3 data-test=\\\"abstract-sub-heading\\\">Abstract</h3><p>Gradient boosting decision tree (GBDT) is a mainstream model for advertisement click-through rate (CTR) prediction. Since the complex working mechanism of GBDT, advertising analysts often fail to analyze the decision-making and the iterative evolution process of a large number of decision trees, as well as to understand the impact of different features on the prediction results, which makes the model tuning quite challenging. To address these challenges, we propose a visual analytics system, GBDT4CTRVis, which helps advertising analysts understand the working mechanism of GBDT and facilitate model tuning through intuitive and interactive views. Specifically, we propose instance-level views to hierarchically explore the prediction results of advertising data, feature-level views to analyze the importance of features and their correlations from various perspectives, and model-level views to investigate the structure of representative decision trees and the temporal evolution of information gain during model prediction. We also provide multi-view interactions and panel control for flexible exploration. Finally, we evaluate GBDT4CTRVis through three case studies and expert evaluations. Feedback from experts indicated the usefulness and effectiveness of GBDT4CTRVis in helping to understand the model mechanism and tune the model.</p><h3 data-test=\\\"abstract-sub-heading\\\">Graphical abstract</h3>\",\"PeriodicalId\":54756,\"journal\":{\"name\":\"Journal of Visualization\",\"volume\":\"45 1\",\"pages\":\"\"},\"PeriodicalIF\":1.7000,\"publicationDate\":\"2024-03-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Visualization\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1007/s12650-024-00984-0\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Visualization","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s12650-024-00984-0","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

摘要

摘要 梯度提升决策树(GBDT)是广告点击率(CTR)预测的主流模型。由于 GBDT 的工作机制复杂,广告分析师往往无法分析大量决策树的决策和迭代演化过程,也无法理解不同特征对预测结果的影响,这使得模型调优颇具挑战性。为了应对这些挑战,我们提出了一个可视化分析系统--GBDT4CTRVis,通过直观的交互式视图,帮助广告分析师理解 GBDT 的工作机制并促进模型调整。具体来说,我们提出了实例级视图来分层探索广告数据的预测结果,提出了特征级视图来从不同角度分析特征的重要性及其相关性,还提出了模型级视图来研究代表性决策树的结构以及模型预测过程中信息增益的时间演化。我们还提供了多视图交互和面板控制,以便灵活探索。最后,我们通过三个案例研究和专家评估对 GBDT4CTRVis 进行了评估。专家的反馈表明,GBDT4CTRVis 在帮助理解模型机制和调整模型方面非常有用和有效。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

GBDT4CTRVis: visual analytics of gradient boosting decision tree for advertisement click-through rate prediction

GBDT4CTRVis: visual analytics of gradient boosting decision tree for advertisement click-through rate prediction

Abstract

Gradient boosting decision tree (GBDT) is a mainstream model for advertisement click-through rate (CTR) prediction. Since the complex working mechanism of GBDT, advertising analysts often fail to analyze the decision-making and the iterative evolution process of a large number of decision trees, as well as to understand the impact of different features on the prediction results, which makes the model tuning quite challenging. To address these challenges, we propose a visual analytics system, GBDT4CTRVis, which helps advertising analysts understand the working mechanism of GBDT and facilitate model tuning through intuitive and interactive views. Specifically, we propose instance-level views to hierarchically explore the prediction results of advertising data, feature-level views to analyze the importance of features and their correlations from various perspectives, and model-level views to investigate the structure of representative decision trees and the temporal evolution of information gain during model prediction. We also provide multi-view interactions and panel control for flexible exploration. Finally, we evaluate GBDT4CTRVis through three case studies and expert evaluations. Feedback from experts indicated the usefulness and effectiveness of GBDT4CTRVis in helping to understand the model mechanism and tune the model.

Graphical abstract

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Visualization
Journal of Visualization COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS-IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY
CiteScore
3.40
自引率
5.90%
发文量
79
审稿时长
>12 weeks
期刊介绍: Visualization is an interdisciplinary imaging science devoted to making the invisible visible through the techniques of experimental visualization and computer-aided visualization. The scope of the Journal is to provide a place to exchange information on the latest visualization technology and its application by the presentation of latest papers of both researchers and technicians.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信