Yabo Ni , Yueqiu Wu , Jingyi Li , Anxiang Zeng , Han Yu , Xiaoxiao Li
{"title":"基于动态掩码的电子商务点击率预测特征交互建模","authors":"Yabo Ni , Yueqiu Wu , Jingyi Li , Anxiang Zeng , Han Yu , Xiaoxiao Li","doi":"10.1016/j.engappai.2025.111184","DOIUrl":null,"url":null,"abstract":"<div><div>Learning feature interactions is crucial to click-through rate (CTR) prediction in e-commerce applications. The interaction between target items and user behavior history is one of the most important factors. In recent years, Deep Neural Networks (DNNs) designed to automatically learn implicit nonlinear interactions from the original sparse features have been widely adopted for industrial CTR prediction tasks. Implicit feature interactions often cannot retain the full representation capacity of the original feature interactions. Explicit modeling approaches, such as Cartesian product methods and matrix factorization-based models, either suffer from high computational costs or have limited expressive power. Recent deep learning-based methods improve feature interaction modeling but often overlook the enhancement of feature representations and fail to consider the varying importance of different interactions, limiting model performance. To address these challenges, in this paper, we report the design, implementation and deployment of <u>F</u>eature <u>D</u>ynamic <u>M</u>asking (<span>FDM</span>), a novel framework for modeling feature interactions in deep CTR prediction models. <span>FDM</span> dynamically generates unique feature representations for different interactions using learnable masks, enabling information sharing while maintaining efficiency. Additionally, it introduces a weighting module to capture the importance of different interactions. <span>FDM</span> framework is simple, effective and easy to deploy in real-world search and recommender systems. Since its deployment in the search system of Shopee Pte Ltd., Singapore, it has achieved a commercially significant 2.76% improvement in Order Per User (OPU) and 2.45% improvement in Gross Merchandise Volume (GMV).</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"157 ","pages":"Article 111184"},"PeriodicalIF":8.0000,"publicationDate":"2025-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Dynamic masking-based feature interaction modeling for e-commerce click-through rate prediction\",\"authors\":\"Yabo Ni , Yueqiu Wu , Jingyi Li , Anxiang Zeng , Han Yu , Xiaoxiao Li\",\"doi\":\"10.1016/j.engappai.2025.111184\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Learning feature interactions is crucial to click-through rate (CTR) prediction in e-commerce applications. The interaction between target items and user behavior history is one of the most important factors. In recent years, Deep Neural Networks (DNNs) designed to automatically learn implicit nonlinear interactions from the original sparse features have been widely adopted for industrial CTR prediction tasks. Implicit feature interactions often cannot retain the full representation capacity of the original feature interactions. Explicit modeling approaches, such as Cartesian product methods and matrix factorization-based models, either suffer from high computational costs or have limited expressive power. Recent deep learning-based methods improve feature interaction modeling but often overlook the enhancement of feature representations and fail to consider the varying importance of different interactions, limiting model performance. To address these challenges, in this paper, we report the design, implementation and deployment of <u>F</u>eature <u>D</u>ynamic <u>M</u>asking (<span>FDM</span>), a novel framework for modeling feature interactions in deep CTR prediction models. <span>FDM</span> dynamically generates unique feature representations for different interactions using learnable masks, enabling information sharing while maintaining efficiency. Additionally, it introduces a weighting module to capture the importance of different interactions. <span>FDM</span> framework is simple, effective and easy to deploy in real-world search and recommender systems. Since its deployment in the search system of Shopee Pte Ltd., Singapore, it has achieved a commercially significant 2.76% improvement in Order Per User (OPU) and 2.45% improvement in Gross Merchandise Volume (GMV).</div></div>\",\"PeriodicalId\":50523,\"journal\":{\"name\":\"Engineering Applications of Artificial Intelligence\",\"volume\":\"157 \",\"pages\":\"Article 111184\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2025-06-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Applications of Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0952197625011856\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625011856","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Dynamic masking-based feature interaction modeling for e-commerce click-through rate prediction
Learning feature interactions is crucial to click-through rate (CTR) prediction in e-commerce applications. The interaction between target items and user behavior history is one of the most important factors. In recent years, Deep Neural Networks (DNNs) designed to automatically learn implicit nonlinear interactions from the original sparse features have been widely adopted for industrial CTR prediction tasks. Implicit feature interactions often cannot retain the full representation capacity of the original feature interactions. Explicit modeling approaches, such as Cartesian product methods and matrix factorization-based models, either suffer from high computational costs or have limited expressive power. Recent deep learning-based methods improve feature interaction modeling but often overlook the enhancement of feature representations and fail to consider the varying importance of different interactions, limiting model performance. To address these challenges, in this paper, we report the design, implementation and deployment of Feature Dynamic Masking (FDM), a novel framework for modeling feature interactions in deep CTR prediction models. FDM dynamically generates unique feature representations for different interactions using learnable masks, enabling information sharing while maintaining efficiency. Additionally, it introduces a weighting module to capture the importance of different interactions. FDM framework is simple, effective and easy to deploy in real-world search and recommender systems. Since its deployment in the search system of Shopee Pte Ltd., Singapore, it has achieved a commercially significant 2.76% improvement in Order Per User (OPU) and 2.45% improvement in Gross Merchandise Volume (GMV).
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.