使用基于 k-轮廓的递归深度图聚类提取搜索任务

IF 7.5 2区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Engineering Applications of Artificial Intelligence Pub Date : 2024-10-28 DOI:10.1016/j.engappai.2024.109501

{"title":"使用基于 k-轮廓的递归深度图聚类提取搜索任务","authors":"","doi":"10.1016/j.engappai.2024.109501","DOIUrl":null,"url":null,"abstract":"<div><div>Search engines must accurately predict the implicit intent of users to effectively guide their online search experience and assist them in completing their tasks. Users create time-ordered query logs by performing various queries on search engines to access desired information. Search task extraction groups queries with the same intent into unique clusters, whether these queries come from different tasks within the same session or from the same task across different sessions. Accurate identification of user intent improves the performance of search-guiding processes, including query suggestion, personalized search, and advertisement retrieval. Many existing methods focus on creating graphs that show relationships between queries. However, these methods typically cluster the graph using simple threshold-based techniques rather than leveraging graph topological structure features. Recent studies have introduced deep clustering layers to prevent the model size from growing as the number of queries increases. However, these models rely on labeled data and overlook modern embeddings from language models. We propose a novel k-contour-based graph convolutional network connective proximity clustering layer (CoGCN-C-CL) architecture that clusters graphs without requiring labeled data by leveraging graph topological properties. CoGCN-C-CL simultaneously learns query representations and search tasks. The k-contours identify distinct regions of the graph, while the graph convolutional network (GCN) exploits interactions between nodes within these regions. Experimental results demonstrate that CoGCN-C-CL outperforms existing state-of-the-art search task clustering methods on frequently used search task datasets.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":7.5000,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Search task extraction using k-contour based recurrent deep graph clustering\",\"authors\":\"\",\"doi\":\"10.1016/j.engappai.2024.109501\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Search engines must accurately predict the implicit intent of users to effectively guide their online search experience and assist them in completing their tasks. Users create time-ordered query logs by performing various queries on search engines to access desired information. Search task extraction groups queries with the same intent into unique clusters, whether these queries come from different tasks within the same session or from the same task across different sessions. Accurate identification of user intent improves the performance of search-guiding processes, including query suggestion, personalized search, and advertisement retrieval. Many existing methods focus on creating graphs that show relationships between queries. However, these methods typically cluster the graph using simple threshold-based techniques rather than leveraging graph topological structure features. Recent studies have introduced deep clustering layers to prevent the model size from growing as the number of queries increases. However, these models rely on labeled data and overlook modern embeddings from language models. We propose a novel k-contour-based graph convolutional network connective proximity clustering layer (CoGCN-C-CL) architecture that clusters graphs without requiring labeled data by leveraging graph topological properties. CoGCN-C-CL simultaneously learns query representations and search tasks. The k-contours identify distinct regions of the graph, while the graph convolutional network (GCN) exploits interactions between nodes within these regions. Experimental results demonstrate that CoGCN-C-CL outperforms existing state-of-the-art search task clustering methods on frequently used search task datasets.</div></div>\",\"PeriodicalId\":50523,\"journal\":{\"name\":\"Engineering Applications of Artificial Intelligence\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2024-10-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Applications of Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0952197624016592\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197624016592","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

搜索引擎必须准确预测用户的隐含意图，才能有效指导用户的在线搜索体验，并协助他们完成任务。用户通过在搜索引擎上执行各种查询来获取所需信息，从而创建了有时间顺序的查询日志。无论这些查询是来自同一会话中的不同任务，还是来自不同会话中的同一任务，搜索任务提取都会将具有相同意图的查询归入独特的群组。准确识别用户意图可以提高搜索引导过程的性能，包括查询建议、个性化搜索和广告检索。许多现有方法都侧重于创建显示查询之间关系的图表。但是，这些方法通常使用简单的基于阈值的技术对图进行聚类，而不是利用图的拓扑结构特征。最近的研究引入了深度聚类层，以防止模型规模随着查询次数的增加而扩大。然而，这些模型依赖于标注数据，忽略了语言模型的现代嵌入。我们提出了一种新颖的基于 k-contour 的图卷积网络连接邻近聚类层（CoGCN-C-CL）架构，它利用图的拓扑特性，无需标注数据即可对图进行聚类。CoGCN-CL 可同时学习查询表示和搜索任务。k-contours 标识了图的不同区域，而图卷积网络 (GCN) 则利用了这些区域内节点之间的交互作用。实验结果表明，CoGCN-CL 在常用搜索任务数据集上的表现优于现有的最先进搜索任务聚类方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Search task extraction using k-contour based recurrent deep graph clustering

Search engines must accurately predict the implicit intent of users to effectively guide their online search experience and assist them in completing their tasks. Users create time-ordered query logs by performing various queries on search engines to access desired information. Search task extraction groups queries with the same intent into unique clusters, whether these queries come from different tasks within the same session or from the same task across different sessions. Accurate identification of user intent improves the performance of search-guiding processes, including query suggestion, personalized search, and advertisement retrieval. Many existing methods focus on creating graphs that show relationships between queries. However, these methods typically cluster the graph using simple threshold-based techniques rather than leveraging graph topological structure features. Recent studies have introduced deep clustering layers to prevent the model size from growing as the number of queries increases. However, these models rely on labeled data and overlook modern embeddings from language models. We propose a novel k-contour-based graph convolutional network connective proximity clustering layer (CoGCN-C-CL) architecture that clusters graphs without requiring labeled data by leveraging graph topological properties. CoGCN-C-CL simultaneously learns query representations and search tasks. The k-contours identify distinct regions of the graph, while the graph convolutional network (GCN) exploits interactions between nodes within these regions. Experimental results demonstrate that CoGCN-C-CL outperforms existing state-of-the-art search task clustering methods on frequently used search task datasets.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Engineering Applications of Artificial Intelligence 工程技术-工程：电子与电气

CiteScore

9.60

自引率

10.00%

发文量

505

审稿时长

68 days

期刊介绍： Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.