LLM4TAP: llm增强的TAP规则建议

IF 8.9 1区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

IEEE Internet of Things Journal Pub Date : 2025-02-05 DOI:10.1109/JIOT.2025.3532977

Gang Wu;Liang Hu;Yuxiao Hu;Xingbo Xiong;Feng Wang

{"title":"LLM4TAP: llm增强的TAP规则建议","authors":"Gang Wu;Liang Hu;Yuxiao Hu;Xingbo Xiong;Feng Wang","doi":"10.1109/JIOT.2025.3532977","DOIUrl":null,"url":null,"abstract":"Trigger-action programming (TAP) is an Internet of Things (IoT) paradigm that enables nonprofessional end-users to automate smart devices by formulating rules, such as “IF you leave home, THEN turn off lights.” As the number of possible rules increases, manually browsing these rules becomes increasingly time-consuming for users. Recently, graph-based recommendation systems have shown promise in automatically suggesting rules, yet they face two issues. First, these studies struggle to identify and differentiate users’ demands (e.g., turning off lights) and intentions (e.g., energy saving). Second, they overlook the issue of sparse user-rule interactions. In this article, we propose LLM4TAP, a large language model (LLM) enhanced TAP rule recommendation framework, to address these issues. Prior to LLM4TAP, a user-rule graph is constructed to represent the interactions between users and rules. Within LLM4TAP, singular value decomposition is first employed to generate an augmented graph, strengthening global structural relationships between users and rules. Next, the reasoning capabilities of LLMs are utilized to infer users’ demands and intentions from the textual descriptions of rules and user-rule interactions, producing representations of these inferred demands and intentions. Finally, a dual representation alignment method is introduced, integrating user demands and intentions derived from LLMs with the global structural information from the augmentation graph within a contrastive learning framework to enhance representation performance. Extensive experiments demonstrate the effectiveness of LLM4TAP, achieving the maximum improvements of 8.96% and 4.72% over the strongest compared methods on the IFTTT and Wyze datasets, respectively.","PeriodicalId":54347,"journal":{"name":"IEEE Internet of Things Journal","volume":"12 10","pages":"13157-13169"},"PeriodicalIF":8.9000,"publicationDate":"2025-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"LLM4TAP: LLM-Enhanced TAP Rule Recommendation\",\"authors\":\"Gang Wu;Liang Hu;Yuxiao Hu;Xingbo Xiong;Feng Wang\",\"doi\":\"10.1109/JIOT.2025.3532977\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Trigger-action programming (TAP) is an Internet of Things (IoT) paradigm that enables nonprofessional end-users to automate smart devices by formulating rules, such as “IF you leave home, THEN turn off lights.” As the number of possible rules increases, manually browsing these rules becomes increasingly time-consuming for users. Recently, graph-based recommendation systems have shown promise in automatically suggesting rules, yet they face two issues. First, these studies struggle to identify and differentiate users’ demands (e.g., turning off lights) and intentions (e.g., energy saving). Second, they overlook the issue of sparse user-rule interactions. In this article, we propose LLM4TAP, a large language model (LLM) enhanced TAP rule recommendation framework, to address these issues. Prior to LLM4TAP, a user-rule graph is constructed to represent the interactions between users and rules. Within LLM4TAP, singular value decomposition is first employed to generate an augmented graph, strengthening global structural relationships between users and rules. Next, the reasoning capabilities of LLMs are utilized to infer users’ demands and intentions from the textual descriptions of rules and user-rule interactions, producing representations of these inferred demands and intentions. Finally, a dual representation alignment method is introduced, integrating user demands and intentions derived from LLMs with the global structural information from the augmentation graph within a contrastive learning framework to enhance representation performance. Extensive experiments demonstrate the effectiveness of LLM4TAP, achieving the maximum improvements of 8.96% and 4.72% over the strongest compared methods on the IFTTT and Wyze datasets, respectively.\",\"PeriodicalId\":54347,\"journal\":{\"name\":\"IEEE Internet of Things Journal\",\"volume\":\"12 10\",\"pages\":\"13157-13169\"},\"PeriodicalIF\":8.9000,\"publicationDate\":\"2025-02-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Internet of Things Journal\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10876163/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Internet of Things Journal","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10876163/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

触发-动作编程（TAP）是一种物联网（IoT）范例，它使非专业的最终用户能够通过制定规则来自动化智能设备，例如“如果你离开家，然后关灯”。随着可能的规则数量的增加，手动浏览这些规则对用户来说变得越来越耗时。近年来，基于图的推荐系统在自动推荐规则方面显示出了很大的前景，但它们面临着两个问题。首先，这些研究难以识别和区分用户的需求（例如，关灯）和意图（例如，节能）。其次，他们忽略了稀疏的用户规则交互问题。在本文中，我们提出LLM4TAP，一个大型语言模型（LLM）增强的TAP规则推荐框架来解决这些问题。在LLM4TAP之前，构造了一个用户规则图来表示用户和规则之间的交互。在LLM4TAP中，首先使用奇异值分解来生成增宽图，从而加强用户和规则之间的全局结构关系。接下来，利用llm的推理能力，从规则和用户-规则交互的文本描述中推断出用户的需求和意图，生成这些推断出的需求和意图的表示。最后，介绍了一种双表示对齐方法，在对比学习框架内将llm的用户需求和意图与增强图的全局结构信息相结合，以提高表示性能。大量的实验证明了LLM4TAP的有效性，在IFTTT和Wyze数据集上，与最强的比较方法相比，LLM4TAP的最大改进率分别为8.96%和4.72%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

LLM4TAP: LLM-Enhanced TAP Rule Recommendation

Trigger-action programming (TAP) is an Internet of Things (IoT) paradigm that enables nonprofessional end-users to automate smart devices by formulating rules, such as “IF you leave home, THEN turn off lights.” As the number of possible rules increases, manually browsing these rules becomes increasingly time-consuming for users. Recently, graph-based recommendation systems have shown promise in automatically suggesting rules, yet they face two issues. First, these studies struggle to identify and differentiate users’ demands (e.g., turning off lights) and intentions (e.g., energy saving). Second, they overlook the issue of sparse user-rule interactions. In this article, we propose LLM4TAP, a large language model (LLM) enhanced TAP rule recommendation framework, to address these issues. Prior to LLM4TAP, a user-rule graph is constructed to represent the interactions between users and rules. Within LLM4TAP, singular value decomposition is first employed to generate an augmented graph, strengthening global structural relationships between users and rules. Next, the reasoning capabilities of LLMs are utilized to infer users’ demands and intentions from the textual descriptions of rules and user-rule interactions, producing representations of these inferred demands and intentions. Finally, a dual representation alignment method is introduced, integrating user demands and intentions derived from LLMs with the global structural information from the augmentation graph within a contrastive learning framework to enhance representation performance. Extensive experiments demonstrate the effectiveness of LLM4TAP, achieving the maximum improvements of 8.96% and 4.72% over the strongest compared methods on the IFTTT and Wyze datasets, respectively.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Internet of Things Journal Computer Science-Information Systems

CiteScore

17.60

自引率

13.20%

发文量

1982

期刊介绍： The EEE Internet of Things (IoT) Journal publishes articles and review articles covering various aspects of IoT, including IoT system architecture, IoT enabling technologies, IoT communication and networking protocols such as network coding, and IoT services and applications. Topics encompass IoT's impacts on sensor technologies, big data management, and future internet design for applications like smart cities and smart homes. Fields of interest include IoT architecture such as things-centric, data-centric, service-oriented IoT architecture; IoT enabling technologies and systematic integration such as sensor technologies, big sensor data management, and future Internet design for IoT; IoT services, applications, and test-beds such as IoT service middleware, IoT application programming interface (API), IoT application design, and IoT trials/experiments; IoT standardization activities and technology development in different standard development organizations (SDO) such as IEEE, IETF, ITU, 3GPP, ETSI, etc.