Yijie Zhong , Feifan Wu , Mengying Guo , Xiaolian Zhang , Meng Wang , Haofen Wang
{"title":"Meta-PKE: Memory-Enhanced Task-Adaptive Personal Knowledge Extraction in Daily Life","authors":"Yijie Zhong , Feifan Wu , Mengying Guo , Xiaolian Zhang , Meng Wang , Haofen Wang","doi":"10.1016/j.ipm.2025.104097","DOIUrl":null,"url":null,"abstract":"<div><div>In this paper, we propose the task of personal knowledge extraction to get structured knowledge from personal data in daily life. The existing information extraction methods struggle to handle this task due to the personal data’s multi-source, fine-grained, dynamic, and personalized nature. They fail to select necessary extraction tasks adaptively, cope with diverse scenarios in daily life, and overlook the assistance of historical personal data for the extraction task. Thus, we propose a novel <strong>M</strong>emory-<strong>E</strong>nhanced <strong>T</strong>ask-<strong>A</strong>daptive <strong>P</strong>ersonal <strong>K</strong>nowledge <strong>E</strong>xtraction method called <em>Meta-PKE</em>. We introduce a task selection module to select the necessary extraction tasks without manual specification according to input personal data. When executing the selected extraction tasks, we record the historical data as the memory and design a memory-enhanced progressive extraction module. Structured personal knowledge is extracted in a coarse-to-fine manner aided by the optimal historical data from a carefully designed memory selection strategy. In addition, we propose a knowledge re-identification module to ensure the completeness of the extracted personal knowledge while avoiding the hallucinations engendered by the large language models. Extensive experiments reflect that, only utilizing the model with a small number of parameters (7B <em>v.s.</em> <span><math><mo>></mo></math></span>100B), Meta-PKE outperforms the state-of-the-art methods by near 15%, 20%, and 10% on 3 datasets, which cover not only daily but also non-daily scenarios more efficiently.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 4","pages":"Article 104097"},"PeriodicalIF":7.4000,"publicationDate":"2025-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Processing & Management","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0306457325000391","RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, we propose the task of personal knowledge extraction to get structured knowledge from personal data in daily life. The existing information extraction methods struggle to handle this task due to the personal data’s multi-source, fine-grained, dynamic, and personalized nature. They fail to select necessary extraction tasks adaptively, cope with diverse scenarios in daily life, and overlook the assistance of historical personal data for the extraction task. Thus, we propose a novel Memory-Enhanced Task-Adaptive Personal Knowledge Extraction method called Meta-PKE. We introduce a task selection module to select the necessary extraction tasks without manual specification according to input personal data. When executing the selected extraction tasks, we record the historical data as the memory and design a memory-enhanced progressive extraction module. Structured personal knowledge is extracted in a coarse-to-fine manner aided by the optimal historical data from a carefully designed memory selection strategy. In addition, we propose a knowledge re-identification module to ensure the completeness of the extracted personal knowledge while avoiding the hallucinations engendered by the large language models. Extensive experiments reflect that, only utilizing the model with a small number of parameters (7B v.s. 100B), Meta-PKE outperforms the state-of-the-art methods by near 15%, 20%, and 10% on 3 datasets, which cover not only daily but also non-daily scenarios more efficiently.
本文提出了个人知识抽取任务,从日常生活中的个人数据中提取结构化的知识。由于个人数据具有多源、细粒度、动态和个性化的特性,现有的信息提取方法难以处理这一任务。不能自适应地选择必要的提取任务,不能适应日常生活中的各种场景,忽略了个人历史数据对提取任务的辅助。因此,我们提出了一种新的记忆增强任务自适应个人知识提取方法Meta-PKE。我们引入了任务选择模块,可以根据输入的个人数据选择需要的提取任务,无需人工规范。在执行选定的提取任务时,我们将历史数据记录为内存,并设计了一个内存增强的渐进式提取模块。结构化的个人知识是在精心设计的记忆选择策略的最佳历史数据的帮助下,以一种从粗到精的方式提取出来的。此外,我们提出了一个知识再识别模块,以确保提取的个人知识的完整性,同时避免了大型语言模型产生的幻觉。大量实验表明,仅使用具有少量参数的模型(7B vs .s >100B), Meta-PKE在3个数据集上的性能就比最先进的方法高出近15%,20%和10%,不仅可以有效地覆盖日常场景,还可以更有效地覆盖非日常场景。
期刊介绍:
Information Processing and Management is dedicated to publishing cutting-edge original research at the convergence of computing and information science. Our scope encompasses theory, methods, and applications across various domains, including advertising, business, health, information science, information technology marketing, and social computing.
We aim to cater to the interests of both primary researchers and practitioners by offering an effective platform for the timely dissemination of advanced and topical issues in this interdisciplinary field. The journal places particular emphasis on original research articles, research survey articles, research method articles, and articles addressing critical applications of research. Join us in advancing knowledge and innovation at the intersection of computing and information science.