{"title":"Positional trace encoding for next activity prediction in event logs","authors":"Antonio Pellicani , Michelangelo Ceci","doi":"10.1016/j.knosys.2025.113544","DOIUrl":null,"url":null,"abstract":"<div><div>The analysis of log data, generated by running processes in many application domains, enables organizations to identify opportunities for operational improvements. For instance, in healthcare, analyzing patient treatment logs can optimize care pathways; in manufacturing, production line logs can reveal bottlenecks; and in customer service, ticket resolution logs can streamline response protocols. One key analytical task is predicting the next activity in a process, which supports operational decision-making through better resource allocation and proactive response to customer needs. In this paper, we solve the next activity prediction task by exploiting a novel positional encoding approach that is based on sliding windows. This approach allows us to consider both a way to adapt to changes in the data distribution, and exploit positional information of the activities in the traces. The method proposed in this paper, called OREO, takes into account these aspects through a positional encoding tightly coupled with specific types of deep neural network architectures. The results on eight real-world process logs show the superiority of the models exploiting OREO encoding over state-of-the-art approaches, confirming our initial intuition of benefits gained by combining a time-window based model with positional information.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"319 ","pages":"Article 113544"},"PeriodicalIF":7.2000,"publicationDate":"2025-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125005908","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
The analysis of log data, generated by running processes in many application domains, enables organizations to identify opportunities for operational improvements. For instance, in healthcare, analyzing patient treatment logs can optimize care pathways; in manufacturing, production line logs can reveal bottlenecks; and in customer service, ticket resolution logs can streamline response protocols. One key analytical task is predicting the next activity in a process, which supports operational decision-making through better resource allocation and proactive response to customer needs. In this paper, we solve the next activity prediction task by exploiting a novel positional encoding approach that is based on sliding windows. This approach allows us to consider both a way to adapt to changes in the data distribution, and exploit positional information of the activities in the traces. The method proposed in this paper, called OREO, takes into account these aspects through a positional encoding tightly coupled with specific types of deep neural network architectures. The results on eight real-world process logs show the superiority of the models exploiting OREO encoding over state-of-the-art approaches, confirming our initial intuition of benefits gained by combining a time-window based model with positional information.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.