细化流程图:以对象为中心的流程挖掘中的非结构化数据

IF 3.4 2区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS
Andreas Egger , Tobias Fehrer , Wolfgang Kratsch , Niklas Wördehoff , Fabian König , Maximilian Röglinger
{"title":"细化流程图:以对象为中心的流程挖掘中的非结构化数据","authors":"Andreas Egger ,&nbsp;Tobias Fehrer ,&nbsp;Wolfgang Kratsch ,&nbsp;Niklas Wördehoff ,&nbsp;Fabian König ,&nbsp;Maximilian Röglinger","doi":"10.1016/j.is.2025.102582","DOIUrl":null,"url":null,"abstract":"<div><div>Process mining aims to discover, monitor, and improve processes. To this end, process mining techniques use event data, typically extracted from information systems and organized along process instances. The inherent complexity of real-world processes has driven the recent introduction of object-centric process mining, allowing for a more comprehensive view of processes. Another avenue of research contributing to more complete process analyses is integrating unstructured data, which can enhance traditional event logs by extracting hitherto unidentified process information. Although combining the object-centric perspective with event log enrichment from unstructured data sources holds promising potential, such investigation remains in its infancy. Against this background, this study presents the OCRAUD, a reference architecture that provides guidance on using unstructured data sources and traditional event logs for object-centric process mining. A design science research process was employed to design and evaluate the OCRAUD. This involved conducting a total of 20 expert interviews over two rounds, comparing the OCRAUD to competing artifacts, instantiating the artifact for the use of video and sensor data, developing a software prototype, and applying the prototype to real-world data. This work contributes to process mining by guiding the combination of unstructured data with traditional event logs, incorporating an object-centric representation of event data. The instantiation targets video and sensor data, thereby demonstrating the use of the artifact. This enables researchers and practitioners to instantiate the artifact for other data types or specific use cases. The published code of the software prototype allows for further development of the implemented algorithms.</div></div>","PeriodicalId":50363,"journal":{"name":"Information Systems","volume":"134 ","pages":"Article 102582"},"PeriodicalIF":3.4000,"publicationDate":"2025-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Refining the process picture: Unstructured data in object-centric process mining\",\"authors\":\"Andreas Egger ,&nbsp;Tobias Fehrer ,&nbsp;Wolfgang Kratsch ,&nbsp;Niklas Wördehoff ,&nbsp;Fabian König ,&nbsp;Maximilian Röglinger\",\"doi\":\"10.1016/j.is.2025.102582\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Process mining aims to discover, monitor, and improve processes. To this end, process mining techniques use event data, typically extracted from information systems and organized along process instances. The inherent complexity of real-world processes has driven the recent introduction of object-centric process mining, allowing for a more comprehensive view of processes. Another avenue of research contributing to more complete process analyses is integrating unstructured data, which can enhance traditional event logs by extracting hitherto unidentified process information. Although combining the object-centric perspective with event log enrichment from unstructured data sources holds promising potential, such investigation remains in its infancy. Against this background, this study presents the OCRAUD, a reference architecture that provides guidance on using unstructured data sources and traditional event logs for object-centric process mining. A design science research process was employed to design and evaluate the OCRAUD. This involved conducting a total of 20 expert interviews over two rounds, comparing the OCRAUD to competing artifacts, instantiating the artifact for the use of video and sensor data, developing a software prototype, and applying the prototype to real-world data. This work contributes to process mining by guiding the combination of unstructured data with traditional event logs, incorporating an object-centric representation of event data. The instantiation targets video and sensor data, thereby demonstrating the use of the artifact. This enables researchers and practitioners to instantiate the artifact for other data types or specific use cases. The published code of the software prototype allows for further development of the implemented algorithms.</div></div>\",\"PeriodicalId\":50363,\"journal\":{\"name\":\"Information Systems\",\"volume\":\"134 \",\"pages\":\"Article 102582\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2025-07-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Information Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0306437925000663\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0306437925000663","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

过程挖掘的目的是发现、监视和改进过程。为此,流程挖掘技术使用事件数据,通常从信息系统中提取并按照流程实例进行组织。现实世界过程固有的复杂性推动了最近引入的以对象为中心的过程挖掘,允许更全面的过程视图。另一个有助于更完整过程分析的研究途径是集成非结构化数据,它可以通过提取迄今未识别的过程信息来增强传统的事件日志。尽管将以对象为中心的透视图与来自非结构化数据源的事件日志丰富相结合具有很大的潜力,但此类研究仍处于起步阶段。在此背景下,本研究提出了OCRAUD,这是一种参考体系结构,为使用非结构化数据源和传统事件日志进行以对象为中心的过程挖掘提供指导。采用设计科学的研究流程对ococaud进行设计和评价。这包括在两轮中进行总共20次专家访谈,将OCRAUD与竞争的工件进行比较,为使用视频和传感器数据实例化工件,开发软件原型,并将原型应用于现实世界的数据。这项工作通过指导非结构化数据与传统事件日志的组合,结合事件数据的以对象为中心的表示,有助于流程挖掘。实例化以视频和传感器数据为目标,从而演示了工件的使用。这使得研究人员和实践者能够为其他数据类型或特定用例实例化工件。发布的软件原型代码允许进一步开发实现的算法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Refining the process picture: Unstructured data in object-centric process mining
Process mining aims to discover, monitor, and improve processes. To this end, process mining techniques use event data, typically extracted from information systems and organized along process instances. The inherent complexity of real-world processes has driven the recent introduction of object-centric process mining, allowing for a more comprehensive view of processes. Another avenue of research contributing to more complete process analyses is integrating unstructured data, which can enhance traditional event logs by extracting hitherto unidentified process information. Although combining the object-centric perspective with event log enrichment from unstructured data sources holds promising potential, such investigation remains in its infancy. Against this background, this study presents the OCRAUD, a reference architecture that provides guidance on using unstructured data sources and traditional event logs for object-centric process mining. A design science research process was employed to design and evaluate the OCRAUD. This involved conducting a total of 20 expert interviews over two rounds, comparing the OCRAUD to competing artifacts, instantiating the artifact for the use of video and sensor data, developing a software prototype, and applying the prototype to real-world data. This work contributes to process mining by guiding the combination of unstructured data with traditional event logs, incorporating an object-centric representation of event data. The instantiation targets video and sensor data, thereby demonstrating the use of the artifact. This enables researchers and practitioners to instantiate the artifact for other data types or specific use cases. The published code of the software prototype allows for further development of the implemented algorithms.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Information Systems
Information Systems 工程技术-计算机:信息系统
CiteScore
9.40
自引率
2.70%
发文量
112
审稿时长
53 days
期刊介绍: Information systems are the software and hardware systems that support data-intensive applications. The journal Information Systems publishes articles concerning the design and implementation of languages, data models, process models, algorithms, software and hardware for information systems. Subject areas include data management issues as presented in the principal international database conferences (e.g., ACM SIGMOD/PODS, VLDB, ICDE and ICDT/EDBT) as well as data-related issues from the fields of data mining/machine learning, information retrieval coordinated with structured data, internet and cloud data management, business process management, web semantics, visual and audio information systems, scientific computing, and data science. Implementation papers having to do with massively parallel data management, fault tolerance in practice, and special purpose hardware for data-intensive systems are also welcome. Manuscripts from application domains, such as urban informatics, social and natural science, and Internet of Things, are also welcome. All papers should highlight innovative solutions to data management problems such as new data models, performance enhancements, and show how those innovations contribute to the goals of the application.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信