Process Discovery Enhancement with Trace Clustering and Profiling

Q2 Computer Science

Annals of Emerging Technologies in Computing Pub Date : 2021-10-01 DOI:10.33166/aetic.2021.04.001

M. Faizan, M. Zuhairi, S. Ismail

{"title":"Process Discovery Enhancement with Trace Clustering and Profiling","authors":"M. Faizan, M. Zuhairi, S. Ismail","doi":"10.33166/aetic.2021.04.001","DOIUrl":null,"url":null,"abstract":"The potential in process mining is progressively growing due to the increasing amount of event-data. Process mining strategies use event-logs to automatically classify process models, recommend improvements, predict processing times, check conformance, and recognize anomalies/deviations and bottlenecks. However, proper handling of event-logs while evaluating and using them as input is crucial to any process mining technique. When process mining techniques are applied to flexible systems with a large number of decisions to take at runtime, the outcome is often unstructured or semi-structured process models that are hard to comprehend. Existing approaches are good at discovering and visualizing structured processes but often struggle with less structured ones. Surprisingly, process mining is most useful in domains where flexibility is desired. A good illustration is the \"patient treatment\" process in a hospital, where the ability to deviate from dealing with changing conditions is crucial. It is useful to have insights into actual operations. However, there is a significant amount of diversity, which contributes to complicated, difficult-to-understand models. Trace clustering is a method for decreasing the complexity of process models in this context while also increasing their comprehensibility and accuracy. This paper discusses process mining, event-logs, and presenting a clustering approach to pre-process event-logs, i.e., a homogeneous subset of the event-log is created. A process model is generated for each subset. These homogeneous subsets are then evaluated independently from each other, which significantly improving the quality of mining results in flexible environments. The presented approach improves the fitness and precision of a discovered model while reducing its complexity, resulting in well-structured and easily understandable process discovery results.","PeriodicalId":36440,"journal":{"name":"Annals of Emerging Technologies in Computing","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of Emerging Technologies in Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.33166/aetic.2021.04.001","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Computer Science","Score":null,"Total":0}

引用次数: 2

Abstract

The potential in process mining is progressively growing due to the increasing amount of event-data. Process mining strategies use event-logs to automatically classify process models, recommend improvements, predict processing times, check conformance, and recognize anomalies/deviations and bottlenecks. However, proper handling of event-logs while evaluating and using them as input is crucial to any process mining technique. When process mining techniques are applied to flexible systems with a large number of decisions to take at runtime, the outcome is often unstructured or semi-structured process models that are hard to comprehend. Existing approaches are good at discovering and visualizing structured processes but often struggle with less structured ones. Surprisingly, process mining is most useful in domains where flexibility is desired. A good illustration is the "patient treatment" process in a hospital, where the ability to deviate from dealing with changing conditions is crucial. It is useful to have insights into actual operations. However, there is a significant amount of diversity, which contributes to complicated, difficult-to-understand models. Trace clustering is a method for decreasing the complexity of process models in this context while also increasing their comprehensibility and accuracy. This paper discusses process mining, event-logs, and presenting a clustering approach to pre-process event-logs, i.e., a homogeneous subset of the event-log is created. A process model is generated for each subset. These homogeneous subsets are then evaluated independently from each other, which significantly improving the quality of mining results in flexible environments. The presented approach improves the fitness and precision of a discovered model while reducing its complexity, resulting in well-structured and easily understandable process discovery results.

查看原文本刊更多论文

使用跟踪聚类和分析增强进程发现

由于事件数据量的增加，过程挖掘的潜力正在逐步增长。流程挖掘策略使用事件日志自动对流程模型进行分类、提出改进建议、预测处理时间、检查一致性以及识别异常/偏差和瓶颈。然而，在评估和使用事件日志作为输入时，正确处理事件日志对于任何流程挖掘技术都是至关重要的。当将流程挖掘技术应用于运行时需要做出大量决策的灵活系统时，结果通常是难以理解的非结构化或半结构化流程模型。现有的方法善于发现和可视化结构化的过程，但常常与不那么结构化的过程作斗争。令人惊讶的是，过程挖掘在需要灵活性的领域中最有用。一个很好的例子是医院的“病人治疗”过程，在这个过程中，应对变化的条件的能力是至关重要的。了解实际操作是有用的。然而，存在大量的多样性，这导致了复杂的、难以理解的模型。在这种情况下，跟踪聚类是一种降低过程模型复杂性的方法，同时也提高了它们的可理解性和准确性。本文讨论了过程挖掘、事件日志，并提出了一种预处理事件日志的聚类方法，即创建事件日志的同构子集。为每个子集生成一个流程模型。然后，这些同质子集相互独立地进行评估，从而显着提高了灵活环境下挖掘结果的质量。该方法提高了发现模型的适应度和精度，同时降低了模型的复杂性，得到结构良好且易于理解的过程发现结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Annals of Emerging Technologies in Computing Computer Science-Computer Science (all)

CiteScore

3.50

自引率

0.00%

发文量