Hannes Obweger, Martin Suntinger, Josef Schiefer, G. Raidl
{"title":"Similarity searching in sequences of complex events","authors":"Hannes Obweger, Martin Suntinger, Josef Schiefer, G. Raidl","doi":"10.1109/RCIS.2010.5507423","DOIUrl":null,"url":null,"abstract":"In this paper we present a generic similarity model for time-stamped sequences of complex business events. It builds upon the idea of deriving similarity from deviations between the pattern sequence and its best-possible representation in the candidate sequence. Which representation is considered optimal solely depends on the analyst's current focus and interest; the model thus foresees highest configurability to adequately balance aspects such as single-event similarities, order, timing, and missing events. The model is furthermore applicable for both sub-sequence searching and full-sequence matching. As an extension to the base model, we discuss enhanced patternmodeling facilities, e.g., to ensure a maximal time interval between two or more candidate events. The proposed tree-search algorithm allows for a seamless integration of such extensions.","PeriodicalId":333366,"journal":{"name":"2010 Fourth International Conference on Research Challenges in Information Science (RCIS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 Fourth International Conference on Research Challenges in Information Science (RCIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RCIS.2010.5507423","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 15
Abstract
In this paper we present a generic similarity model for time-stamped sequences of complex business events. It builds upon the idea of deriving similarity from deviations between the pattern sequence and its best-possible representation in the candidate sequence. Which representation is considered optimal solely depends on the analyst's current focus and interest; the model thus foresees highest configurability to adequately balance aspects such as single-event similarities, order, timing, and missing events. The model is furthermore applicable for both sub-sequence searching and full-sequence matching. As an extension to the base model, we discuss enhanced patternmodeling facilities, e.g., to ensure a maximal time interval between two or more candidate events. The proposed tree-search algorithm allows for a seamless integration of such extensions.