Hannes Obweger, Martin Suntinger, Josef Schiefer, G. Raidl
{"title":"复杂事件序列的相似性搜索","authors":"Hannes Obweger, Martin Suntinger, Josef Schiefer, G. Raidl","doi":"10.1109/RCIS.2010.5507423","DOIUrl":null,"url":null,"abstract":"In this paper we present a generic similarity model for time-stamped sequences of complex business events. It builds upon the idea of deriving similarity from deviations between the pattern sequence and its best-possible representation in the candidate sequence. Which representation is considered optimal solely depends on the analyst's current focus and interest; the model thus foresees highest configurability to adequately balance aspects such as single-event similarities, order, timing, and missing events. The model is furthermore applicable for both sub-sequence searching and full-sequence matching. As an extension to the base model, we discuss enhanced patternmodeling facilities, e.g., to ensure a maximal time interval between two or more candidate events. The proposed tree-search algorithm allows for a seamless integration of such extensions.","PeriodicalId":333366,"journal":{"name":"2010 Fourth International Conference on Research Challenges in Information Science (RCIS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":"{\"title\":\"Similarity searching in sequences of complex events\",\"authors\":\"Hannes Obweger, Martin Suntinger, Josef Schiefer, G. Raidl\",\"doi\":\"10.1109/RCIS.2010.5507423\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper we present a generic similarity model for time-stamped sequences of complex business events. It builds upon the idea of deriving similarity from deviations between the pattern sequence and its best-possible representation in the candidate sequence. Which representation is considered optimal solely depends on the analyst's current focus and interest; the model thus foresees highest configurability to adequately balance aspects such as single-event similarities, order, timing, and missing events. The model is furthermore applicable for both sub-sequence searching and full-sequence matching. As an extension to the base model, we discuss enhanced patternmodeling facilities, e.g., to ensure a maximal time interval between two or more candidate events. The proposed tree-search algorithm allows for a seamless integration of such extensions.\",\"PeriodicalId\":333366,\"journal\":{\"name\":\"2010 Fourth International Conference on Research Challenges in Information Science (RCIS)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-05-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"15\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 Fourth International Conference on Research Challenges in Information Science (RCIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/RCIS.2010.5507423\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 Fourth International Conference on Research Challenges in Information Science (RCIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RCIS.2010.5507423","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Similarity searching in sequences of complex events
In this paper we present a generic similarity model for time-stamped sequences of complex business events. It builds upon the idea of deriving similarity from deviations between the pattern sequence and its best-possible representation in the candidate sequence. Which representation is considered optimal solely depends on the analyst's current focus and interest; the model thus foresees highest configurability to adequately balance aspects such as single-event similarities, order, timing, and missing events. The model is furthermore applicable for both sub-sequence searching and full-sequence matching. As an extension to the base model, we discuss enhanced patternmodeling facilities, e.g., to ensure a maximal time interval between two or more candidate events. The proposed tree-search algorithm allows for a seamless integration of such extensions.