{"title":"Modified algor ithm for searching regularities in large dimensional data based on genetic optimization","authors":"V. Bova, D. Leshchanov","doi":"10.34219/2078-8320-2021-12-3-67-72","DOIUrl":null,"url":null,"abstract":"A method of searching for patterns in sequences of events is proposed for detecting hidden patterns in largedimensional data when performing information retrieval tasks, based on the theory of sequential patterns. Searching for sequential patterns is a complex computational task whose goal is to retrieve all frequent sequences representing potential relationships within elements from a transactional database of sequences of search activity events with a given minimum support. To increase the computational efficiency of the method, a modified algorithm for generating sequential patterns has been developed, at the first stage of which AprioriAll is performed, which forms frequent candidate sequences of all possible lengths, and at the second stage, a genetic algorithm for optimizing the input parameters of the feature space of the generated set to search for maximum patterns.","PeriodicalId":299496,"journal":{"name":"Informatization and communication","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Informatization and communication","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.34219/2078-8320-2021-12-3-67-72","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
A method of searching for patterns in sequences of events is proposed for detecting hidden patterns in largedimensional data when performing information retrieval tasks, based on the theory of sequential patterns. Searching for sequential patterns is a complex computational task whose goal is to retrieve all frequent sequences representing potential relationships within elements from a transactional database of sequences of search activity events with a given minimum support. To increase the computational efficiency of the method, a modified algorithm for generating sequential patterns has been developed, at the first stage of which AprioriAll is performed, which forms frequent candidate sequences of all possible lengths, and at the second stage, a genetic algorithm for optimizing the input parameters of the feature space of the generated set to search for maximum patterns.