{"title":"Parallel Matching of Regular Expressions with BSP Automata","authors":"Thibaut Tachon","doi":"10.1109/HPCS48598.2019.9188181","DOIUrl":null,"url":null,"abstract":"Regular expression matching is a core component of many applications including patterns search in text, deep inspection of packet or lexical analysis. Sequential regular expression matching lacks efficiency for large amount of data whereas parallel regular expression matching overhead requires a large number of processors to become negligible. This paper presents a transformation from regular expression (RE) into a parallel form named BSP regular expression (BSPRE). This transformation added to the transformation from BSPRE to parallel automata (BSPA) enable the parallel matching of regular expression. We compare this approach to enumeration method and observe substantial improvement for small number of processors. The automatic transformation from RE to BSPA through BSPRE is the first example of an infinite family of BSP programs that can be generated automatically and that are not simple specializations of a finite library.","PeriodicalId":371856,"journal":{"name":"2019 International Conference on High Performance Computing & Simulation (HPCS)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on High Performance Computing & Simulation (HPCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HPCS48598.2019.9188181","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Regular expression matching is a core component of many applications including patterns search in text, deep inspection of packet or lexical analysis. Sequential regular expression matching lacks efficiency for large amount of data whereas parallel regular expression matching overhead requires a large number of processors to become negligible. This paper presents a transformation from regular expression (RE) into a parallel form named BSP regular expression (BSPRE). This transformation added to the transformation from BSPRE to parallel automata (BSPA) enable the parallel matching of regular expression. We compare this approach to enumeration method and observe substantial improvement for small number of processors. The automatic transformation from RE to BSPA through BSPRE is the first example of an infinite family of BSP programs that can be generated automatically and that are not simple specializations of a finite library.