{"title":"web上的正则表达式","authors":"Renáta Hodován, Zoltán Herczeg, Ákos Kiss","doi":"10.1109/WSE.2010.5623572","DOIUrl":null,"url":null,"abstract":"On the web, static pages fade into the past; web sites use server and client-side scripting techniques to improve the user experience. For client-side scripting, JavaScript is by far the most commonly used dynamic language. In these scripts, regular expressions are widely used for several purposes, e.g., for text filtering and form validation. In this paper, we take a closer look at regular expressions on the web. We investigate historical data and determine the trends of the use of regular expressions over time on various web sites. Then, we also take a snapshot of the current status. We find that although regular expressions are being used more and more often, their use is highly repetitive: on the most popular web sites only 4% of the regular expressions are unique. Based on this result, we show that web browsers can cut down regular expression compilation time to the third by applying caching techniques.","PeriodicalId":231763,"journal":{"name":"2010 12th IEEE International Symposium on Web Systems Evolution (WSE)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Regular expressions on the web\",\"authors\":\"Renáta Hodován, Zoltán Herczeg, Ákos Kiss\",\"doi\":\"10.1109/WSE.2010.5623572\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"On the web, static pages fade into the past; web sites use server and client-side scripting techniques to improve the user experience. For client-side scripting, JavaScript is by far the most commonly used dynamic language. In these scripts, regular expressions are widely used for several purposes, e.g., for text filtering and form validation. In this paper, we take a closer look at regular expressions on the web. We investigate historical data and determine the trends of the use of regular expressions over time on various web sites. Then, we also take a snapshot of the current status. We find that although regular expressions are being used more and more often, their use is highly repetitive: on the most popular web sites only 4% of the regular expressions are unique. Based on this result, we show that web browsers can cut down regular expression compilation time to the third by applying caching techniques.\",\"PeriodicalId\":231763,\"journal\":{\"name\":\"2010 12th IEEE International Symposium on Web Systems Evolution (WSE)\",\"volume\":\"41 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-11-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 12th IEEE International Symposium on Web Systems Evolution (WSE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WSE.2010.5623572\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 12th IEEE International Symposium on Web Systems Evolution (WSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WSE.2010.5623572","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
On the web, static pages fade into the past; web sites use server and client-side scripting techniques to improve the user experience. For client-side scripting, JavaScript is by far the most commonly used dynamic language. In these scripts, regular expressions are widely used for several purposes, e.g., for text filtering and form validation. In this paper, we take a closer look at regular expressions on the web. We investigate historical data and determine the trends of the use of regular expressions over time on various web sites. Then, we also take a snapshot of the current status. We find that although regular expressions are being used more and more often, their use is highly repetitive: on the most popular web sites only 4% of the regular expressions are unique. Based on this result, we show that web browsers can cut down regular expression compilation time to the third by applying caching techniques.