Web Scrapping Tools and Techniques: A Brief Survey
Ruchitaa Raj N R, Nandhakumar Raj S, V. M
2023 4th International Conference on Innovative Trends in Information Technology (ICITIIT), published 2023-02-11
DOI: 10.1109/ICITIIT57246.2023.10068666 (https://doi.org/10.1109/ICITIIT57246.2023.10068666)
Citations: 1
Abstract
Web scraping can be done in many languages, such as C++, Java, JavaScript, PHP, Python, and Ruby. Among them, Python stands out as the most powerful, with many built-in libraries that support web scraping, extensive support for third-party open-source libraries, and higher speeds than other languages. Python libraries for web scraping are designed for fast and highly accurate data extraction. Many such libraries are available, and developers can choose the one that best suits their scraping application. This paper studies several web scraping tools and techniques, analyzes the performance of those tools, and presents the statistical significance of the results.
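
As a rough illustration of what "built-in libraries that support web scraping" can mean in practice, the sketch below extracts links from an HTML document using only `html.parser` from the Python standard library. It is not taken from the paper, which does not name its tools in the abstract. The names `LinkExtractor`, `extract_links`, and `SAMPLE_HTML` are hypothetical, and the sample markup is invented so that the example runs without network access.

```python
from html.parser import HTMLParser


class LinkExtractor(HTMLParser):
    """Collect (href, text) pairs for every <a href=...> element (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.links = []     # finished (href, text) pairs
        self._href = None   # href of the <a> currently open, if any
        self._text = []     # text chunks seen inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            # attrs is a list of (name, value) pairs; anchors without href are skipped
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def extract_links(html):
    """Parse an HTML string and return its links as (href, text) pairs."""
    parser = LinkExtractor()
    parser.feed(html)
    parser.close()
    return parser.links


# Invented sample page, standing in for a downloaded document.
SAMPLE_HTML = """
<html><body>
  <h1>Products</h1>
  <ul>
    <li><a href="/item/1">Widget</a></li>
    <li><a href="/item/2">Gadget <b>Pro</b></a></li>
  </ul>
</body></html>
"""

links = extract_links(SAMPLE_HTML)
for href, text in links:
    print(href, text)
```

In a real scraper the HTML string would come from an HTTP response rather than a constant, and a third-party parser may be a better fit for malformed markup; the choice depends on the application, as the abstract notes.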