Web Log Analysis Based on Hadoop Technology
Zhao Yongjian
2019 International Conference on Smart Grid and Electrical Automation (ICSGEA), published 2019-08-01
DOI: 10.1109/ICSGEA.2019.00136 (https://doi.org/10.1109/ICSGEA.2019.00136)
Citations: 0
Abstract
In the information age, extracting valuable information from massive and diverse data is increasingly important, and website log analysis has significant practical value. The Hadoop framework provides reliable, scalable, distributed processing of large data sets. In this work, the Hadoop family of frameworks is used to perform offline processing and analysis of a website access log. The number of visits made by each browser recorded in the main site's access log is counted. Access counts are also calculated and analyzed by browser kernel, operating system, terminal type, and other attributes.
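The abstract does not specify the log format or how the counting job is implemented. As a minimal sketch, assuming access logs in Combined Log Format and a simplified substring-based browser classifier (both assumptions, not taken from the paper), the per-browser counting step could be expressed in map/reduce style as below. The names `classify_browser`, `map_line`, `reduce_pairs`, and `count_browsers` are hypothetical, and the sample log lines are made up for illustration. The code runs locally in plain Python and only imitates the map, shuffle, and reduce phases that Hadoop would distribute across a cluster.

```python
import re
from itertools import groupby

# Assumption: Combined Log Format, where the user agent is the last quoted field.
UA_PATTERN = re.compile(r'"([^"]*)"\s*$')

# Hypothetical, simplified rules. Order matters: Chrome user agents also
# contain "Safari", and Edge or Opera user agents also contain "Chrome".
BROWSER_RULES = [
    ("Edg", "Edge"),
    ("OPR", "Opera"),
    ("Firefox", "Firefox"),
    ("Chrome", "Chrome"),
    ("Safari", "Safari"),
    ("MSIE", "Internet Explorer"),
    ("Trident", "Internet Explorer"),
]


def classify_browser(user_agent):
    """Return a coarse browser name for a user-agent string."""
    for token, name in BROWSER_RULES:
        if token in user_agent:
            return name
    return "Other"


def map_line(line):
    """Map step: emit (browser, 1) for one log line; emit nothing if unparsable."""
    match = UA_PATTERN.search(line.strip())
    if match:
        yield classify_browser(match.group(1)), 1


def reduce_pairs(pairs):
    """Reduce step: sort pairs by key (imitating the shuffle), then sum per key."""
    totals = {}
    for key, group in groupby(sorted(pairs), key=lambda kv: kv[0]):
        totals[key] = sum(count for _, count in group)
    return totals


def count_browsers(lines):
    """Run the map and reduce steps over an iterable of log lines."""
    pairs = [pair for line in lines for pair in map_line(line)]
    return reduce_pairs(pairs)


# Made-up sample lines for illustration only; not data from the paper.
SAMPLE_LOG = [
    '10.0.0.1 - - [01/Aug/2019:10:00:00 +0800] "GET / HTTP/1.1" 200 512 "-" '
    '"Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 '
    '(KHTML, like Gecko) Chrome/75.0.3770.142 Safari/537.36"',
    '10.0.0.2 - - [01/Aug/2019:10:00:01 +0800] "GET /a HTTP/1.1" 200 128 "-" '
    '"Mozilla/5.0 (Windows NT 6.1; rv:68.0) Gecko/20100101 Firefox/68.0"',
    '10.0.0.3 - - [01/Aug/2019:10:00:02 +0800] "GET /b HTTP/1.1" 200 256 "-" '
    '"Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 '
    '(KHTML, like Gecko) Chrome/75.0.3770.142 Safari/537.36"',
]

if __name__ == "__main__":
    print(count_browsers(SAMPLE_LOG))
```

The same pattern would extend to the other dimensions mentioned in the abstract (browser kernel, operating system, terminal type) by swapping in a different classifier in the map step. In a real deployment, a dedicated user-agent parsing library would likely be more reliable than these substring rules.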