{"title":"基于朴素贝叶斯分类器的恶意网站自动分类","authors":"Shuai Wang, Yashi Wang, Minhan Tang","doi":"10.1109/ICISCAE51034.2020.9236912","DOIUrl":null,"url":null,"abstract":"In recent years, the spread of malicious websites has had an increasingly serious impact on people's lives. To solve this problem, in this paper, we mainly focus on achieving the factor analysis of the website category and accurate identification of unknown information, so as to classify the benign and others, which help users avoid the risk of malicious websites. In the procedure, Naïve Bayes and other effective methods are used to calculate probability to test and train the model of website classification. Our result shows the response of benign and other tests well compares to others, suggesting that the Naïve Bayes method is more suitable to solve the differentiation of good websites, which accuracy can be reached to 90 percent. Besides, the training model of that classification is more accurate in that datasheet. Furthermore, if the data is more abundant and the technical bottleneck can be solved, the realization of highest accuracy of website classification using Naïve Bayes is possible.","PeriodicalId":355473,"journal":{"name":"2020 IEEE 3rd International Conference on Information Systems and Computer Aided Education (ICISCAE)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Auto Malicious Websites Classification Based on Naive Bayes Classifier\",\"authors\":\"Shuai Wang, Yashi Wang, Minhan Tang\",\"doi\":\"10.1109/ICISCAE51034.2020.9236912\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years, the spread of malicious websites has had an increasingly serious impact on people's lives. To solve this problem, in this paper, we mainly focus on achieving the factor analysis of the website category and accurate identification of unknown information, so as to classify the benign and others, which help users avoid the risk of malicious websites. In the procedure, Naïve Bayes and other effective methods are used to calculate probability to test and train the model of website classification. Our result shows the response of benign and other tests well compares to others, suggesting that the Naïve Bayes method is more suitable to solve the differentiation of good websites, which accuracy can be reached to 90 percent. Besides, the training model of that classification is more accurate in that datasheet. Furthermore, if the data is more abundant and the technical bottleneck can be solved, the realization of highest accuracy of website classification using Naïve Bayes is possible.\",\"PeriodicalId\":355473,\"journal\":{\"name\":\"2020 IEEE 3rd International Conference on Information Systems and Computer Aided Education (ICISCAE)\",\"volume\":\"35 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE 3rd International Conference on Information Systems and Computer Aided Education (ICISCAE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICISCAE51034.2020.9236912\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 3rd International Conference on Information Systems and Computer Aided Education (ICISCAE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICISCAE51034.2020.9236912","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Auto Malicious Websites Classification Based on Naive Bayes Classifier
In recent years, the spread of malicious websites has had an increasingly serious impact on people's lives. To solve this problem, in this paper, we mainly focus on achieving the factor analysis of the website category and accurate identification of unknown information, so as to classify the benign and others, which help users avoid the risk of malicious websites. In the procedure, Naïve Bayes and other effective methods are used to calculate probability to test and train the model of website classification. Our result shows the response of benign and other tests well compares to others, suggesting that the Naïve Bayes method is more suitable to solve the differentiation of good websites, which accuracy can be reached to 90 percent. Besides, the training model of that classification is more accurate in that datasheet. Furthermore, if the data is more abundant and the technical bottleneck can be solved, the realization of highest accuracy of website classification using Naïve Bayes is possible.