Authors: Zhiquan Wang, Zhiyi Qu
Venue: 2017 IEEE 17th International Conference on Communication Technology (ICCT)
Publication date: 2017-10-01
DOI: 10.1109/ICCT.2017.8359971 (https://doi.org/10.1109/ICCT.2017.8359971)
Research on Web text classification algorithm based on improved CNN and SVM
Web text classification is a research focus and core technology in Web information retrieval and data mining, and it has attracted wide attention and developed rapidly in recent years. The convolutional neural network (CNN), a deep learning model, can extract features from text data accurately while reducing model complexity. The support vector machine (SVM) is an effective and stable traditional machine learning algorithm. Drawing on the characteristics of both, this paper proposes a new method of Web text classification based on an improved CNN and SVM: a CNN with a five-layer network structure extracts the text features, and an SVM then performs classification and prediction. The authors report excellent results on a mixed text data set.
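The two-stage idea in the abstract (convolutional feature extraction followed by an SVM classifier) can be illustrated with a minimal sketch. It is not the authors' five-layer architecture, which the abstract does not detail. It assumes a single convolution layer with ReLU and max-over-time pooling, random untrained embeddings and filters, a linear SVM fitted by plain subgradient descent on the hinge loss, and a made-up four-document toy corpus. All function names and parameters here are hypothetical.

```python
import random

def build_embeddings(vocab, dim, rng):
    """Give each token a random vector (a stand-in for learned embeddings)."""
    return {tok: [rng.uniform(-1, 1) for _ in range(dim)] for tok in vocab}

def conv_max_features(tokens, emb, filters, width):
    """One convolution layer with ReLU, then max-over-time pooling."""
    dim = len(next(iter(emb.values())))
    rows = [emb[t] for t in tokens if t in emb]
    while len(rows) < width:          # zero-pad documents shorter than a window
        rows.append([0.0] * dim)
    feats = []
    for filt in filters:
        best = 0.0                    # ReLU floor: activations never drop below 0
        for start in range(len(rows) - width + 1):
            window = [v for row in rows[start:start + width] for v in row]
            best = max(best, sum(f * v for f, v in zip(filt, window)))
        feats.append(best)
    return feats

def train_linear_svm(X, y, epochs=200, lr=0.1, lam=0.01):
    """Linear SVM via subgradient descent on L2-regularized hinge loss."""
    w, b = [0.0] * len(X[0]), 0.0
    for _ in range(epochs):
        for x, label in zip(X, y):
            margin = label * (sum(wi * xi for wi, xi in zip(w, x)) + b)
            w = [wi * (1 - lr * lam) for wi in w]      # regularization shrink
            if margin < 1:                             # hinge loss is active
                w = [wi + lr * label * xi for wi, xi in zip(w, x)]
                b += lr * label
    return w, b

def predict(w, b, x):
    """Return +1 or -1 depending on the side of the separating hyperplane."""
    return 1 if sum(wi * xi for wi, xi in zip(w, x)) + b >= 0 else -1

# Hypothetical toy corpus: +1 = travel pages, -1 = sports pages.
docs = [
    ("cheap flights and hotel deals".split(), 1),
    ("book hotel rooms near the airport".split(), 1),
    ("the team won the football match".split(), -1),
    ("players scored in the final match".split(), -1),
]

rng = random.Random(0)                # fixed seed so runs are repeatable
dim, width, n_filters = 8, 2, 16      # illustrative sizes, not from the paper
vocab = sorted({t for toks, _ in docs for t in toks})
emb = build_embeddings(vocab, dim, rng)
filters = [[rng.uniform(-1, 1) for _ in range(width * dim)]
           for _ in range(n_filters)]

# Stage 1: convolutional features. Stage 2: SVM on those features.
X = [conv_max_features(toks, emb, filters, width) for toks, _ in docs]
y = [label for _, label in docs]
w, b = train_linear_svm(X, y)
preds = [predict(w, b, x) for x in X]
print(list(zip(y, preds)))
```

In a realistic setting the convolution filters would be trained, and the SVM would typically come from an established library rather than a hand-written loop. The sketch only shows how pooled convolutional activations become the input vector of the SVM.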