{"title":"基于堆叠压缩自编码器的文本分类研究","authors":"Yang Yu, Jing Hui","doi":"10.1109/EIIS.2017.8298701","DOIUrl":null,"url":null,"abstract":"In order to improve the classification effect of text classification and reduce the error rate of classification, this paper proposes the thought of text classification based on Stacked Contractive Auto-Encoder (SCAE) by stacking network layer by layer, the SCAE constitutes the depth of the neural network by unsupervised training and learning text to improve the robustness of feature extraction, the network uses the gradient descent algorithm to optimize parameters of the network. This paper adopts an improved TFIDF method calculating the weight of feature words. Though experiments, the CAE and SAE (Sparse Auto-Encoder) are compared, the support vector machine (SVM) is used to classify text. Experiment result shows that the classification performance of single layer CAE is better than that of single layer SAE, and the classification performance of SCAE is better than that of SSAE (Stacked Sparse Auto-Encoder).","PeriodicalId":434246,"journal":{"name":"2017 First International Conference on Electronics Instrumentation & Information Systems (EIIS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"A study on text classification based on stacked contractive auto-encoder\",\"authors\":\"Yang Yu, Jing Hui\",\"doi\":\"10.1109/EIIS.2017.8298701\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In order to improve the classification effect of text classification and reduce the error rate of classification, this paper proposes the thought of text classification based on Stacked Contractive Auto-Encoder (SCAE) by stacking network layer by layer, the SCAE constitutes the depth of the neural network by unsupervised training and learning text to improve the robustness of feature extraction, the network uses the gradient descent algorithm to optimize parameters of the network. This paper adopts an improved TFIDF method calculating the weight of feature words. Though experiments, the CAE and SAE (Sparse Auto-Encoder) are compared, the support vector machine (SVM) is used to classify text. Experiment result shows that the classification performance of single layer CAE is better than that of single layer SAE, and the classification performance of SCAE is better than that of SSAE (Stacked Sparse Auto-Encoder).\",\"PeriodicalId\":434246,\"journal\":{\"name\":\"2017 First International Conference on Electronics Instrumentation & Information Systems (EIIS)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 First International Conference on Electronics Instrumentation & Information Systems (EIIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/EIIS.2017.8298701\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 First International Conference on Electronics Instrumentation & Information Systems (EIIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EIIS.2017.8298701","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A study on text classification based on stacked contractive auto-encoder
In order to improve the classification effect of text classification and reduce the error rate of classification, this paper proposes the thought of text classification based on Stacked Contractive Auto-Encoder (SCAE) by stacking network layer by layer, the SCAE constitutes the depth of the neural network by unsupervised training and learning text to improve the robustness of feature extraction, the network uses the gradient descent algorithm to optimize parameters of the network. This paper adopts an improved TFIDF method calculating the weight of feature words. Though experiments, the CAE and SAE (Sparse Auto-Encoder) are compared, the support vector machine (SVM) is used to classify text. Experiment result shows that the classification performance of single layer CAE is better than that of single layer SAE, and the classification performance of SCAE is better than that of SSAE (Stacked Sparse Auto-Encoder).