{"title":"一种基于改进ECCD和类别权重的文档分类系统","authors":"Chungseok Han, Sang-Yong Park, Soowon Lee","doi":"10.3745/KIPSTB.2012.19B.4.237","DOIUrl":null,"url":null,"abstract":"Web information service needs a document classification system for efficient management and conveniently searches. Existing document classification systems have a problem of low accuracy in classification, if a few number of feature words is selected in documents or if the number of documents that belong to a specific category is excessively large. To solve this problem, we propose a document classification system using `Modified ECCD` feature selection method and `Category Weight for each Document`. Experimental results show that the `Modified ECCD` feature selection method has higher accuracy in classification than and the ECCD method. Moreover, combining the `Category Weight for each Document` feature value and `Modified ECCD` feature selection method results better accuracy in classification.","PeriodicalId":122700,"journal":{"name":"The Kips Transactions:partb","volume":"52 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"A Document Classification System Using Modified ECCD and Category Weight for each Document\",\"authors\":\"Chungseok Han, Sang-Yong Park, Soowon Lee\",\"doi\":\"10.3745/KIPSTB.2012.19B.4.237\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Web information service needs a document classification system for efficient management and conveniently searches. Existing document classification systems have a problem of low accuracy in classification, if a few number of feature words is selected in documents or if the number of documents that belong to a specific category is excessively large. To solve this problem, we propose a document classification system using `Modified ECCD` feature selection method and `Category Weight for each Document`. Experimental results show that the `Modified ECCD` feature selection method has higher accuracy in classification than and the ECCD method. Moreover, combining the `Category Weight for each Document` feature value and `Modified ECCD` feature selection method results better accuracy in classification.\",\"PeriodicalId\":122700,\"journal\":{\"name\":\"The Kips Transactions:partb\",\"volume\":\"52 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The Kips Transactions:partb\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3745/KIPSTB.2012.19B.4.237\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Kips Transactions:partb","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3745/KIPSTB.2012.19B.4.237","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Document Classification System Using Modified ECCD and Category Weight for each Document
Web information service needs a document classification system for efficient management and conveniently searches. Existing document classification systems have a problem of low accuracy in classification, if a few number of feature words is selected in documents or if the number of documents that belong to a specific category is excessively large. To solve this problem, we propose a document classification system using `Modified ECCD` feature selection method and `Category Weight for each Document`. Experimental results show that the `Modified ECCD` feature selection method has higher accuracy in classification than and the ECCD method. Moreover, combining the `Category Weight for each Document` feature value and `Modified ECCD` feature selection method results better accuracy in classification.