Eunji Lee, Jeongin Kim, Junho Choi, Chang Choi, Byeongkyu Ko, Pankoo Kim
{"title":"一种基于马尔可夫逻辑网络的文档分类语义加权方法","authors":"Eunji Lee, Jeongin Kim, Junho Choi, Chang Choi, Byeongkyu Ko, Pankoo Kim","doi":"10.1145/2663761.2664212","DOIUrl":null,"url":null,"abstract":"This paper proposes a semantic weighting method to classify textural documents. Human lives in the world where web documents have a great potential and the amount of valuable information has been consistently growing over the year. There is a problem that finding relevant web documents corresponding to what users want is more difficult due to the huge amount of web size. For this reason, there have been many researchers overcome this problem. The most important thing is document classification. All documents are composed of numerous words. Many classification methods have been extracted keywords from documents and then analyzed keywords pattern or frequency. In this paper, we propose Category Term Weight (CTW) using keywords from documents in order to enhance performance in document classification. CTW combines keywords frequency with semantic information. The frequency and semantic information have a great potential to find similarities between documents. That is why we calculates CTW from collection of training documents. After this step, CTW from unknown document and CTW in previous Category Term Database will be applied by designed Markov Logic Networks Model. Our designed MLNs Model and existing Naive-bayse Model will be compared by applied CTW. The experimental results shows the improvement of precision compare with the existing model.","PeriodicalId":120340,"journal":{"name":"Research in Adaptive and Convergent Systems","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A semantic weighting method for document classification based on Markov logic networks\",\"authors\":\"Eunji Lee, Jeongin Kim, Junho Choi, Chang Choi, Byeongkyu Ko, Pankoo Kim\",\"doi\":\"10.1145/2663761.2664212\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes a semantic weighting method to classify textural documents. Human lives in the world where web documents have a great potential and the amount of valuable information has been consistently growing over the year. There is a problem that finding relevant web documents corresponding to what users want is more difficult due to the huge amount of web size. For this reason, there have been many researchers overcome this problem. The most important thing is document classification. All documents are composed of numerous words. Many classification methods have been extracted keywords from documents and then analyzed keywords pattern or frequency. In this paper, we propose Category Term Weight (CTW) using keywords from documents in order to enhance performance in document classification. CTW combines keywords frequency with semantic information. The frequency and semantic information have a great potential to find similarities between documents. That is why we calculates CTW from collection of training documents. After this step, CTW from unknown document and CTW in previous Category Term Database will be applied by designed Markov Logic Networks Model. Our designed MLNs Model and existing Naive-bayse Model will be compared by applied CTW. The experimental results shows the improvement of precision compare with the existing model.\",\"PeriodicalId\":120340,\"journal\":{\"name\":\"Research in Adaptive and Convergent Systems\",\"volume\":\"60 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-10-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Research in Adaptive and Convergent Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2663761.2664212\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Research in Adaptive and Convergent Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2663761.2664212","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A semantic weighting method for document classification based on Markov logic networks
This paper proposes a semantic weighting method to classify textural documents. Human lives in the world where web documents have a great potential and the amount of valuable information has been consistently growing over the year. There is a problem that finding relevant web documents corresponding to what users want is more difficult due to the huge amount of web size. For this reason, there have been many researchers overcome this problem. The most important thing is document classification. All documents are composed of numerous words. Many classification methods have been extracted keywords from documents and then analyzed keywords pattern or frequency. In this paper, we propose Category Term Weight (CTW) using keywords from documents in order to enhance performance in document classification. CTW combines keywords frequency with semantic information. The frequency and semantic information have a great potential to find similarities between documents. That is why we calculates CTW from collection of training documents. After this step, CTW from unknown document and CTW in previous Category Term Database will be applied by designed Markov Logic Networks Model. Our designed MLNs Model and existing Naive-bayse Model will be compared by applied CTW. The experimental results shows the improvement of precision compare with the existing model.