A feature optimization algorithm of concept similarity based on Chinese wikipedia
Xiaofei Chang, Lei Liu, Mengtao Sun, Yalu Jia, Chunxia Zhang
2017 13th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD), 2017-07-29
DOI: 10.1109/FSKD.2017.8393108
Citations: 1
Abstract
Concept similarity measures based on feature vectors are widely applied in various fields, but polysemy and synonymy among the features distort the similarity measure. We present a feature optimization algorithm based on Chinese Wikipedia that reduces this effect. We first build a part-of-speech (POS) feature dictionary (POS-Dic) and a POS Tongyici Cilin (POS-Cilin), and then use a new feature vector for the concept similarity measure. Experiments show that the algorithm effectively reduces the influence of polysemy and synonymy on the concept similarity measure.
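The abstract does not give the algorithm's details, so the following is only a minimal sketch of the general idea under stated assumptions, not the authors' method. It assumes that features are (word, POS) pairs with weights, that a POS-Cilin-like table maps synonymous pairs to a shared group key, and that similarity is computed as cosine similarity; the table entries, the function names `normalize`, `cosine`, and `concept_similarity`, and the choice of cosine are all hypothetical.

```python
import math
from collections import Counter

# Hypothetical toy stand-in for a POS-Cilin: maps a (word, POS) pair to a
# synonym-group key. Entries are invented for illustration only.
POS_CILIN = {
    ("car", "n"): "G_vehicle",
    ("automobile", "n"): "G_vehicle",
}

def normalize(features):
    """Build a feature vector keyed by synonym group or by (word, POS).

    Tagging each word with its POS keeps different senses apart (polysemy);
    mapping through POS_CILIN merges synonyms into one dimension.
    """
    vector = Counter()
    for word, pos, weight in features:
        key = POS_CILIN.get((word, pos), (word, pos))
        vector[key] += weight
    return vector

def cosine(a, b):
    """Cosine similarity of two sparse vectors stored as dict-like objects."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

def concept_similarity(features_a, features_b):
    """Similarity of two concepts after feature normalization (assumed cosine)."""
    return cosine(normalize(features_a), normalize(features_b))

# Two concepts described with synonymous nouns and a shared verb.
concept_a = [("car", "n", 1.0), ("drive", "v", 1.0)]
concept_b = [("automobile", "n", 1.0), ("drive", "v", 1.0)]
similarity = concept_similarity(concept_a, concept_b)
```

In this toy setup the two concepts share every dimension once "car" and "automobile" are merged, whereas a plain word-level comparison would treat them as different features. Conversely, the same word tagged as a noun in one concept and as a verb in the other stays in separate dimensions.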