{"title":"Algorithm Improvement of Vocabulary Semantic Similarity with HowNet","authors":"Y. Qu, Tie-Zhu Yang, Min Wang","doi":"10.1145/3036331.3036333","DOIUrl":null,"url":null,"abstract":"The vocabulary semantic similarity calculation is one of the key technologies of natural language processing, and it has been widely used in many fields such as information extraction and text classification. So how to effectively acquire the information relationships is the core issue in the study of data mining and knowledge organization.This research puts forward the concepts of semantic correlation degree and the same semantic ratio according to the semantic system structure of HowNet, and designs a computation method of vocabulary semantic similarity based on semantic distance and the degree of semantic association. The method not only considers the effect of sememes' relation to semantic similarity, but also takes into account the lexical semantic correlation degree to semantic similarity. Experimental results prove that the improved algorithm effectively improved the veracity and accuracy of semantic similarity computation method, which makes the consequence satisfy people's subjective cognition and more reasonable.","PeriodicalId":22356,"journal":{"name":"Tenth International Conference on Computer Modeling and Simulation (uksim 2008)","volume":"1 1","pages":"114-118"},"PeriodicalIF":0.0000,"publicationDate":"2017-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Tenth International Conference on Computer Modeling and Simulation (uksim 2008)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3036331.3036333","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
The vocabulary semantic similarity calculation is one of the key technologies of natural language processing, and it has been widely used in many fields such as information extraction and text classification. So how to effectively acquire the information relationships is the core issue in the study of data mining and knowledge organization.This research puts forward the concepts of semantic correlation degree and the same semantic ratio according to the semantic system structure of HowNet, and designs a computation method of vocabulary semantic similarity based on semantic distance and the degree of semantic association. The method not only considers the effect of sememes' relation to semantic similarity, but also takes into account the lexical semantic correlation degree to semantic similarity. Experimental results prove that the improved algorithm effectively improved the veracity and accuracy of semantic similarity computation method, which makes the consequence satisfy people's subjective cognition and more reasonable.