Computing Semantic Similarities Based on Machine-Readable Dictionaries
Hui Liu, Jinglei Zhao, R. Lu
IEEE International Workshop on Semantic Computing and Systems, 2008-07-14
DOI: 10.1109/WSCS.2008.9 (https://doi.org/10.1109/WSCS.2008.9)
Citations: 6
Abstract
Measuring semantic similarity is foundational to semantic computing. This paper studies how to measure the similarity between two words. Unlike previous work, it proposes a method that relies on machine-readable dictionaries, which are more widely available than other kinds of lexical resources. The underlying assumption is that two words with similar definitions are semantically similar. Each definition is represented by a definition vector in which each dimension corresponds to a word in the dictionary, and the score of each dimension is calculated by a variation of tf*idf. Evaluations show that the method achieves competitive results in both Chinese and English.
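The definition-vector idea can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not specify the tf*idf variation or how two vectors are compared, so this sketch assumes plain tf*idf weights and cosine similarity. The toy dictionary and the names `TOY_DICTIONARY`, `build_definition_vectors`, and `cosine` are hypothetical and exist only for illustration.

```python
import math
from collections import Counter

# Toy dictionary invented for illustration; a real machine-readable
# dictionary would supply the headwords and definitions.
TOY_DICTIONARY = {
    "car": "a road vehicle with an engine and four wheels",
    "automobile": "a road vehicle with an engine used to carry people",
    "banana": "a long curved yellow fruit",
}


def tokenize(text):
    """Lowercase whitespace tokenizer (an assumption; the paper may differ)."""
    return text.lower().split()


def build_definition_vectors(dictionary):
    """Map each headword to a sparse {word: weight} definition vector.

    Weights are plain tf*idf, with each definition treated as one document.
    This stands in for the paper's unspecified tf*idf variation.
    """
    tokenized = {head: tokenize(text) for head, text in dictionary.items()}
    n_defs = len(tokenized)

    # Document frequency: number of definitions that contain each word.
    doc_freq = Counter()
    for tokens in tokenized.values():
        doc_freq.update(set(tokens))

    vectors = {}
    for head, tokens in tokenized.items():
        counts = Counter(tokens)
        vectors[head] = {
            word: (count / len(tokens)) * math.log(n_defs / doc_freq[word])
            for word, count in counts.items()
        }
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * v.get(word, 0.0) for word, weight in u.items())
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


vectors = build_definition_vectors(TOY_DICTIONARY)
sim_synonyms = cosine(vectors["car"], vectors["automobile"])
sim_unrelated = cosine(vectors["car"], vectors["banana"])
print(sim_synonyms > sim_unrelated)
```

In this toy setup, "car" and "automobile" share several definition words and therefore score higher than "car" and "banana", whose only shared word appears in every definition and so receives zero idf weight.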