Z. Noori, Keeley A. Crockett, Z. Bandar, Mohammed Al-Mousa
{"title":"An Arabic Word Similarity Measure for Semantic Conversational Agents","authors":"Z. Noori, Keeley A. Crockett, Z. Bandar, Mohammed Al-Mousa","doi":"10.1109/ASAR.2018.8480252","DOIUrl":null,"url":null,"abstract":"Word similarity measures are used to measure the semantic relatedness between two words. Whereas traditional English measures exist, relatively little research has been undertaken in developing such measures for Modern Standard Arabic largely due to the linguistic challenges of the language. Domain coverage is also an issue when looking to select the best measure for incorporation into a semantic conversational agent. The information source used within the measure should be general yet capable of dealing with domain specific language to ensure robust and appropriate responses. This paper proposes a word similarity measure that utilises the length, and depth of the words from within a domain specific lexical tree that is used as the information source. The measure is compared with an existing Arabic word similarity measure through evaluation on a generic published dataset and the results show the new measure gives high correlation with human ratings.","PeriodicalId":165564,"journal":{"name":"2018 IEEE 2nd International Workshop on Arabic and Derived Script Analysis and Recognition (ASAR)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 2nd International Workshop on Arabic and Derived Script Analysis and Recognition (ASAR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ASAR.2018.8480252","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Word similarity measures are used to measure the semantic relatedness between two words. Whereas traditional English measures exist, relatively little research has been undertaken in developing such measures for Modern Standard Arabic largely due to the linguistic challenges of the language. Domain coverage is also an issue when looking to select the best measure for incorporation into a semantic conversational agent. The information source used within the measure should be general yet capable of dealing with domain specific language to ensure robust and appropriate responses. This paper proposes a word similarity measure that utilises the length, and depth of the words from within a domain specific lexical tree that is used as the information source. The measure is compared with an existing Arabic word similarity measure through evaluation on a generic published dataset and the results show the new measure gives high correlation with human ratings.