{"title":"Modified Path Measure to Assess Sentence Similarity","authors":"M. K. Prasad, Poonam Sharma","doi":"10.1109/INFOCOMTECH.2018.8722419","DOIUrl":null,"url":null,"abstract":"Sentence similarity can be calculated by various measures, but the measures that can use semantic information between the words perform better compared to others. Out of these, the measures which are related to a corpus, or which are knowledge related are more significant in the sentence similarity domains. The applications which use knowledge based measures tend to give more accurate results and are very much in coincidence with human similarity. These measures use path length between the concepts or information content between the words to derive the similarity between the words. Some of the semantic similarity measures generate synonym sets of the words to evaluate similarity, but these measures focus mainly on generating noun or verb synonym sets which can be enhanced by generating all the synonym sets. In this paper, a metric PathM is proposed to calculate word-pair similarity by generating all the synonym sets of the words and it is enhanced to calculate the sentence similarity. The measure is compared with path measure and the results obtained are better in comparison with other measures.","PeriodicalId":175757,"journal":{"name":"2018 Conference on Information and Communication Technology (CICT)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Conference on Information and Communication Technology (CICT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INFOCOMTECH.2018.8722419","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Sentence similarity can be calculated by various measures, but the measures that can use semantic information between the words perform better compared to others. Out of these, the measures which are related to a corpus, or which are knowledge related are more significant in the sentence similarity domains. The applications which use knowledge based measures tend to give more accurate results and are very much in coincidence with human similarity. These measures use path length between the concepts or information content between the words to derive the similarity between the words. Some of the semantic similarity measures generate synonym sets of the words to evaluate similarity, but these measures focus mainly on generating noun or verb synonym sets which can be enhanced by generating all the synonym sets. In this paper, a metric PathM is proposed to calculate word-pair similarity by generating all the synonym sets of the words and it is enhanced to calculate the sentence similarity. The measure is compared with path measure and the results obtained are better in comparison with other measures.