{"title":"A new approach for unsupervised word sense disambiguation in Hindi language using graph connectivity measures","authors":"Amita Jain, D. K. Lobiyal","doi":"10.1504/IJAISC.2014.065800","DOIUrl":null,"url":null,"abstract":"Word sense disambiguation (WSD) is an important task in computational linguistics as it is essential for many language understanding applications. In this paper, we propose a graph-based unsupervised WSD method for Hindi text which disambiguates multiple ambiguous words present in the sentence simultaneously. In our approach, we first construct the semantic graph for each interpretation of the given sentence by establishing semantic relations between the pair of words present in the sentence. We use Hindi WordNet to establish semantic relations between the pair of words and then we construct the graph. We find the cost of spanning tree corresponding to each semantic graph and the interpretation for which spanning tree has the minimum cost is identified. This interpretation is considered as the resulting interpretation. Our approach also considers all open class words unlike the previous approaches which focus only on noun.","PeriodicalId":364571,"journal":{"name":"Int. J. Artif. Intell. Soft Comput.","volume":"57 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Artif. Intell. Soft Comput.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1504/IJAISC.2014.065800","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
Word sense disambiguation (WSD) is an important task in computational linguistics as it is essential for many language understanding applications. In this paper, we propose a graph-based unsupervised WSD method for Hindi text which disambiguates multiple ambiguous words present in the sentence simultaneously. In our approach, we first construct the semantic graph for each interpretation of the given sentence by establishing semantic relations between the pair of words present in the sentence. We use Hindi WordNet to establish semantic relations between the pair of words and then we construct the graph. We find the cost of spanning tree corresponding to each semantic graph and the interpretation for which spanning tree has the minimum cost is identified. This interpretation is considered as the resulting interpretation. Our approach also considers all open class words unlike the previous approaches which focus only on noun.