Title: An Efficient Method for Biomedical Word Sense Disambiguation Based on Web-Kernel Similarity
Authors: Mohammed Rais, M. Bekkali, Abdelmonaime Lachkar
Journal: Int. J. Heal. Inf. Syst. Informatics, 16 1
Publication date: 2021-10-01
Publication type: Journal Article
DOI: 10.4018/IJHISI.20211001.OA9 (https://doi.org/10.4018/IJHISI.20211001.OA9)
Citation count: 0
Abstract
Selecting the correct sense of a polysemous word remains one of the greatest challenges in representing biomedical text. Word sense disambiguation (WSD) algorithms mostly rely on an external knowledge source, such as a thesaurus or ontology, to automatically select the proper concept for an ambiguous term within a given context window, using semantic similarity and relatedness measures. In this paper, the authors propose a web-based kernel function that measures the semantic relatedness between concepts in order to disambiguate an expression against multiple candidate concepts. The measure uses the large volume of documents returned by the PubMed search engine to build a broader context for a short biomedical text, through a new term weighting scheme based on rough set theory (RST). To illustrate the efficiency of the proposed method, the authors evaluate a WSD algorithm based on this measure on a biomedical dataset (MSH-WSD) that contains 203 ambiguous terms and acronyms. The results show promising improvements.
Keywords: Biomedical Word Sense Disambiguation, Conceptualization, Context Concept, MSH-WSD, Rough Set Theory, Short Text Similarity
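The general idea of a web-kernel relatedness measure can be illustrated with a small sketch. It is not the authors' implementation: plain term-frequency weighting stands in for the RST-based weighting scheme, which the abstract does not specify, and hard-coded toy snippets stand in for documents returned by PubMed. All function names and data below are hypothetical.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase whitespace tokenizer that keeps alphabetic tokens only."""
    return [t for t in text.lower().split() if t.isalpha()]


def expand(snippets):
    """Turn retrieved snippets into a unit-length term vector.

    Assumption: raw term frequency is used as the weight, as a placeholder
    for the paper's rough-set-based weighting scheme.
    """
    counts = Counter()
    for snippet in snippets:
        counts.update(tokenize(snippet))
    norm = math.sqrt(sum(v * v for v in counts.values()))
    return {t: v / norm for t, v in counts.items()} if norm else {}


def kernel(u, v):
    """Dot product of two unit vectors, i.e. their cosine similarity."""
    return sum(w * v.get(t, 0.0) for t, w in u.items())


def disambiguate(context_snippets, sense_snippets):
    """Return the candidate sense whose expanded vector best matches the context."""
    ctx = expand(context_snippets)
    scores = {
        sense: kernel(ctx, expand(snips))
        for sense, snips in sense_snippets.items()
    }
    return max(scores, key=scores.get), scores


# Toy data (made up): snippets a search engine might return for the
# ambiguous term's context and for each candidate concept.
context_snippets = [
    "patient with cold and cough and fever",
    "viral infection symptoms cough",
]
sense_snippets = {
    "common_cold": [
        "viral infection with cough and fever",
        "rhinovirus infection symptoms",
    ],
    "cold_temperature": [
        "low temperature exposure and hypothermia",
        "freezing temperature storage",
    ],
}

best, scores = disambiguate(context_snippets, sense_snippets)
print(best, scores)
```

In this toy setup the context shares many terms with the snippets for the first candidate and almost none with the second, so the first candidate receives the higher kernel score. A real system would replace the hard-coded snippets with search results and the term-frequency weights with the paper's weighting scheme.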