Word Relationships are not Created Equally
Rajeswaran Viswanathan, S. S.
2021 IEEE Pune Section International Conference (PuneCon), 2021-12-16
https://doi.org/10.1109/punecon52575.2021.9686528
Citations: 0
Abstract
Constructing a knowledge graph from plain text entails identifying relationships among words. Downstream tasks such as similarity computation are sensitive to these relationships. Words are extracted from PubMed abstracts, and their “nearest neighbor” words are identified as candidate words using Word2Vec, GloVe, and FastText. ConceptNet, a popular knowledge graph, is then used to find the relationship between the words in each pair, and a similarity score is calculated for each word pair. A Random Effects Model (REM) is applied to the similarity scores to study the relationship strata. The analysis shows heterogeneity among the relationships, independent of the base similarity metric used.
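The heterogeneity analysis described in the abstract can be illustrated with a minimal sketch. The abstract does not say which random-effects estimator was used, so the DerSimonian-Laird estimator below is an assumption, as are the function names, the choice of cosine similarity, and the treatment of each relation type as one stratum summarized by its mean similarity. The relation labels in the usage note are ConceptNet relation names, but the scores are made-up illustrative numbers, not results from the paper.

```python
import numpy as np


def cosine_similarity(u, v):
    """Cosine similarity between two embedding vectors."""
    u = np.asarray(u, dtype=float)
    v = np.asarray(v, dtype=float)
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))


def random_effects_summary(scores_by_relation):
    """Heterogeneity across relation strata (DerSimonian-Laird, assumed).

    scores_by_relation maps a relation label to the list of similarity
    scores of word pairs linked by that relation. Each stratum needs at
    least two scores with non-zero spread so its variance is defined.
    """
    names = sorted(scores_by_relation)
    groups = [np.asarray(scores_by_relation[r], dtype=float) for r in names]

    # Per-stratum effect: mean similarity and the variance of that mean.
    means = np.array([g.mean() for g in groups])
    variances = np.array([g.var(ddof=1) / len(g) for g in groups])

    # Fixed-effect (inverse-variance) pooled mean, used to compute Cochran's Q.
    w = 1.0 / variances
    fixed_mean = np.sum(w * means) / np.sum(w)
    q = float(np.sum(w * (means - fixed_mean) ** 2))
    df = len(names) - 1

    # Between-stratum variance tau^2 and the I^2 heterogeneity statistic.
    c = np.sum(w) - np.sum(w ** 2) / np.sum(w)
    tau2 = max(0.0, (q - df) / c)
    i2 = max(0.0, (q - df) / q) if q > 0 else 0.0

    # Random-effects pooled mean: weights also account for tau^2.
    w_re = 1.0 / (variances + tau2)
    pooled = float(np.sum(w_re * means) / np.sum(w_re))

    return {"relations": names, "Q": q, "df": df,
            "tau2": float(tau2), "I2": float(i2), "pooled_mean": pooled}
```

As a usage example, calling `random_effects_summary({"IsA": [0.80, 0.82, 0.78, 0.81], "RelatedTo": [0.40, 0.42, 0.38, 0.41], "Synonym": [0.90, 0.92, 0.88, 0.91]})` on these toy scores gives an `I2` close to 1, meaning almost all of the variation in mean similarity lies between relation types rather than within them. Repeating the call with scores from a different embedding model would be one way to check whether the heterogeneity depends on the base similarity metric.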