重温概率关系分析:使用概率关系图对短文中的词语进行关系相似性分析

Dima Alnahas, Abdullah Ateş, A. Aydin, Baris Baykant Alagöz
{"title":"重温概率关系分析:使用概率关系图对短文中的词语进行关系相似性分析","authors":"Dima Alnahas, Abdullah Ateş, A. Aydin, Baris Baykant Alagöz","doi":"10.47000/tjmcs.1240729","DOIUrl":null,"url":null,"abstract":"Relation graphs provide useful tools for structural and relational analyses of highly complex multi-component systems. Probabilistic relation graph models can represent relations between system components by their probabilistic links. These graph types have been widely used for graphical representation of Markov models and bigram probabilities. This study presents an implication of relational similarities within probabilistic graph models of textual entries. The article discusses several utilization examples of two fundamental similarity measures in probabilistic analysis of short texts. To this end, construction of probabilistic graph models by using bigram probability matrices of textual entries is illustrated and vector spaces of input word-vectors and output word-vectors are formed. In this vector space, utilization of cosine similarity and mean squared error measures are demonstrated to evaluate probabilistic relational similarity between lexeme pairs in short texts. By using probabilistic relation graphs of the short texts, relational interchangeability analyses of lexeme pairs are conducted, and confidence index parameters are defined to express reliability of these analyses. Potential applications of these graphs in language processing and linguistics are discussed on the basis of the analysis results of example texts.","PeriodicalId":506513,"journal":{"name":"Turkish Journal of Mathematics and Computer Science","volume":"38 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Revisiting Probabilistic Relation Analysis: Using Probabilistic Relation Graphs for Relational Similarity Analysis of Words in Short Texts\",\"authors\":\"Dima Alnahas, Abdullah Ateş, A. Aydin, Baris Baykant Alagöz\",\"doi\":\"10.47000/tjmcs.1240729\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Relation graphs provide useful tools for structural and relational analyses of highly complex multi-component systems. Probabilistic relation graph models can represent relations between system components by their probabilistic links. These graph types have been widely used for graphical representation of Markov models and bigram probabilities. This study presents an implication of relational similarities within probabilistic graph models of textual entries. The article discusses several utilization examples of two fundamental similarity measures in probabilistic analysis of short texts. To this end, construction of probabilistic graph models by using bigram probability matrices of textual entries is illustrated and vector spaces of input word-vectors and output word-vectors are formed. In this vector space, utilization of cosine similarity and mean squared error measures are demonstrated to evaluate probabilistic relational similarity between lexeme pairs in short texts. By using probabilistic relation graphs of the short texts, relational interchangeability analyses of lexeme pairs are conducted, and confidence index parameters are defined to express reliability of these analyses. Potential applications of these graphs in language processing and linguistics are discussed on the basis of the analysis results of example texts.\",\"PeriodicalId\":506513,\"journal\":{\"name\":\"Turkish Journal of Mathematics and Computer Science\",\"volume\":\"38 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-08-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Turkish Journal of Mathematics and Computer Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.47000/tjmcs.1240729\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Turkish Journal of Mathematics and Computer Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.47000/tjmcs.1240729","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

关系图为高度复杂的多组件系统的结构和关系分析提供了有用的工具。概率关系图模型可以通过概率链接来表示系统组件之间的关系。这些图类型已被广泛用于马尔可夫模型和 bigram 概率的图形表示。本研究介绍了文本条目的概率图模型中关系相似性的含义。文章讨论了短文概率分析中两种基本相似性度量的几个应用实例。为此,文章说明了利用文本词条的 bigram 概率矩阵构建概率图模型的方法,并形成了输入词向量和输出词向量的向量空间。在该向量空间中,利用余弦相似度和均方误差度量来评估短文中词素对之间的概率关系相似性。通过使用短文的概率关系图,对词素对进行了关系互换性分析,并定义了置信度指数参数来表示这些分析的可靠性。根据示例文本的分析结果,讨论了这些图在语言处理和语言学中的潜在应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Revisiting Probabilistic Relation Analysis: Using Probabilistic Relation Graphs for Relational Similarity Analysis of Words in Short Texts
Relation graphs provide useful tools for structural and relational analyses of highly complex multi-component systems. Probabilistic relation graph models can represent relations between system components by their probabilistic links. These graph types have been widely used for graphical representation of Markov models and bigram probabilities. This study presents an implication of relational similarities within probabilistic graph models of textual entries. The article discusses several utilization examples of two fundamental similarity measures in probabilistic analysis of short texts. To this end, construction of probabilistic graph models by using bigram probability matrices of textual entries is illustrated and vector spaces of input word-vectors and output word-vectors are formed. In this vector space, utilization of cosine similarity and mean squared error measures are demonstrated to evaluate probabilistic relational similarity between lexeme pairs in short texts. By using probabilistic relation graphs of the short texts, relational interchangeability analyses of lexeme pairs are conducted, and confidence index parameters are defined to express reliability of these analyses. Potential applications of these graphs in language processing and linguistics are discussed on the basis of the analysis results of example texts.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信