2D Vector Representation of Binomial Hierarchical Tree Items

Ilknur Dönmez, Seda Karateke, M. Zontul
{"title":"2D Vector Representation of Binomial Hierarchical Tree Items","authors":"Ilknur Dönmez, Seda Karateke, M. Zontul","doi":"10.1109/ICTACSE50438.2022.10009738","DOIUrl":null,"url":null,"abstract":"Today Artificial Intelligence (AI) algorithms need to represent different kinds of input items in numeric or vector format. Some input data can easily be transformed to numeric or vector format but the structure of some special data prevents direct and easy transformation. For instance, we can represent air condition using humidity, pressure, and temperature values with a vector that has three features and we can understand the similarity of two different air measurements using cosine-similarity of two vectors. But if we are dealing with a general ontology tree, which has elements \"entity\" as the root element, its two children \"living things\" and \"non-living things\" as first- level elements repeatedly children of \"living things\" that are \"Animals\", \"Plants\" as second level elements, it is harder to represent this kind of data with numeric values. The ontology tree starts from the general items and goes to specific items. If we want to represent an element of this tree with a vector; how can it be possible? And if we want the measured similarity using some methods like cosine-similarity, which one similarity is higher, (\"Animal\" and \"non-living thing\") or (\"Animal\" and \"Living thing\")? How should we select the values of this vector for each item of the hierarchical tree? In this paper, we propose an original and basic idea to represent the hierarchical tree items with 2D vectors and in the proposed method the cosine-similarity metric works for measuring the semantic similarity of represented items at the same level as parent items. There are two important results related to our representation: (1) The \"y\" values of the items give the hierarchical level of the item. (2) For the same level items, the cosine similarities between the parent item and child items are higher if the child belongs to this parent compared to other childrens'. In other words, the cosine similarity between the parent item and child items is highest if the child belongs to this parent.","PeriodicalId":301767,"journal":{"name":"2022 International Conference on Theoretical and Applied Computer Science and Engineering (ICTASCE)","volume":"35 3","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Theoretical and Applied Computer Science and Engineering (ICTASCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTACSE50438.2022.10009738","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Today Artificial Intelligence (AI) algorithms need to represent different kinds of input items in numeric or vector format. Some input data can easily be transformed to numeric or vector format but the structure of some special data prevents direct and easy transformation. For instance, we can represent air condition using humidity, pressure, and temperature values with a vector that has three features and we can understand the similarity of two different air measurements using cosine-similarity of two vectors. But if we are dealing with a general ontology tree, which has elements "entity" as the root element, its two children "living things" and "non-living things" as first- level elements repeatedly children of "living things" that are "Animals", "Plants" as second level elements, it is harder to represent this kind of data with numeric values. The ontology tree starts from the general items and goes to specific items. If we want to represent an element of this tree with a vector; how can it be possible? And if we want the measured similarity using some methods like cosine-similarity, which one similarity is higher, ("Animal" and "non-living thing") or ("Animal" and "Living thing")? How should we select the values of this vector for each item of the hierarchical tree? In this paper, we propose an original and basic idea to represent the hierarchical tree items with 2D vectors and in the proposed method the cosine-similarity metric works for measuring the semantic similarity of represented items at the same level as parent items. There are two important results related to our representation: (1) The "y" values of the items give the hierarchical level of the item. (2) For the same level items, the cosine similarities between the parent item and child items are higher if the child belongs to this parent compared to other childrens'. In other words, the cosine similarity between the parent item and child items is highest if the child belongs to this parent.
二项层次树项的二维矢量表示
如今,人工智能(AI)算法需要以数字或矢量格式表示不同类型的输入项。一些输入数据可以很容易地转换为数字或矢量格式,但一些特殊数据的结构阻碍了直接和容易的转换。例如,我们可以用一个有三个特征的向量来表示湿度、压力和温度值,我们可以用两个向量的余弦相似度来理解两个不同空气测量值的相似性。但是,如果我们处理的是一个通用的本体树,它以“实体”元素为根元素,它的两个子元素“生物”和“非生物”作为第一级元素,而“生物”的子元素是“动物”,“植物”作为第二级元素,则很难用数值表示这类数据。本体树从一般项目开始,到特定项目。如果我们想用向量来表示这棵树的一个元素;这怎么可能呢?如果我们想用余弦相似度之类的方法测量相似度,那么哪一种相似度更高,是“动物”和“非生物”还是“动物”和“生物”?我们应该如何为层次树的每一项选择这个向量的值?在本文中,我们提出了一种用二维向量表示层次树项目的新颖的基本思想,在该方法中,余弦相似度度量用于度量被表示项目与父项目在同一层次上的语义相似度。有两个与我们的表示相关的重要结果:(1)项目的“y”值给出了项目的层次级别。(2)在同一等级的项目中,父项目与子项目之间的余弦相似度,如果孩子属于父项目,则高于其他孩子。换句话说,如果子元素属于父元素,则父元素和子元素之间的余弦相似度最高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信