{"title":"Agent-Based Parallelization of a Multi-Dimensional Semantic Database Model","authors":"Alex Li, M. Fukuda","doi":"10.1109/IRI58017.2023.00019","DOIUrl":null,"url":null,"abstract":"Responses to database queries that may be even identical should vary if they are given under a different user context. For instance, queries for wild animals in the context of the ocean versus mountains should be different. Announced in 1993 [1], Mathematical Model of Meaning (MMM) provides users with capabilities to extract data items tightly coupled under different semantic spaces. Such a space is created dynamically with user-defined impression words to compute semantic equivalence and similarity between data items. MMM computes semantic correlations between the key and other data items to achieve dynamic data querying. However, a semantic space creation and a data correlative calculation are computationally demanding. We consider MMM as a practical database application of multi-agent technologies, construct a space over a cluster system, and have multi-agents explore for a given target and its surrounding data items. We use the Multi-Agent Spatial Simulation (MASS) library to implement an agent-based semantic database system and to measure its parallel execution. Compared to a sequential MMM implementation, MASS-based parallelization yielded a 22-time speedup when creating a space, mainly achieved with matrix multiplication. MASS also reduced the time required for distance sorting of multi-dimensional vectors by 23.7%.","PeriodicalId":290818,"journal":{"name":"2023 IEEE 24th International Conference on Information Reuse and Integration for Data Science (IRI)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE 24th International Conference on Information Reuse and Integration for Data Science (IRI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IRI58017.2023.00019","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Responses to database queries that may be even identical should vary if they are given under a different user context. For instance, queries for wild animals in the context of the ocean versus mountains should be different. Announced in 1993 [1], Mathematical Model of Meaning (MMM) provides users with capabilities to extract data items tightly coupled under different semantic spaces. Such a space is created dynamically with user-defined impression words to compute semantic equivalence and similarity between data items. MMM computes semantic correlations between the key and other data items to achieve dynamic data querying. However, a semantic space creation and a data correlative calculation are computationally demanding. We consider MMM as a practical database application of multi-agent technologies, construct a space over a cluster system, and have multi-agents explore for a given target and its surrounding data items. We use the Multi-Agent Spatial Simulation (MASS) library to implement an agent-based semantic database system and to measure its parallel execution. Compared to a sequential MMM implementation, MASS-based parallelization yielded a 22-time speedup when creating a space, mainly achieved with matrix multiplication. MASS also reduced the time required for distance sorting of multi-dimensional vectors by 23.7%.
对数据库查询的响应即使是相同的,如果在不同的用户上下文中给出,也应该有所不同。例如,在海洋和山脉上下文中查询野生动物应该是不同的。数学意义模型(Mathematical Model of Meaning, MMM)于1993年宣布[1],它为用户提供了提取不同语义空间下紧密耦合的数据项的能力。该空间由用户定义的印象词动态创建,以计算数据项之间的语义等价性和相似度。MMM通过计算键和其他数据项之间的语义相关性来实现动态数据查询。然而,语义空间的创建和数据的相关计算需要大量的计算量。我们将MMM视为多智能体技术在数据库中的实际应用,在集群系统上构建一个空间,并让多智能体探索给定目标及其周围的数据项。利用多智能体空间仿真(MASS)库实现了一个基于智能体的语义数据库系统,并对其并行执行进行了测量。与顺序的MMM实现相比,基于质量的并行化在创建空间时产生了22倍的加速,主要是通过矩阵乘法实现的。MASS还将多维向量的距离排序所需的时间减少了23.7%。