Similarity, probability and database organisation

A. Ramer, Hansuk Yu
{"title":"Similarity, probability and database organisation","authors":"A. Ramer, Hansuk Yu","doi":"10.1109/AFSS.1996.583603","DOIUrl":null,"url":null,"abstract":"The question of storing imperfect data within a database framework has been discussed in the literature almost since the time when the first formal database structure was proposed. The matter received an additional impetus with the advent of fuzzy sets and its promise of formally capturing the notion of imprecision. If the imperfection is one of uncertainty as to whether a certain data item (or data structure) is actually present in the database, then the use of probability would be natural. However, when the imperfection relates to the relative proximity of the actual data to some idealised value, the use of a fuzzy-like model (possibility or similarity) seems warranted. In practice, the choice of model appears to be a matter of personal preference for a researcher. Characteristically, there has been no attempt to utilise both possibility and probability within the same data model. This might be due, in part, to the difficulty of capturing, in a reasonable way, various interactions between the simultaneously present possibility and probability weights. This paper describes work in progress, dealing with our research in modelling the interaction between similarity among the tuples (a binary function) and the probabilities of the tuples in the relational data model. The importance of such a study is underlined by the ubiquity of situations which involve, in a natural fashion, both probabilistic and possibilistic considerations. We illustrate it on two examples.","PeriodicalId":197019,"journal":{"name":"Soft Computing in Intelligent Systems and Information Processing. Proceedings of the 1996 Asian Fuzzy Systems Symposium","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1996-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Soft Computing in Intelligent Systems and Information Processing. Proceedings of the 1996 Asian Fuzzy Systems Symposium","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AFSS.1996.583603","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

The question of storing imperfect data within a database framework has been discussed in the literature almost since the time when the first formal database structure was proposed. The matter received an additional impetus with the advent of fuzzy sets and its promise of formally capturing the notion of imprecision. If the imperfection is one of uncertainty as to whether a certain data item (or data structure) is actually present in the database, then the use of probability would be natural. However, when the imperfection relates to the relative proximity of the actual data to some idealised value, the use of a fuzzy-like model (possibility or similarity) seems warranted. In practice, the choice of model appears to be a matter of personal preference for a researcher. Characteristically, there has been no attempt to utilise both possibility and probability within the same data model. This might be due, in part, to the difficulty of capturing, in a reasonable way, various interactions between the simultaneously present possibility and probability weights. This paper describes work in progress, dealing with our research in modelling the interaction between similarity among the tuples (a binary function) and the probabilities of the tuples in the relational data model. The importance of such a study is underlined by the ubiquity of situations which involve, in a natural fashion, both probabilistic and possibilistic considerations. We illustrate it on two examples.
相似性,概率和数据库组织
在数据库框架中存储不完美数据的问题几乎从第一个正式数据库结构被提出的时候就已经在文献中讨论过了。随着模糊集的出现,以及模糊集有望正式捕获不精确概念,这个问题得到了额外的推动。如果不完美是不确定某个数据项(或数据结构)是否实际存在于数据库中,那么使用概率是很自然的。然而,当不完美涉及到实际数据与某些理想值的相对接近时,使用类似模糊的模型(可能性或相似性)似乎是合理的。在实践中,模型的选择似乎是研究人员个人偏好的问题。典型的是,没有尝试在同一数据模型中同时利用可能性和概率。这可能部分是由于难以以合理的方式捕捉同时存在的可能性和概率权重之间的各种相互作用。本文描述了正在进行的工作,涉及我们在关系数据模型中对元组之间的相似性(二元函数)和元组概率之间的相互作用进行建模的研究。这种研究的重要性被普遍存在的情况所强调,这些情况以一种自然的方式涉及到概率和可能性的考虑。我们用两个例子来说明它。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信