Why the relational data model matters for climate data management

IF 4.2 2区 地球科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
Ezequiel Cimadevilla
{"title":"Why the relational data model matters for climate data management","authors":"Ezequiel Cimadevilla","doi":"10.1016/j.cageo.2025.105931","DOIUrl":null,"url":null,"abstract":"<div><div>Efficient data management of climate data banks, in particular those generated by Global or Regional Climate Models, is an important requirement for precise understanding of current changes in the climate system. Current data management practices in the climate community are based on the analysis of binary files for storage of multidimensional arrays that require ad hoc software libraries for accessing the data. Several approaches are being developed to ease and facilitate climate data management and data analysis. However, the theoretical foundations that cause climate data manipulation difficulties remain unchallenged. The Relational Data Model was proposed as a formal solution for database management based on mathematical logic. It has been widely accepted in the industry and has survived the test of time. However, the foundational principles of the Relational Data Model have been overlooked by the climate data management community, mostly due to a lack of emphasis in the relevance of mathematical logic for database management and misunderstanding between physical and logical levels of abstraction. As a result, climate data management workflows lack the rigor and formality provided by the Relational Data Model. This work explains the Relational Data Model at the logical level of abstraction and provides the arguments, clarifies the misconceptions, and justifies its adoption for climate data management in the context of gridded data generated by climate models.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"201 ","pages":"Article 105931"},"PeriodicalIF":4.2000,"publicationDate":"2025-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Geosciences","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0098300425000810","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

Efficient data management of climate data banks, in particular those generated by Global or Regional Climate Models, is an important requirement for precise understanding of current changes in the climate system. Current data management practices in the climate community are based on the analysis of binary files for storage of multidimensional arrays that require ad hoc software libraries for accessing the data. Several approaches are being developed to ease and facilitate climate data management and data analysis. However, the theoretical foundations that cause climate data manipulation difficulties remain unchallenged. The Relational Data Model was proposed as a formal solution for database management based on mathematical logic. It has been widely accepted in the industry and has survived the test of time. However, the foundational principles of the Relational Data Model have been overlooked by the climate data management community, mostly due to a lack of emphasis in the relevance of mathematical logic for database management and misunderstanding between physical and logical levels of abstraction. As a result, climate data management workflows lack the rigor and formality provided by the Relational Data Model. This work explains the Relational Data Model at the logical level of abstraction and provides the arguments, clarifies the misconceptions, and justifies its adoption for climate data management in the context of gridded data generated by climate models.
为什么关系数据模型对气候数据管理很重要
对气候数据库,特别是由全球或区域气候模式产生的气候数据库进行有效的数据管理,是精确了解当前气候系统变化的重要要求。当前气候界的数据管理实践是基于对多维数组存储的二进制文件的分析,这需要专门的软件库来访问数据。目前正在制定若干方法,以简化和促进气候数据管理和数据分析。然而,造成气候数据操纵困难的理论基础仍然没有受到挑战。关系数据模型是一种基于数学逻辑的数据库管理的形式化解决方案。它已被业界广泛接受,并经受住了时间的考验。然而,关系数据模型的基本原则被气候数据管理界所忽视,主要是由于缺乏对数据库管理的数学逻辑相关性的强调,以及物理和逻辑抽象层次之间的误解。因此,气候数据管理工作流缺乏关系数据模型所提供的严谨性和正式性。这项工作在抽象的逻辑层面上解释了关系数据模型,并提供了论据,澄清了误解,并证明了在气候模型生成的网格数据的背景下将其用于气候数据管理。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Computers & Geosciences
Computers & Geosciences 地学-地球科学综合
CiteScore
9.30
自引率
6.80%
发文量
164
审稿时长
3.4 months
期刊介绍: Computers & Geosciences publishes high impact, original research at the interface between Computer Sciences and Geosciences. Publications should apply modern computer science paradigms, whether computational or informatics-based, to address problems in the geosciences.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信