{"title":"地质时间尺度的语义映射:一个时间参考","authors":"Susan Edelstein, Ben Norton","doi":"10.3897/biss.7.112232","DOIUrl":null,"url":null,"abstract":"The Geologic Time Scale is an ordered hierarchical set of terms representing specific time intervals in Earth's history. The hierarchical structure is correlated to the geologic record and major geologic events in Earth’s history (Gradstein et al. 2020). In the absence of quantitative numeric age values from absolute dating methods, the relative time intervals in the geologic time scale provide us with the vocabulary needed for deciphering Earth’s history and chronological reconstruction. This temporal frame of reference is critical to establishing correlations between specimens and how they fit within the Earth’s 4.567 Ga (giga annum) history.\n Due to spatial and temporal variations in the stratigraphic record, the terminology used in conjunction with geologic time scales is largely inconsistent. For a detailed discussion regarding term use in geologic timescales, see Cohen et al. (2013). As a result, published values for geologic timescale terms are often ambiguous and highly variable, limiting interoperability and hindering temporal correlations among specimens. A solution is to map verbatim geologic timescale values to a controlled vocabulary, constructing a single temporal frame of reference. The harmonization process is governed by an established set of business rules that can ultimately become fully or partially automated.\n In this study, we examined the Global Biodiversity Information Facility’s (GBIF) published distinct verbatim values for Darwin Core terms in the GeologicalContext Class of Darwin Core to assess the the use of chronostratiphic terms, a process highlighted in Sahdev et al. (2017). Preservation of these verbatim values, the initial unmapped set of published values, is important. Many are derived directly from primary source material and possess special historical and regional significance. These include land mammal ages (e.g., Lindsay (2003)), biostratigraphic zones, regional terms, and terms with higher granularity than the International Commission of Stratigraphy’s (ICS) timescale allows (e.g., subages/substages). For the purposes of this study, we selected the 2023/6 version of the ICS chronostratigraphic timescale as the controlled vocabulary (Cohen et al. 2023). The ICS is the most widely adopted timescale, comprising the most generalized and universally applicable intervals of geologic time.\n After semantic analysis of the verbatim values (see Table 1 for comparative statistics), we established a comprehensive set of business rules to map to the ICS timescale controlled vocabulary. This process yielded a collection of documented procedures to transform the heterogeneous collection of published terms into a semantically consistent dataset. The end result is a single temporal frame of reference for published geologic and paleontological specimens through semantic mapping to improve the temporal correlations between geologic specimens globally through data interoperability. This talk will highlight the process of harmonizing a heterogeneous collection of published verbatim Geologic Time Scale values with an established controlled vocabulary through semantic mapping.","PeriodicalId":9011,"journal":{"name":"Biodiversity Information Science and Standards","volume":"5 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Semantic Mapping of the Geologic Time Scale: A temporal reference\",\"authors\":\"Susan Edelstein, Ben Norton\",\"doi\":\"10.3897/biss.7.112232\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The Geologic Time Scale is an ordered hierarchical set of terms representing specific time intervals in Earth's history. The hierarchical structure is correlated to the geologic record and major geologic events in Earth’s history (Gradstein et al. 2020). In the absence of quantitative numeric age values from absolute dating methods, the relative time intervals in the geologic time scale provide us with the vocabulary needed for deciphering Earth’s history and chronological reconstruction. This temporal frame of reference is critical to establishing correlations between specimens and how they fit within the Earth’s 4.567 Ga (giga annum) history.\\n Due to spatial and temporal variations in the stratigraphic record, the terminology used in conjunction with geologic time scales is largely inconsistent. For a detailed discussion regarding term use in geologic timescales, see Cohen et al. (2013). As a result, published values for geologic timescale terms are often ambiguous and highly variable, limiting interoperability and hindering temporal correlations among specimens. A solution is to map verbatim geologic timescale values to a controlled vocabulary, constructing a single temporal frame of reference. The harmonization process is governed by an established set of business rules that can ultimately become fully or partially automated.\\n In this study, we examined the Global Biodiversity Information Facility’s (GBIF) published distinct verbatim values for Darwin Core terms in the GeologicalContext Class of Darwin Core to assess the the use of chronostratiphic terms, a process highlighted in Sahdev et al. (2017). Preservation of these verbatim values, the initial unmapped set of published values, is important. Many are derived directly from primary source material and possess special historical and regional significance. These include land mammal ages (e.g., Lindsay (2003)), biostratigraphic zones, regional terms, and terms with higher granularity than the International Commission of Stratigraphy’s (ICS) timescale allows (e.g., subages/substages). For the purposes of this study, we selected the 2023/6 version of the ICS chronostratigraphic timescale as the controlled vocabulary (Cohen et al. 2023). The ICS is the most widely adopted timescale, comprising the most generalized and universally applicable intervals of geologic time.\\n After semantic analysis of the verbatim values (see Table 1 for comparative statistics), we established a comprehensive set of business rules to map to the ICS timescale controlled vocabulary. This process yielded a collection of documented procedures to transform the heterogeneous collection of published terms into a semantically consistent dataset. The end result is a single temporal frame of reference for published geologic and paleontological specimens through semantic mapping to improve the temporal correlations between geologic specimens globally through data interoperability. This talk will highlight the process of harmonizing a heterogeneous collection of published verbatim Geologic Time Scale values with an established controlled vocabulary through semantic mapping.\",\"PeriodicalId\":9011,\"journal\":{\"name\":\"Biodiversity Information Science and Standards\",\"volume\":\"5 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-09-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biodiversity Information Science and Standards\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3897/biss.7.112232\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biodiversity Information Science and Standards","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3897/biss.7.112232","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
地质年标是一组有序的、层次分明的术语,代表地球历史上特定的时间间隔。分层结构与地质记录和地球历史上的重大地质事件相关(Gradstein et al. 2020)。在绝对测年方法缺乏定量的数字年龄值的情况下,地质时间尺度上的相对时间间隔为我们提供了破译地球历史和重建时间顺序所需的词汇。这个时间参考框架对于建立标本之间的相关性以及它们如何适应地球4.567亿年(千兆年)历史至关重要。由于地层记录的时空变化,与地质时间尺度结合使用的术语在很大程度上是不一致的。有关术语在地质时间尺度上使用的详细讨论,请参见Cohen et al.(2013)。因此,地质时标项的公布值往往是模糊的和高度可变的,限制了互操作性,阻碍了标本之间的时间相关性。一种解决方案是将逐字的地质时间标度值映射到受控词汇表,构建一个单一的时间参考框架。协调过程由一组已建立的业务规则控制,这些业务规则最终可以完全或部分自动化。在本研究中,我们检查了全球生物多样性信息设施(GBIF)在达尔文核心的地质背景类中发布的达尔文核心术语的不同逐字值,以评估时间地层术语的使用,Sahdev等人(2017)强调了这一过程。保存这些逐字值(发布值的初始未映射集)非常重要。许多是直接来源于原始材料,具有特殊的历史和地域意义。这些包括陆地哺乳动物的年龄(例如Lindsay(2003))、生物地层带、区域术语以及比国际地层学委员会(ICS)的时间尺度允许的粒度更高的术语(例如亚年龄/亚阶段)。为了本研究的目的,我们选择了2023/6版本的ICS年代地层时间标度作为控制词汇(Cohen et al. 2023)。ICS是采用最广泛的时间标度,它包括最广义和普遍适用的地质时间间隔。在对逐字值进行语义分析之后(参见表1的比较统计数据),我们建立了一组全面的业务规则,以映射到ICS时间刻度控制词汇表。这个过程产生了一组文档化的过程,用于将异构的已发布术语集合转换为语义一致的数据集。最终结果是通过语义映射为已发表的地质和古生物标本提供一个单一的时间参考框架,通过数据互操作性提高全球地质标本之间的时间相关性。本次演讲将重点介绍通过语义映射将已发表的逐字地质时间尺度值的异构集合与已建立的受控词汇表协调起来的过程。
Semantic Mapping of the Geologic Time Scale: A temporal reference
The Geologic Time Scale is an ordered hierarchical set of terms representing specific time intervals in Earth's history. The hierarchical structure is correlated to the geologic record and major geologic events in Earth’s history (Gradstein et al. 2020). In the absence of quantitative numeric age values from absolute dating methods, the relative time intervals in the geologic time scale provide us with the vocabulary needed for deciphering Earth’s history and chronological reconstruction. This temporal frame of reference is critical to establishing correlations between specimens and how they fit within the Earth’s 4.567 Ga (giga annum) history.
Due to spatial and temporal variations in the stratigraphic record, the terminology used in conjunction with geologic time scales is largely inconsistent. For a detailed discussion regarding term use in geologic timescales, see Cohen et al. (2013). As a result, published values for geologic timescale terms are often ambiguous and highly variable, limiting interoperability and hindering temporal correlations among specimens. A solution is to map verbatim geologic timescale values to a controlled vocabulary, constructing a single temporal frame of reference. The harmonization process is governed by an established set of business rules that can ultimately become fully or partially automated.
In this study, we examined the Global Biodiversity Information Facility’s (GBIF) published distinct verbatim values for Darwin Core terms in the GeologicalContext Class of Darwin Core to assess the the use of chronostratiphic terms, a process highlighted in Sahdev et al. (2017). Preservation of these verbatim values, the initial unmapped set of published values, is important. Many are derived directly from primary source material and possess special historical and regional significance. These include land mammal ages (e.g., Lindsay (2003)), biostratigraphic zones, regional terms, and terms with higher granularity than the International Commission of Stratigraphy’s (ICS) timescale allows (e.g., subages/substages). For the purposes of this study, we selected the 2023/6 version of the ICS chronostratigraphic timescale as the controlled vocabulary (Cohen et al. 2023). The ICS is the most widely adopted timescale, comprising the most generalized and universally applicable intervals of geologic time.
After semantic analysis of the verbatim values (see Table 1 for comparative statistics), we established a comprehensive set of business rules to map to the ICS timescale controlled vocabulary. This process yielded a collection of documented procedures to transform the heterogeneous collection of published terms into a semantically consistent dataset. The end result is a single temporal frame of reference for published geologic and paleontological specimens through semantic mapping to improve the temporal correlations between geologic specimens globally through data interoperability. This talk will highlight the process of harmonizing a heterogeneous collection of published verbatim Geologic Time Scale values with an established controlled vocabulary through semantic mapping.