CREDIT-X1local:来自中国西南地区 ChinArray 的机器学习地震学参考数据集

IF 1.2 4区 地球科学 Q3 Earth and Planetary Sciences
Lu Li , Weitao Wang , Ziye Yu , Yini Chen
{"title":"CREDIT-X1local:来自中国西南地区 ChinArray 的机器学习地震学参考数据集","authors":"Lu Li ,&nbsp;Weitao Wang ,&nbsp;Ziye Yu ,&nbsp;Yini Chen","doi":"10.1016/j.eqs.2024.01.018","DOIUrl":null,"url":null,"abstract":"<div><p>High-quality datasets are critical for the development of advanced machine-learning algorithms in seismology. Here, we present an earthquake dataset based on the ChinArray Phase I records (X1). ChinArray Phase I was deployed in the southern north-south seismic zone (20° N–32° N, 95° E–110° E) in 2011–2013 using 355 portable broadband seismic stations. CREDIT-X1local, the first release of the ChinArray Reference Earthquake Dataset for Innovative Techniques (CREDIT), includes comprehensive information for the 105,455 local events that occurred in the southern north-south seismic zone during array observation, incorporating them into a single HDF5 file. Original 100-Hz sampled three-component waveforms are organized by event for stations within epicenter distances of 1,000 km, and records of ≥ 200 s are included for each waveform. Two types of phase labels are provided. The first includes manually picked labels for 5,999 events with magnitudes ≥ 2.0, providing 66,507 Pg, 42,310 Sg, 12,823 Pn, and 546 Sn phases. The second contains automatically labeled phases for 105,442 events with magnitudes of −1.6 to 7.6. These phases were picked using a recurrent neural network phase picker and screened using the corresponding travel time curves, resulting in 1,179,808 Pg, 884,281 Sg, 176,089 Pn, and 22,986 Sn phases. Additionally, first-motion polarities are included for 31,273 Pg phases. The event and station locations are provided, so that deep learning networks for both conventional phase picking and phase association can be trained and validated. The CREDIT-X1local dataset is the first million-scale dataset constructed from a dense seismic array, which is designed to support various multi-station deep-learning methods, high-precision focal mechanism inversion, and seismic tomography studies. Additionally, owing to the high seismicity in the southern north-south seismic zone in China, this dataset has great potential for future scientific discoveries.</p></div>","PeriodicalId":46333,"journal":{"name":"Earthquake Science","volume":"37 2","pages":"Pages 139-157"},"PeriodicalIF":1.2000,"publicationDate":"2024-02-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1674451924000223/pdfft?md5=8e02eb44f9bdbf58fbce3bd1a13349cf&pid=1-s2.0-S1674451924000223-main.pdf","citationCount":"0","resultStr":"{\"title\":\"CREDIT-X1local: A reference dataset for machine learning seismology from ChinArray in Southwest China\",\"authors\":\"Lu Li ,&nbsp;Weitao Wang ,&nbsp;Ziye Yu ,&nbsp;Yini Chen\",\"doi\":\"10.1016/j.eqs.2024.01.018\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>High-quality datasets are critical for the development of advanced machine-learning algorithms in seismology. Here, we present an earthquake dataset based on the ChinArray Phase I records (X1). ChinArray Phase I was deployed in the southern north-south seismic zone (20° N–32° N, 95° E–110° E) in 2011–2013 using 355 portable broadband seismic stations. CREDIT-X1local, the first release of the ChinArray Reference Earthquake Dataset for Innovative Techniques (CREDIT), includes comprehensive information for the 105,455 local events that occurred in the southern north-south seismic zone during array observation, incorporating them into a single HDF5 file. Original 100-Hz sampled three-component waveforms are organized by event for stations within epicenter distances of 1,000 km, and records of ≥ 200 s are included for each waveform. Two types of phase labels are provided. The first includes manually picked labels for 5,999 events with magnitudes ≥ 2.0, providing 66,507 Pg, 42,310 Sg, 12,823 Pn, and 546 Sn phases. The second contains automatically labeled phases for 105,442 events with magnitudes of −1.6 to 7.6. These phases were picked using a recurrent neural network phase picker and screened using the corresponding travel time curves, resulting in 1,179,808 Pg, 884,281 Sg, 176,089 Pn, and 22,986 Sn phases. Additionally, first-motion polarities are included for 31,273 Pg phases. The event and station locations are provided, so that deep learning networks for both conventional phase picking and phase association can be trained and validated. The CREDIT-X1local dataset is the first million-scale dataset constructed from a dense seismic array, which is designed to support various multi-station deep-learning methods, high-precision focal mechanism inversion, and seismic tomography studies. Additionally, owing to the high seismicity in the southern north-south seismic zone in China, this dataset has great potential for future scientific discoveries.</p></div>\",\"PeriodicalId\":46333,\"journal\":{\"name\":\"Earthquake Science\",\"volume\":\"37 2\",\"pages\":\"Pages 139-157\"},\"PeriodicalIF\":1.2000,\"publicationDate\":\"2024-02-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S1674451924000223/pdfft?md5=8e02eb44f9bdbf58fbce3bd1a13349cf&pid=1-s2.0-S1674451924000223-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Earthquake Science\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1674451924000223\",\"RegionNum\":4,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Earth and Planetary Sciences\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Earthquake Science","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1674451924000223","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Earth and Planetary Sciences","Score":null,"Total":0}
引用次数: 0

摘要

高质量的数据集对于开发先进的地震学机器学习算法至关重要。在此,我们介绍基于 ChinArray 第一阶段记录(X1)的地震数据集。ChinArray 第一阶段于 2011-2013 年在南部南北地震带(北纬 20°-32°,东经 95°-110°)部署,使用了 355 个便携式宽带地震台。CREDIT-X1local是ChinArray创新技术参考地震数据集(CREDIT)的首次发布,包括阵列观测期间在南部南北地震带发生的105,455次本地事件的综合信息,并将其合并为一个HDF5文件。原始的 100 赫兹采样三分量波形按震中距离在 1,000 公里以内的台站事件分类,每个波形都包含≥ 200 秒的记录。提供两种相位标签。第一种包括人工挑选的 5999 个震级≥ 2.0 事件的标签,提供 66507 个 Pg、42310 个 Sg、12823 个 Pn 和 546 个 Sn 相位。第二个相位包含 105,442 个震级为-1.6 到 7.6 的事件的自动标记相位。这些相位使用递归神经网络相位拾取器拾取,并使用相应的移动时间曲线进行筛选,最终得到 1,179,808 个 Pg 相位、884,281 个 Sg 相位、176,089 个 Pn 相位和 22,986 个 Sn 相位。此外,还包括 31 273 个 Pg 相位的第一运动极性。提供了事件和站点位置,以便对用于传统相位拾取和相位关联的深度学习网络进行训练和验证。CREDIT-X1local 数据集是首个由密集地震阵列构建的百万级数据集,旨在支持各种多台站深度学习方法、高精度焦点机制反演和地震层析成像研究。此外,由于中国南部南北地震带地震活动频繁,该数据集对未来科学发现具有巨大潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
CREDIT-X1local: A reference dataset for machine learning seismology from ChinArray in Southwest China

High-quality datasets are critical for the development of advanced machine-learning algorithms in seismology. Here, we present an earthquake dataset based on the ChinArray Phase I records (X1). ChinArray Phase I was deployed in the southern north-south seismic zone (20° N–32° N, 95° E–110° E) in 2011–2013 using 355 portable broadband seismic stations. CREDIT-X1local, the first release of the ChinArray Reference Earthquake Dataset for Innovative Techniques (CREDIT), includes comprehensive information for the 105,455 local events that occurred in the southern north-south seismic zone during array observation, incorporating them into a single HDF5 file. Original 100-Hz sampled three-component waveforms are organized by event for stations within epicenter distances of 1,000 km, and records of ≥ 200 s are included for each waveform. Two types of phase labels are provided. The first includes manually picked labels for 5,999 events with magnitudes ≥ 2.0, providing 66,507 Pg, 42,310 Sg, 12,823 Pn, and 546 Sn phases. The second contains automatically labeled phases for 105,442 events with magnitudes of −1.6 to 7.6. These phases were picked using a recurrent neural network phase picker and screened using the corresponding travel time curves, resulting in 1,179,808 Pg, 884,281 Sg, 176,089 Pn, and 22,986 Sn phases. Additionally, first-motion polarities are included for 31,273 Pg phases. The event and station locations are provided, so that deep learning networks for both conventional phase picking and phase association can be trained and validated. The CREDIT-X1local dataset is the first million-scale dataset constructed from a dense seismic array, which is designed to support various multi-station deep-learning methods, high-precision focal mechanism inversion, and seismic tomography studies. Additionally, owing to the high seismicity in the southern north-south seismic zone in China, this dataset has great potential for future scientific discoveries.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Earthquake Science
Earthquake Science GEOCHEMISTRY & GEOPHYSICS-
CiteScore
1.10
自引率
8.30%
发文量
42
审稿时长
3 months
期刊介绍: Earthquake Science (EQS) aims to publish high-quality, original, peer-reviewed articles on earthquake-related research subjects. It is an English international journal sponsored by the Seismological Society of China and the Institute of Geophysics, China Earthquake Administration. The topics include, but not limited to, the following ● Seismic sources of all kinds. ● Earth structure at all scales. ● Seismotectonics. ● New methods and theoretical seismology. ● Strong ground motion. ● Seismic phenomena of all kinds. ● Seismic hazards, earthquake forecasting and prediction. ● Seismic instrumentation. ● Significant recent or past seismic events. ● Documentation of recent seismic events or important observations. ● Descriptions of field deployments, new methods, and available software tools. The types of manuscripts include the following. There is no length requirement, except for the Short Notes. 【Articles】 Original contributions that have not been published elsewhere. 【Short Notes】 Short papers of recent events or topics that warrant rapid peer reviews and publications. Limited to 4 publication pages. 【Rapid Communications】 Significant contributions that warrant rapid peer reviews and publications. 【Review Articles】Review articles are by invitation only. Please contact the editorial office and editors for possible proposals. 【Toolboxes】 Descriptions of novel numerical methods and associated computer codes. 【Data Products】 Documentation of datasets of various kinds that are interested to the community and available for open access (field data, processed data, synthetic data, or models). 【Opinions】Views on important topics and future directions in earthquake science. 【Comments and Replies】Commentaries on a recently published EQS paper is welcome. The authors of the paper commented will be invited to reply. Both the Comment and the Reply are subject to peer review.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信