支持托卡马克数据机器学习研究的开放数据服务

IF 1.5 4区 物理与天体物理 Q3 PHYSICS, FLUIDS & PLASMAS
Samuel Jackson;Saiful Khan;Nathan Cummings;James Hodson;Shaun de Witt;Stanislas Pamela;Rob Akers;Jeyan Thiyagalingam;MAST Team
{"title":"支持托卡马克数据机器学习研究的开放数据服务","authors":"Samuel Jackson;Saiful Khan;Nathan Cummings;James Hodson;Shaun de Witt;Stanislas Pamela;Rob Akers;Jeyan Thiyagalingam;MAST Team","doi":"10.1109/TPS.2025.3583419","DOIUrl":null,"url":null,"abstract":"The increasing complexity and volume of plasma fusion experimental data, coupled with the growing adoption of machine learning in fusion research, necessitate advanced and efficient data management solutions. We propose an open data service for fusion experiments operated by the UKAEA, designed to address the evolving needs of machine-learning-driven fusion research. Our system provides a framework to organize MAST, MAST upgrade (MAST-U), and Joint European Torus (JET) experimental data in accordance with findability, accessibility, interoperability, and reuse (FAIR) principles, using distributed object storage for scalability and a relational database for efficient metadata indexing. In addition, it offers simplified abstractions through an application programming interface (API), facilitating seamless data access and integration with data analysis and machine learning workflows. Performance evaluation of metrics such as data load time and throughput, across varying numbers of parallel workers, demonstrates the data pipeline’s optimization for efficient machine learning application development. Our solution significantly enhances support for data-driven research and machine learning applications in fusion by laying the groundwork for open, FAIR-compliant fusion data, which enables cross-machine analysis, prompts international collaboration, and potentially accelerates advancements in fusion energy research.","PeriodicalId":450,"journal":{"name":"IEEE Transactions on Plasma Science","volume":"53 9","pages":"2440-2449"},"PeriodicalIF":1.5000,"publicationDate":"2025-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11128905","citationCount":"0","resultStr":"{\"title\":\"An Open Data Service for Supporting Research in Machine Learning on Tokamak Data\",\"authors\":\"Samuel Jackson;Saiful Khan;Nathan Cummings;James Hodson;Shaun de Witt;Stanislas Pamela;Rob Akers;Jeyan Thiyagalingam;MAST Team\",\"doi\":\"10.1109/TPS.2025.3583419\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The increasing complexity and volume of plasma fusion experimental data, coupled with the growing adoption of machine learning in fusion research, necessitate advanced and efficient data management solutions. We propose an open data service for fusion experiments operated by the UKAEA, designed to address the evolving needs of machine-learning-driven fusion research. Our system provides a framework to organize MAST, MAST upgrade (MAST-U), and Joint European Torus (JET) experimental data in accordance with findability, accessibility, interoperability, and reuse (FAIR) principles, using distributed object storage for scalability and a relational database for efficient metadata indexing. In addition, it offers simplified abstractions through an application programming interface (API), facilitating seamless data access and integration with data analysis and machine learning workflows. Performance evaluation of metrics such as data load time and throughput, across varying numbers of parallel workers, demonstrates the data pipeline’s optimization for efficient machine learning application development. Our solution significantly enhances support for data-driven research and machine learning applications in fusion by laying the groundwork for open, FAIR-compliant fusion data, which enables cross-machine analysis, prompts international collaboration, and potentially accelerates advancements in fusion energy research.\",\"PeriodicalId\":450,\"journal\":{\"name\":\"IEEE Transactions on Plasma Science\",\"volume\":\"53 9\",\"pages\":\"2440-2449\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2025-08-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11128905\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Plasma Science\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11128905/\",\"RegionNum\":4,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"PHYSICS, FLUIDS & PLASMAS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Plasma Science","FirstCategoryId":"101","ListUrlMain":"https://ieeexplore.ieee.org/document/11128905/","RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"PHYSICS, FLUIDS & PLASMAS","Score":null,"Total":0}
引用次数: 0

摘要

等离子体聚变实验数据的复杂性和数量不断增加,再加上融合研究中越来越多地采用机器学习,需要先进高效的数据管理解决方案。我们建议为UKAEA运营的聚变实验提供开放数据服务,旨在满足机器学习驱动的聚变研究不断发展的需求。我们的系统按照可查找性、可访问性、互操作性和重用性(FAIR)原则提供了一个框架来组织MAST、MAST升级(MAST- u)和联合欧洲环面(JET)实验数据,使用分布式对象存储实现可伸缩性,使用关系数据库实现高效的元数据索引。此外,它通过应用程序编程接口(API)提供简化的抽象,促进无缝数据访问以及与数据分析和机器学习工作流程的集成。在不同数量的并行工作中,对数据加载时间和吞吐量等指标进行性能评估,展示了数据管道对高效机器学习应用程序开发的优化。我们的解决方案通过为开放、符合fair标准的聚变数据奠定基础,显著增强了对聚变领域数据驱动研究和机器学习应用的支持,从而实现了跨机器分析,促进了国际合作,并有可能加速聚变能源研究的进展。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
An Open Data Service for Supporting Research in Machine Learning on Tokamak Data
The increasing complexity and volume of plasma fusion experimental data, coupled with the growing adoption of machine learning in fusion research, necessitate advanced and efficient data management solutions. We propose an open data service for fusion experiments operated by the UKAEA, designed to address the evolving needs of machine-learning-driven fusion research. Our system provides a framework to organize MAST, MAST upgrade (MAST-U), and Joint European Torus (JET) experimental data in accordance with findability, accessibility, interoperability, and reuse (FAIR) principles, using distributed object storage for scalability and a relational database for efficient metadata indexing. In addition, it offers simplified abstractions through an application programming interface (API), facilitating seamless data access and integration with data analysis and machine learning workflows. Performance evaluation of metrics such as data load time and throughput, across varying numbers of parallel workers, demonstrates the data pipeline’s optimization for efficient machine learning application development. Our solution significantly enhances support for data-driven research and machine learning applications in fusion by laying the groundwork for open, FAIR-compliant fusion data, which enables cross-machine analysis, prompts international collaboration, and potentially accelerates advancements in fusion energy research.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
IEEE Transactions on Plasma Science
IEEE Transactions on Plasma Science 物理-物理:流体与等离子体
CiteScore
3.00
自引率
20.00%
发文量
538
审稿时长
3.8 months
期刊介绍: The scope covers all aspects of the theory and application of plasma science. It includes the following areas: magnetohydrodynamics; thermionics and plasma diodes; basic plasma phenomena; gaseous electronics; microwave/plasma interaction; electron, ion, and plasma sources; space plasmas; intense electron and ion beams; laser-plasma interactions; plasma diagnostics; plasma chemistry and processing; solid-state plasmas; plasma heating; plasma for controlled fusion research; high energy density plasmas; industrial/commercial applications of plasma physics; plasma waves and instabilities; and high power microwave and submillimeter wave generation.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信