DisruptionBench and Complimentary New Models: Two Advancements in Machine Learning Driven Disruption Prediction

IF 1.9 4区 工程技术 Q1 NUCLEAR SCIENCE & TECHNOLOGY
Lucas Spangher, Matteo Bonotto, William Arnold, Dhruva Chayapathy, Tommaso Gallingani, Alexander Spangher, Francesco Cannarile, Daniele Bigoni, Eliana de Marchi, Cristina Rea
{"title":"DisruptionBench and Complimentary New Models: Two Advancements in Machine Learning Driven Disruption Prediction","authors":"Lucas Spangher,&nbsp;Matteo Bonotto,&nbsp;William Arnold,&nbsp;Dhruva Chayapathy,&nbsp;Tommaso Gallingani,&nbsp;Alexander Spangher,&nbsp;Francesco Cannarile,&nbsp;Daniele Bigoni,&nbsp;Eliana de Marchi,&nbsp;Cristina Rea","doi":"10.1007/s10894-025-00495-2","DOIUrl":null,"url":null,"abstract":"<div><p>Plasma disruptions remain a major obstacle to sustained commercial operation of tokamak-based fusion devices. Although machine learning (ML) methods have shown promise for predicting disruptions, their performance and generalizability suffer from a lack of common benchmarks and comprehensive multi-device evaluations. To address this, we present <b>DisruptionBench</b>, a new benchmarking platform designed to standardize how ML-driven disruption prediction systems are trained and evaluated on multi-machine data. DisruptionBench spans three devices - Alcator C-Mod, DIII-D, and EAST - and includes tasks of varying difficulty: zero-shot, few-shot, and many-shot training regimes to assess each model’s ability to transfer learned representations to new or data-limited machines. We evaluate four state-of-the-art ML architectures. Two are re-implementations of notable prior work: a random forest (Cristina Rea in PPCF 60:084008, 2018) and the Hybrid Deep Learner (HDL) (Zhu in NC 61: 026607, 2020). We also propose two new approaches tailored for disruption prediction: a transformer-based model inspired by GPT-2, capable of learning long-range temporal dependencies through self-attention, and a Continuous Convolutional Neural Network (CCNN) that leverages continuous kernels to capture subtle variations in plasma signals. Across the nine benchmarking tasks, the CCNN demonstrates consistently strong performance and achieves the highest overall Area Under the ROC Curve (AUC) in intra-machine tests (up to 0.97 on C-Mod). Nevertheless, the GPT-2-based approach and HDL can outperform CCNN in specific transfer scenarios, particularly when the test machine is underrepresented in training data. We further analyze the significance of memory length in capturing precursor phenomena, providing evidence that longer context windows can boost predictive accuracy.</p></div>","PeriodicalId":634,"journal":{"name":"Journal of Fusion Energy","volume":"44 1","pages":""},"PeriodicalIF":1.9000,"publicationDate":"2025-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10894-025-00495-2.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Fusion Energy","FirstCategoryId":"5","ListUrlMain":"https://link.springer.com/article/10.1007/s10894-025-00495-2","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"NUCLEAR SCIENCE & TECHNOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Plasma disruptions remain a major obstacle to sustained commercial operation of tokamak-based fusion devices. Although machine learning (ML) methods have shown promise for predicting disruptions, their performance and generalizability suffer from a lack of common benchmarks and comprehensive multi-device evaluations. To address this, we present DisruptionBench, a new benchmarking platform designed to standardize how ML-driven disruption prediction systems are trained and evaluated on multi-machine data. DisruptionBench spans three devices - Alcator C-Mod, DIII-D, and EAST - and includes tasks of varying difficulty: zero-shot, few-shot, and many-shot training regimes to assess each model’s ability to transfer learned representations to new or data-limited machines. We evaluate four state-of-the-art ML architectures. Two are re-implementations of notable prior work: a random forest (Cristina Rea in PPCF 60:084008, 2018) and the Hybrid Deep Learner (HDL) (Zhu in NC 61: 026607, 2020). We also propose two new approaches tailored for disruption prediction: a transformer-based model inspired by GPT-2, capable of learning long-range temporal dependencies through self-attention, and a Continuous Convolutional Neural Network (CCNN) that leverages continuous kernels to capture subtle variations in plasma signals. Across the nine benchmarking tasks, the CCNN demonstrates consistently strong performance and achieves the highest overall Area Under the ROC Curve (AUC) in intra-machine tests (up to 0.97 on C-Mod). Nevertheless, the GPT-2-based approach and HDL can outperform CCNN in specific transfer scenarios, particularly when the test machine is underrepresented in training data. We further analyze the significance of memory length in capturing precursor phenomena, providing evidence that longer context windows can boost predictive accuracy.

中断工作台和补充新模型:机器学习驱动的中断预测的两项进展
等离子体干扰仍然是托卡马克聚变装置持续商业运行的主要障碍。尽管机器学习(ML)方法已经显示出预测中断的希望,但它们的性能和泛化性受到缺乏共同基准和全面的多设备评估的影响。为了解决这个问题,我们提出了DisruptionBench,这是一个新的基准平台,旨在标准化机器学习驱动的中断预测系统如何在多机器数据上进行训练和评估。DisruptionBench跨越三种设备- Alcator C-Mod, DIII-D和EAST -并包括不同难度的任务:零射击,少射击和多射击训练制度,以评估每个模型将学习表征转移到新的或数据有限的机器的能力。我们评估了四种最先进的机器学习架构。其中两个是对先前著名工作的重新实现:随机森林(Cristina Rea在PPCF 60:084008, 2018)和混合深度学习器(HDL) (Zhu在NC 61: 026607, 2020)。我们还提出了两种专门用于中断预测的新方法:受GPT-2启发的基于变压器的模型,能够通过自注意学习远程时间依赖性,以及利用连续核捕获等离子体信号中细微变化的连续卷积神经网络(CCNN)。在九个基准测试任务中,CCNN表现出一贯强劲的性能,并在机器内测试中达到最高的ROC曲线下的总体面积(AUC)(在C-Mod上高达0.97)。然而,基于gpt -2的方法和HDL可以在特定的传输场景中优于CCNN,特别是当测试机器在训练数据中代表性不足时。我们进一步分析了记忆长度在捕捉前兆现象中的重要性,提供了证据,证明更长的上下文窗口可以提高预测的准确性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Fusion Energy
Journal of Fusion Energy 工程技术-核科学技术
CiteScore
2.20
自引率
0.00%
发文量
24
审稿时长
2.3 months
期刊介绍: The Journal of Fusion Energy features original research contributions and review papers examining and the development and enhancing the knowledge base of thermonuclear fusion as a potential power source. It is designed to serve as a journal of record for the publication of original research results in fundamental and applied physics, applied science and technological development. The journal publishes qualified papers based on peer reviews. This journal also provides a forum for discussing broader policies and strategies that have played, and will continue to play, a crucial role in fusion programs. In keeping with this theme, readers will find articles covering an array of important matters concerning strategy and program direction.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信