Lucas Spangher, Matteo Bonotto, William Arnold, Dhruva Chayapathy, Tommaso Gallingani, Alexander Spangher, Francesco Cannarile, Daniele Bigoni, Eliana de Marchi, Cristina Rea
{"title":"DisruptionBench and Complimentary New Models: Two Advancements in Machine Learning Driven Disruption Prediction","authors":"Lucas Spangher, Matteo Bonotto, William Arnold, Dhruva Chayapathy, Tommaso Gallingani, Alexander Spangher, Francesco Cannarile, Daniele Bigoni, Eliana de Marchi, Cristina Rea","doi":"10.1007/s10894-025-00495-2","DOIUrl":null,"url":null,"abstract":"<div><p>Plasma disruptions remain a major obstacle to sustained commercial operation of tokamak-based fusion devices. Although machine learning (ML) methods have shown promise for predicting disruptions, their performance and generalizability suffer from a lack of common benchmarks and comprehensive multi-device evaluations. To address this, we present <b>DisruptionBench</b>, a new benchmarking platform designed to standardize how ML-driven disruption prediction systems are trained and evaluated on multi-machine data. DisruptionBench spans three devices - Alcator C-Mod, DIII-D, and EAST - and includes tasks of varying difficulty: zero-shot, few-shot, and many-shot training regimes to assess each model’s ability to transfer learned representations to new or data-limited machines. We evaluate four state-of-the-art ML architectures. Two are re-implementations of notable prior work: a random forest (Cristina Rea in PPCF 60:084008, 2018) and the Hybrid Deep Learner (HDL) (Zhu in NC 61: 026607, 2020). We also propose two new approaches tailored for disruption prediction: a transformer-based model inspired by GPT-2, capable of learning long-range temporal dependencies through self-attention, and a Continuous Convolutional Neural Network (CCNN) that leverages continuous kernels to capture subtle variations in plasma signals. Across the nine benchmarking tasks, the CCNN demonstrates consistently strong performance and achieves the highest overall Area Under the ROC Curve (AUC) in intra-machine tests (up to 0.97 on C-Mod). Nevertheless, the GPT-2-based approach and HDL can outperform CCNN in specific transfer scenarios, particularly when the test machine is underrepresented in training data. We further analyze the significance of memory length in capturing precursor phenomena, providing evidence that longer context windows can boost predictive accuracy.</p></div>","PeriodicalId":634,"journal":{"name":"Journal of Fusion Energy","volume":"44 1","pages":""},"PeriodicalIF":1.9000,"publicationDate":"2025-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10894-025-00495-2.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Fusion Energy","FirstCategoryId":"5","ListUrlMain":"https://link.springer.com/article/10.1007/s10894-025-00495-2","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"NUCLEAR SCIENCE & TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Plasma disruptions remain a major obstacle to sustained commercial operation of tokamak-based fusion devices. Although machine learning (ML) methods have shown promise for predicting disruptions, their performance and generalizability suffer from a lack of common benchmarks and comprehensive multi-device evaluations. To address this, we present DisruptionBench, a new benchmarking platform designed to standardize how ML-driven disruption prediction systems are trained and evaluated on multi-machine data. DisruptionBench spans three devices - Alcator C-Mod, DIII-D, and EAST - and includes tasks of varying difficulty: zero-shot, few-shot, and many-shot training regimes to assess each model’s ability to transfer learned representations to new or data-limited machines. We evaluate four state-of-the-art ML architectures. Two are re-implementations of notable prior work: a random forest (Cristina Rea in PPCF 60:084008, 2018) and the Hybrid Deep Learner (HDL) (Zhu in NC 61: 026607, 2020). We also propose two new approaches tailored for disruption prediction: a transformer-based model inspired by GPT-2, capable of learning long-range temporal dependencies through self-attention, and a Continuous Convolutional Neural Network (CCNN) that leverages continuous kernels to capture subtle variations in plasma signals. Across the nine benchmarking tasks, the CCNN demonstrates consistently strong performance and achieves the highest overall Area Under the ROC Curve (AUC) in intra-machine tests (up to 0.97 on C-Mod). Nevertheless, the GPT-2-based approach and HDL can outperform CCNN in specific transfer scenarios, particularly when the test machine is underrepresented in training data. We further analyze the significance of memory length in capturing precursor phenomena, providing evidence that longer context windows can boost predictive accuracy.
期刊介绍:
The Journal of Fusion Energy features original research contributions and review papers examining and the development and enhancing the knowledge base of thermonuclear fusion as a potential power source. It is designed to serve as a journal of record for the publication of original research results in fundamental and applied physics, applied science and technological development. The journal publishes qualified papers based on peer reviews.
This journal also provides a forum for discussing broader policies and strategies that have played, and will continue to play, a crucial role in fusion programs. In keeping with this theme, readers will find articles covering an array of important matters concerning strategy and program direction.