基于vae的数据增强在机器学习预测精度提高和不确定性降低方面的研究

IF 2.1 3区工程技术 Q1 NUCLEAR SCIENCE & TECHNOLOGY

Nuclear Engineering and Design Pub Date : 2025-09-11 DOI:10.1016/j.nucengdes.2025.114433

Farah Alsafadi, Mahmoud Yaseen, Xu Wu

{"title":"基于vae的数据增强在机器学习预测精度提高和不确定性降低方面的研究","authors":"Farah Alsafadi, Mahmoud Yaseen, Xu Wu","doi":"10.1016/j.nucengdes.2025.114433","DOIUrl":null,"url":null,"abstract":"<div><div>The confluence of ultrafast computers with large memory, rapid progress in Machine Learning (ML) algorithms, and the availability of large datasets place multiple engineering fields at the threshold of dramatic progress. However, a unique challenge in nuclear engineering is <em>data scarcity</em> because experimentation on nuclear systems is usually more expensive and time-consuming than most other disciplines. One potential way to resolve the data scarcity issue is deep generative learning, which uses certain ML models to learn the underlying distribution of existing data and generate synthetic samples that resemble the real data. In this way, one can significantly expand the dataset to train more accurate predictive ML models. In this study, our objective is to evaluate the effectiveness of data augmentation using variational autoencoder (VAE)-based deep generative models. We investigated whether the data augmentation leads to improved accuracy in the predictions of a deep neural network (DNN) model trained using the augmented data. Additionally, the DNN prediction uncertainties are quantified using Bayesian Neural Networks (BNN) and conformal prediction (CP) to assess the impact on predictive uncertainty reduction. To test the proposed methodology, we used TRACE simulations of steady-state void fraction data based on the NUPEC Boiling Water Reactor Full-size Fine-mesh Bundle Test (BFBT) benchmark. We found that augmenting the training dataset using VAEs has improved the DNN model’s predictive accuracy, improved the prediction confidence intervals, and reduced the prediction uncertainties.</div></div>","PeriodicalId":19170,"journal":{"name":"Nuclear Engineering and Design","volume":"445 ","pages":"Article 114433"},"PeriodicalIF":2.1000,"publicationDate":"2025-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An investigation on machine learning predictive accuracy improvement and uncertainty reduction using VAE-based data augmentation\",\"authors\":\"Farah Alsafadi, Mahmoud Yaseen, Xu Wu\",\"doi\":\"10.1016/j.nucengdes.2025.114433\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The confluence of ultrafast computers with large memory, rapid progress in Machine Learning (ML) algorithms, and the availability of large datasets place multiple engineering fields at the threshold of dramatic progress. However, a unique challenge in nuclear engineering is <em>data scarcity</em> because experimentation on nuclear systems is usually more expensive and time-consuming than most other disciplines. One potential way to resolve the data scarcity issue is deep generative learning, which uses certain ML models to learn the underlying distribution of existing data and generate synthetic samples that resemble the real data. In this way, one can significantly expand the dataset to train more accurate predictive ML models. In this study, our objective is to evaluate the effectiveness of data augmentation using variational autoencoder (VAE)-based deep generative models. We investigated whether the data augmentation leads to improved accuracy in the predictions of a deep neural network (DNN) model trained using the augmented data. Additionally, the DNN prediction uncertainties are quantified using Bayesian Neural Networks (BNN) and conformal prediction (CP) to assess the impact on predictive uncertainty reduction. To test the proposed methodology, we used TRACE simulations of steady-state void fraction data based on the NUPEC Boiling Water Reactor Full-size Fine-mesh Bundle Test (BFBT) benchmark. We found that augmenting the training dataset using VAEs has improved the DNN model’s predictive accuracy, improved the prediction confidence intervals, and reduced the prediction uncertainties.</div></div>\",\"PeriodicalId\":19170,\"journal\":{\"name\":\"Nuclear Engineering and Design\",\"volume\":\"445 \",\"pages\":\"Article 114433\"},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2025-09-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Nuclear Engineering and Design\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0029549325006107\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"NUCLEAR SCIENCE & TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nuclear Engineering and Design","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0029549325006107","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"NUCLEAR SCIENCE & TECHNOLOGY","Score":null,"Total":0}

引用次数: 0

摘要

具有大内存的超快计算机、机器学习（ML）算法的快速发展以及大型数据集的可用性的融合，使多个工程领域处于戏剧性进展的门槛。然而，核工程的一个独特挑战是数据匮乏，因为对核系统的实验通常比大多数其他学科更昂贵和耗时。解决数据稀缺问题的一种潜在方法是深度生成学习，它使用某些ML模型来学习现有数据的底层分布，并生成与真实数据相似的合成样本。通过这种方式，可以显着扩展数据集以训练更准确的预测ML模型。在这项研究中，我们的目标是评估使用基于变分自编码器（VAE）的深度生成模型进行数据增强的有效性。我们研究了数据增强是否会导致使用增强数据训练的深度神经网络（DNN）模型预测准确性的提高。此外，利用贝叶斯神经网络（BNN）和保形预测（CP）对DNN预测不确定性进行量化，以评估对预测不确定性降低的影响。为了测试所提出的方法，我们使用基于NUPEC沸水反应堆全尺寸细孔束试验（BFBT）基准的稳态空隙率数据的TRACE模拟。研究发现，利用VAEs对训练数据集进行扩充，提高了DNN模型的预测精度，提高了预测置信区间，降低了预测不确定性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

An investigation on machine learning predictive accuracy improvement and uncertainty reduction using VAE-based data augmentation

The confluence of ultrafast computers with large memory, rapid progress in Machine Learning (ML) algorithms, and the availability of large datasets place multiple engineering fields at the threshold of dramatic progress. However, a unique challenge in nuclear engineering is data scarcity because experimentation on nuclear systems is usually more expensive and time-consuming than most other disciplines. One potential way to resolve the data scarcity issue is deep generative learning, which uses certain ML models to learn the underlying distribution of existing data and generate synthetic samples that resemble the real data. In this way, one can significantly expand the dataset to train more accurate predictive ML models. In this study, our objective is to evaluate the effectiveness of data augmentation using variational autoencoder (VAE)-based deep generative models. We investigated whether the data augmentation leads to improved accuracy in the predictions of a deep neural network (DNN) model trained using the augmented data. Additionally, the DNN prediction uncertainties are quantified using Bayesian Neural Networks (BNN) and conformal prediction (CP) to assess the impact on predictive uncertainty reduction. To test the proposed methodology, we used TRACE simulations of steady-state void fraction data based on the NUPEC Boiling Water Reactor Full-size Fine-mesh Bundle Test (BFBT) benchmark. We found that augmenting the training dataset using VAEs has improved the DNN model’s predictive accuracy, improved the prediction confidence intervals, and reduced the prediction uncertainties.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Nuclear Engineering and Design 工程技术-核科学技术

CiteScore

3.40

自引率

11.80%

发文量

377

审稿时长

5 months

期刊介绍： Nuclear Engineering and Design covers the wide range of disciplines involved in the engineering, design, safety and construction of nuclear fission reactors. The Editors welcome papers both on applied and innovative aspects and developments in nuclear science and technology. Fundamentals of Reactor Design include: • Thermal-Hydraulics and Core Physics • Safety Analysis, Risk Assessment (PSA) • Structural and Mechanical Engineering • Materials Science • Fuel Behavior and Design • Structural Plant Design • Engineering of Reactor Components • Experiments Aspects beyond fundamentals of Reactor Design covered: • Accident Mitigation Measures • Reactor Control Systems • Licensing Issues • Safeguard Engineering • Economy of Plants • Reprocessing / Waste Disposal • Applications of Nuclear Energy • Maintenance • Decommissioning Papers on new reactor ideas and developments (Generation IV reactors) such as inherently safe modular HTRs, High Performance LWRs/HWRs and LMFBs/GFR will be considered; Actinide Burners, Accelerator Driven Systems, Energy Amplifiers and other special designs of power and research reactors and their applications are also encouraged.