{"title":"基于vae的数据增强在机器学习预测精度提高和不确定性降低方面的研究","authors":"Farah Alsafadi, Mahmoud Yaseen, Xu Wu","doi":"10.1016/j.nucengdes.2025.114433","DOIUrl":null,"url":null,"abstract":"<div><div>The confluence of ultrafast computers with large memory, rapid progress in Machine Learning (ML) algorithms, and the availability of large datasets place multiple engineering fields at the threshold of dramatic progress. However, a unique challenge in nuclear engineering is <em>data scarcity</em> because experimentation on nuclear systems is usually more expensive and time-consuming than most other disciplines. One potential way to resolve the data scarcity issue is deep generative learning, which uses certain ML models to learn the underlying distribution of existing data and generate synthetic samples that resemble the real data. In this way, one can significantly expand the dataset to train more accurate predictive ML models. In this study, our objective is to evaluate the effectiveness of data augmentation using variational autoencoder (VAE)-based deep generative models. We investigated whether the data augmentation leads to improved accuracy in the predictions of a deep neural network (DNN) model trained using the augmented data. Additionally, the DNN prediction uncertainties are quantified using Bayesian Neural Networks (BNN) and conformal prediction (CP) to assess the impact on predictive uncertainty reduction. To test the proposed methodology, we used TRACE simulations of steady-state void fraction data based on the NUPEC Boiling Water Reactor Full-size Fine-mesh Bundle Test (BFBT) benchmark. We found that augmenting the training dataset using VAEs has improved the DNN model’s predictive accuracy, improved the prediction confidence intervals, and reduced the prediction uncertainties.</div></div>","PeriodicalId":19170,"journal":{"name":"Nuclear Engineering and Design","volume":"445 ","pages":"Article 114433"},"PeriodicalIF":2.1000,"publicationDate":"2025-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An investigation on machine learning predictive accuracy improvement and uncertainty reduction using VAE-based data augmentation\",\"authors\":\"Farah Alsafadi, Mahmoud Yaseen, Xu Wu\",\"doi\":\"10.1016/j.nucengdes.2025.114433\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The confluence of ultrafast computers with large memory, rapid progress in Machine Learning (ML) algorithms, and the availability of large datasets place multiple engineering fields at the threshold of dramatic progress. However, a unique challenge in nuclear engineering is <em>data scarcity</em> because experimentation on nuclear systems is usually more expensive and time-consuming than most other disciplines. One potential way to resolve the data scarcity issue is deep generative learning, which uses certain ML models to learn the underlying distribution of existing data and generate synthetic samples that resemble the real data. In this way, one can significantly expand the dataset to train more accurate predictive ML models. In this study, our objective is to evaluate the effectiveness of data augmentation using variational autoencoder (VAE)-based deep generative models. We investigated whether the data augmentation leads to improved accuracy in the predictions of a deep neural network (DNN) model trained using the augmented data. Additionally, the DNN prediction uncertainties are quantified using Bayesian Neural Networks (BNN) and conformal prediction (CP) to assess the impact on predictive uncertainty reduction. To test the proposed methodology, we used TRACE simulations of steady-state void fraction data based on the NUPEC Boiling Water Reactor Full-size Fine-mesh Bundle Test (BFBT) benchmark. We found that augmenting the training dataset using VAEs has improved the DNN model’s predictive accuracy, improved the prediction confidence intervals, and reduced the prediction uncertainties.</div></div>\",\"PeriodicalId\":19170,\"journal\":{\"name\":\"Nuclear Engineering and Design\",\"volume\":\"445 \",\"pages\":\"Article 114433\"},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2025-09-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Nuclear Engineering and Design\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0029549325006107\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"NUCLEAR SCIENCE & TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nuclear Engineering and Design","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0029549325006107","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"NUCLEAR SCIENCE & TECHNOLOGY","Score":null,"Total":0}
An investigation on machine learning predictive accuracy improvement and uncertainty reduction using VAE-based data augmentation
The confluence of ultrafast computers with large memory, rapid progress in Machine Learning (ML) algorithms, and the availability of large datasets place multiple engineering fields at the threshold of dramatic progress. However, a unique challenge in nuclear engineering is data scarcity because experimentation on nuclear systems is usually more expensive and time-consuming than most other disciplines. One potential way to resolve the data scarcity issue is deep generative learning, which uses certain ML models to learn the underlying distribution of existing data and generate synthetic samples that resemble the real data. In this way, one can significantly expand the dataset to train more accurate predictive ML models. In this study, our objective is to evaluate the effectiveness of data augmentation using variational autoencoder (VAE)-based deep generative models. We investigated whether the data augmentation leads to improved accuracy in the predictions of a deep neural network (DNN) model trained using the augmented data. Additionally, the DNN prediction uncertainties are quantified using Bayesian Neural Networks (BNN) and conformal prediction (CP) to assess the impact on predictive uncertainty reduction. To test the proposed methodology, we used TRACE simulations of steady-state void fraction data based on the NUPEC Boiling Water Reactor Full-size Fine-mesh Bundle Test (BFBT) benchmark. We found that augmenting the training dataset using VAEs has improved the DNN model’s predictive accuracy, improved the prediction confidence intervals, and reduced the prediction uncertainties.
期刊介绍:
Nuclear Engineering and Design covers the wide range of disciplines involved in the engineering, design, safety and construction of nuclear fission reactors. The Editors welcome papers both on applied and innovative aspects and developments in nuclear science and technology.
Fundamentals of Reactor Design include:
• Thermal-Hydraulics and Core Physics
• Safety Analysis, Risk Assessment (PSA)
• Structural and Mechanical Engineering
• Materials Science
• Fuel Behavior and Design
• Structural Plant Design
• Engineering of Reactor Components
• Experiments
Aspects beyond fundamentals of Reactor Design covered:
• Accident Mitigation Measures
• Reactor Control Systems
• Licensing Issues
• Safeguard Engineering
• Economy of Plants
• Reprocessing / Waste Disposal
• Applications of Nuclear Energy
• Maintenance
• Decommissioning
Papers on new reactor ideas and developments (Generation IV reactors) such as inherently safe modular HTRs, High Performance LWRs/HWRs and LMFBs/GFR will be considered; Actinide Burners, Accelerator Driven Systems, Energy Amplifiers and other special designs of power and research reactors and their applications are also encouraged.