Ben Qi, Jun Sun, Zhe Sui, Xingyu Xiao, Jingang Liang
{"title":"利用大语言模型进行多模态学习,改进核电站的瞬态识别","authors":"Ben Qi, Jun Sun, Zhe Sui, Xingyu Xiao, Jingang Liang","doi":"10.1016/j.pnucene.2024.105421","DOIUrl":null,"url":null,"abstract":"<div><p>Transients are events that cause nuclear power plants (NPPs) to transition from a normal state to an abnormal state, which can lead to severe accidents if not properly handled. Transient identification is crucial for NPPs’ safety and operation. In this paper, we propose a novel multimodal text-time series learning framework(MTTL), the first work to apply a large language model for transient identification. The MTTL consists of self-supervised learning pre-training and zero-shot classification for transient identification. During pre-training, the framework utilizes a large language model(LLM) and a time-series(TS) encoder to fully exploit the rich multimodal information available in NPPs, i.e., to obtain the embeddings of both text data and TS data. The LLM is used to capture the transient knowledge of the NPPs by learning from the text data, and the TS encoder is used to capture the temporal dependencies of the transients by encoding the TS data. Both the LLM and the TS encoder have a linear projection head to map the embeddings into a common space. The similarity between the embeddings of the text and TS data is calculated to minimize the contrastive learning loss and obtain a pre-trained model with rich transient knowledge. During the zero-shot classification, the framework utilizes a pre-trained model to effectively identify real-world NPP transients where the data is different from the pre-trained simulated data. The proposed framework is evaluated on the High-Temperature Reactor-Pebblebed Modules (HTR-PM) plant, and the results demonstrate that the MTTL outperforms several baseline methods, including Transformer, LSTM, and CNN1D. The better zero-shot transient identification capability makes it possible to perform better in real-world NPPs.</p></div>","PeriodicalId":20617,"journal":{"name":"Progress in Nuclear Energy","volume":"177 ","pages":"Article 105421"},"PeriodicalIF":3.3000,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multimodal learning using large language models to improve transient identification of nuclear power plants\",\"authors\":\"Ben Qi, Jun Sun, Zhe Sui, Xingyu Xiao, Jingang Liang\",\"doi\":\"10.1016/j.pnucene.2024.105421\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Transients are events that cause nuclear power plants (NPPs) to transition from a normal state to an abnormal state, which can lead to severe accidents if not properly handled. Transient identification is crucial for NPPs’ safety and operation. In this paper, we propose a novel multimodal text-time series learning framework(MTTL), the first work to apply a large language model for transient identification. The MTTL consists of self-supervised learning pre-training and zero-shot classification for transient identification. During pre-training, the framework utilizes a large language model(LLM) and a time-series(TS) encoder to fully exploit the rich multimodal information available in NPPs, i.e., to obtain the embeddings of both text data and TS data. The LLM is used to capture the transient knowledge of the NPPs by learning from the text data, and the TS encoder is used to capture the temporal dependencies of the transients by encoding the TS data. Both the LLM and the TS encoder have a linear projection head to map the embeddings into a common space. The similarity between the embeddings of the text and TS data is calculated to minimize the contrastive learning loss and obtain a pre-trained model with rich transient knowledge. During the zero-shot classification, the framework utilizes a pre-trained model to effectively identify real-world NPP transients where the data is different from the pre-trained simulated data. The proposed framework is evaluated on the High-Temperature Reactor-Pebblebed Modules (HTR-PM) plant, and the results demonstrate that the MTTL outperforms several baseline methods, including Transformer, LSTM, and CNN1D. The better zero-shot transient identification capability makes it possible to perform better in real-world NPPs.</p></div>\",\"PeriodicalId\":20617,\"journal\":{\"name\":\"Progress in Nuclear Energy\",\"volume\":\"177 \",\"pages\":\"Article 105421\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2024-09-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Progress in Nuclear Energy\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0149197024003718\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"NUCLEAR SCIENCE & TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Progress in Nuclear Energy","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0149197024003718","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"NUCLEAR SCIENCE & TECHNOLOGY","Score":null,"Total":0}
Multimodal learning using large language models to improve transient identification of nuclear power plants
Transients are events that cause nuclear power plants (NPPs) to transition from a normal state to an abnormal state, which can lead to severe accidents if not properly handled. Transient identification is crucial for NPPs’ safety and operation. In this paper, we propose a novel multimodal text-time series learning framework(MTTL), the first work to apply a large language model for transient identification. The MTTL consists of self-supervised learning pre-training and zero-shot classification for transient identification. During pre-training, the framework utilizes a large language model(LLM) and a time-series(TS) encoder to fully exploit the rich multimodal information available in NPPs, i.e., to obtain the embeddings of both text data and TS data. The LLM is used to capture the transient knowledge of the NPPs by learning from the text data, and the TS encoder is used to capture the temporal dependencies of the transients by encoding the TS data. Both the LLM and the TS encoder have a linear projection head to map the embeddings into a common space. The similarity between the embeddings of the text and TS data is calculated to minimize the contrastive learning loss and obtain a pre-trained model with rich transient knowledge. During the zero-shot classification, the framework utilizes a pre-trained model to effectively identify real-world NPP transients where the data is different from the pre-trained simulated data. The proposed framework is evaluated on the High-Temperature Reactor-Pebblebed Modules (HTR-PM) plant, and the results demonstrate that the MTTL outperforms several baseline methods, including Transformer, LSTM, and CNN1D. The better zero-shot transient identification capability makes it possible to perform better in real-world NPPs.
期刊介绍:
Progress in Nuclear Energy is an international review journal covering all aspects of nuclear science and engineering. In keeping with the maturity of nuclear power, articles on safety, siting and environmental problems are encouraged, as are those associated with economics and fuel management. However, basic physics and engineering will remain an important aspect of the editorial policy. Articles published are either of a review nature or present new material in more depth. They are aimed at researchers and technically-oriented managers working in the nuclear energy field.
Please note the following:
1) PNE seeks high quality research papers which are medium to long in length. Short research papers should be submitted to the journal Annals in Nuclear Energy.
2) PNE reserves the right to reject papers which are based solely on routine application of computer codes used to produce reactor designs or explain existing reactor phenomena. Such papers, although worthy, are best left as laboratory reports whereas Progress in Nuclear Energy seeks papers of originality, which are archival in nature, in the fields of mathematical and experimental nuclear technology, including fission, fusion (blanket physics, radiation damage), safety, materials aspects, economics, etc.
3) Review papers, which may occasionally be invited, are particularly sought by the journal in these fields.