{"title":"基于多头注意变压器的光网络多步跨度损耗预测","authors":"Maryam Hedayatnejad;Yinqing Pei;David Boertjes;Dacian Demeter;Christian Desrosiers;Christine Tremblay","doi":"10.1109/JPHOT.2025.3571220","DOIUrl":null,"url":null,"abstract":"Span Loss is a pivotal characteristic of optical networks, and its accurate prediction enables adjustment for optimal performance and proactive monitoring. Deep learning models such as transformers, with their self-attention mechanism, have shown potential for various prediction tasks. In this study, we propose the Transformer-XL (Extra Long) model for single-step and multi-step forecasting, trained with field data. We report on models predicting span loss from 15 minutes to 5 days, using window sizes of 15 minutes to 10 days. The single-step model's average Absolute Maximum Error (AME) is better than the naive model by 2.13 dB and outperforms linear regression by 0.05–0.32 dB across different window sizes. Our single-step model also achieves better performance than the Recurrent Neural Network (RNN) with an AME improvement of 0.02 dB. The average AME of our multi-step model exceeds the naive model's performance by a range of 2.95-3.05 dB, linear regression by a substantial 0.02-0.15 dB and RNN by a range of 0.04-0.54 dB across different window sizes and forecast horizons. Based on Root Mean Square Error (RMSE), the single-step model performs better than the naive approach across various window sizes by 0.07 dB, achieves up to 0.07 dB improvement over linear regression, and delivers comparable results to RNN. Moreover, our multi-step model improves upon the naive approach with RMSE by 0.04 dB and RNN by 0.02 across various window sizes and forecast horizons. It also demonstrates a slight improvement over linear regression.","PeriodicalId":13204,"journal":{"name":"IEEE Photonics Journal","volume":"17 3","pages":"1-8"},"PeriodicalIF":2.1000,"publicationDate":"2025-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11006412","citationCount":"0","resultStr":"{\"title\":\"Multi-Step Span Loss Prediction in Optical Networks Using Multi-Head Attention Transformers\",\"authors\":\"Maryam Hedayatnejad;Yinqing Pei;David Boertjes;Dacian Demeter;Christian Desrosiers;Christine Tremblay\",\"doi\":\"10.1109/JPHOT.2025.3571220\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Span Loss is a pivotal characteristic of optical networks, and its accurate prediction enables adjustment for optimal performance and proactive monitoring. Deep learning models such as transformers, with their self-attention mechanism, have shown potential for various prediction tasks. In this study, we propose the Transformer-XL (Extra Long) model for single-step and multi-step forecasting, trained with field data. We report on models predicting span loss from 15 minutes to 5 days, using window sizes of 15 minutes to 10 days. The single-step model's average Absolute Maximum Error (AME) is better than the naive model by 2.13 dB and outperforms linear regression by 0.05–0.32 dB across different window sizes. Our single-step model also achieves better performance than the Recurrent Neural Network (RNN) with an AME improvement of 0.02 dB. The average AME of our multi-step model exceeds the naive model's performance by a range of 2.95-3.05 dB, linear regression by a substantial 0.02-0.15 dB and RNN by a range of 0.04-0.54 dB across different window sizes and forecast horizons. Based on Root Mean Square Error (RMSE), the single-step model performs better than the naive approach across various window sizes by 0.07 dB, achieves up to 0.07 dB improvement over linear regression, and delivers comparable results to RNN. Moreover, our multi-step model improves upon the naive approach with RMSE by 0.04 dB and RNN by 0.02 across various window sizes and forecast horizons. It also demonstrates a slight improvement over linear regression.\",\"PeriodicalId\":13204,\"journal\":{\"name\":\"IEEE Photonics Journal\",\"volume\":\"17 3\",\"pages\":\"1-8\"},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2025-03-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11006412\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Photonics Journal\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11006412/\",\"RegionNum\":4,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Photonics Journal","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/11006412/","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Multi-Step Span Loss Prediction in Optical Networks Using Multi-Head Attention Transformers
Span Loss is a pivotal characteristic of optical networks, and its accurate prediction enables adjustment for optimal performance and proactive monitoring. Deep learning models such as transformers, with their self-attention mechanism, have shown potential for various prediction tasks. In this study, we propose the Transformer-XL (Extra Long) model for single-step and multi-step forecasting, trained with field data. We report on models predicting span loss from 15 minutes to 5 days, using window sizes of 15 minutes to 10 days. The single-step model's average Absolute Maximum Error (AME) is better than the naive model by 2.13 dB and outperforms linear regression by 0.05–0.32 dB across different window sizes. Our single-step model also achieves better performance than the Recurrent Neural Network (RNN) with an AME improvement of 0.02 dB. The average AME of our multi-step model exceeds the naive model's performance by a range of 2.95-3.05 dB, linear regression by a substantial 0.02-0.15 dB and RNN by a range of 0.04-0.54 dB across different window sizes and forecast horizons. Based on Root Mean Square Error (RMSE), the single-step model performs better than the naive approach across various window sizes by 0.07 dB, achieves up to 0.07 dB improvement over linear regression, and delivers comparable results to RNN. Moreover, our multi-step model improves upon the naive approach with RMSE by 0.04 dB and RNN by 0.02 across various window sizes and forecast horizons. It also demonstrates a slight improvement over linear regression.
期刊介绍:
Breakthroughs in the generation of light and in its control and utilization have given rise to the field of Photonics, a rapidly expanding area of science and technology with major technological and economic impact. Photonics integrates quantum electronics and optics to accelerate progress in the generation of novel photon sources and in their utilization in emerging applications at the micro and nano scales spanning from the far-infrared/THz to the x-ray region of the electromagnetic spectrum. IEEE Photonics Journal is an online-only journal dedicated to the rapid disclosure of top-quality peer-reviewed research at the forefront of all areas of photonics. Contributions addressing issues ranging from fundamental understanding to emerging technologies and applications are within the scope of the Journal. The Journal includes topics in: Photon sources from far infrared to X-rays, Photonics materials and engineered photonic structures, Integrated optics and optoelectronic, Ultrafast, attosecond, high field and short wavelength photonics, Biophotonics, including DNA photonics, Nanophotonics, Magnetophotonics, Fundamentals of light propagation and interaction; nonlinear effects, Optical data storage, Fiber optics and optical communications devices, systems, and technologies, Micro Opto Electro Mechanical Systems (MOEMS), Microwave photonics, Optical Sensors.