Tao Jia, Ziling Fu, Rui Jiang, Zunliang Zou, Li Yang, Shuo Wang, Ningxin Jiao, Guanjie Zhao, Zhi Wang
{"title":"CMA-ES增强了光学非线性激活函数的优化","authors":"Tao Jia, Ziling Fu, Rui Jiang, Zunliang Zou, Li Yang, Shuo Wang, Ningxin Jiao, Guanjie Zhao, Zhi Wang","doi":"10.1016/j.optcom.2025.132145","DOIUrl":null,"url":null,"abstract":"<div><div>The development of all-optical and electro-optical neural networks represents a rapidly growing field of research. In this field, optimization and reconfigurability of optical nonlinear activation functions (ONAF) devices are crucial. In this work, we propose an optimization framework for MZI and MRR-based ONAF devices using the Covariance Matrix Adaptation Evolution Strategy (CMA-ES). By incorporating this approach into the optimization of artificial neural networks (ANNs), the root mean square error (RMSE) of the fitted optimization functions can reach as low as −86.0 dB. We also present a theoretical analysis of electro-optical (E/O) modulation in directional couplers (DC), enabling reconfigurable power coupling coefficients. Additionally, we investigate an alternative MRR configuration where output light is coupled with MZI from the drop port instead of the through port, leading to optimal performance. This setup allows for the implementation of several activation functions including Parametric ReLU, which attains 100 % training accuracy and 99.1 % validation accuracy in the MNIST handwritten digit recognition task. Our findings provide new insights into the design and optimization of optical activation devices for integrated photonic neural networks.</div></div>","PeriodicalId":19586,"journal":{"name":"Optics Communications","volume":"591 ","pages":"Article 132145"},"PeriodicalIF":2.2000,"publicationDate":"2025-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"CMA-ES enhanced optimization of optical nonlinear activation functions\",\"authors\":\"Tao Jia, Ziling Fu, Rui Jiang, Zunliang Zou, Li Yang, Shuo Wang, Ningxin Jiao, Guanjie Zhao, Zhi Wang\",\"doi\":\"10.1016/j.optcom.2025.132145\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The development of all-optical and electro-optical neural networks represents a rapidly growing field of research. In this field, optimization and reconfigurability of optical nonlinear activation functions (ONAF) devices are crucial. In this work, we propose an optimization framework for MZI and MRR-based ONAF devices using the Covariance Matrix Adaptation Evolution Strategy (CMA-ES). By incorporating this approach into the optimization of artificial neural networks (ANNs), the root mean square error (RMSE) of the fitted optimization functions can reach as low as −86.0 dB. We also present a theoretical analysis of electro-optical (E/O) modulation in directional couplers (DC), enabling reconfigurable power coupling coefficients. Additionally, we investigate an alternative MRR configuration where output light is coupled with MZI from the drop port instead of the through port, leading to optimal performance. This setup allows for the implementation of several activation functions including Parametric ReLU, which attains 100 % training accuracy and 99.1 % validation accuracy in the MNIST handwritten digit recognition task. Our findings provide new insights into the design and optimization of optical activation devices for integrated photonic neural networks.</div></div>\",\"PeriodicalId\":19586,\"journal\":{\"name\":\"Optics Communications\",\"volume\":\"591 \",\"pages\":\"Article 132145\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2025-06-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Optics Communications\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S003040182500673X\",\"RegionNum\":3,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"OPTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Optics Communications","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S003040182500673X","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"OPTICS","Score":null,"Total":0}
CMA-ES enhanced optimization of optical nonlinear activation functions
The development of all-optical and electro-optical neural networks represents a rapidly growing field of research. In this field, optimization and reconfigurability of optical nonlinear activation functions (ONAF) devices are crucial. In this work, we propose an optimization framework for MZI and MRR-based ONAF devices using the Covariance Matrix Adaptation Evolution Strategy (CMA-ES). By incorporating this approach into the optimization of artificial neural networks (ANNs), the root mean square error (RMSE) of the fitted optimization functions can reach as low as −86.0 dB. We also present a theoretical analysis of electro-optical (E/O) modulation in directional couplers (DC), enabling reconfigurable power coupling coefficients. Additionally, we investigate an alternative MRR configuration where output light is coupled with MZI from the drop port instead of the through port, leading to optimal performance. This setup allows for the implementation of several activation functions including Parametric ReLU, which attains 100 % training accuracy and 99.1 % validation accuracy in the MNIST handwritten digit recognition task. Our findings provide new insights into the design and optimization of optical activation devices for integrated photonic neural networks.
期刊介绍:
Optics Communications invites original and timely contributions containing new results in various fields of optics and photonics. The journal considers theoretical and experimental research in areas ranging from the fundamental properties of light to technological applications. Topics covered include classical and quantum optics, optical physics and light-matter interactions, lasers, imaging, guided-wave optics and optical information processing. Manuscripts should offer clear evidence of novelty and significance. Papers concentrating on mathematical and computational issues, with limited connection to optics, are not suitable for publication in the Journal. Similarly, small technical advances, or papers concerned only with engineering applications or issues of materials science fall outside the journal scope.