{"title":"An integrated large-scale photonic accelerator with ultralow latency","authors":"Shiyue Hua, Erwan Divita, Shanshan Yu, Bo Peng, Charles Roques-Carmes, Zhan Su, Zhang Chen, Yanfei Bai, Jinghui Zou, Yunpeng Zhu, Yelong Xu, Cheng-kuan Lu, Yuemiao Di, Hui Chen, Lushan Jiang, Lijie Wang, Longwu Ou, Chaohong Zhang, Junjie Chen, Wen Zhang, Hongyan Zhu, Weijun Kuang, Long Wang, Huaiyu Meng, Maurice Steinman, Yichen Shen","doi":"10.1038/s41586-025-08786-6","DOIUrl":"10.1038/s41586-025-08786-6","url":null,"abstract":"Integrated photonics, particularly silicon photonics, have emerged as cutting-edge technology driven by promising applications such as short-reach communications, autonomous driving, biosensing and photonic computing1–4. As advances in AI lead to growing computing demands, photonic computing has gained considerable attention as an appealing candidate. Nonetheless, there are substantial technical challenges in the scaling up of integrated photonics systems to realize these advantages, such as ensuring consistent performance gains in upscaled integrated device clusters, establishing standard designs and verification processes for complex circuits, as well as packaging large-scale systems. These obstacles arise primarily because of the relative immaturity of integrated photonics manufacturing and the scarcity of advanced packaging solutions involving photonics. Here we report a large-scale integrated photonic accelerator comprising more than 16,000 photonic components. The accelerator is designed to deliver standard linear matrix multiply–accumulate (MAC) functions, enabling computing with high speed up to 1 GHz frequency and low latency as small as 3 ns per cycle. Logic, memory and control functions that support photonic matrix MAC operations were designed into a cointegrated electronics chip. To seamlessly integrate the electronics and photonics chips at the commercial scale, we have made use of an innovative 2.5D hybrid advanced packaging approach. Through the development of this accelerator system, we demonstrate an ultralow computation latency for heuristic solvers of computationally hard Ising problems whose performance greatly relies on the computing latency. A large-scale photonic accelerator comprising more than 16,000 components integrated on a single chip to process MAC operations is described, demonstrating ultralow latency and reduced computing time compared with a commercially available GPU.","PeriodicalId":18787,"journal":{"name":"Nature","volume":"640 8058","pages":"361-367"},"PeriodicalIF":50.5,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.com/articles/s41586-025-08786-6.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143806299","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}