聊天到芯片：基于大语言模型的任意形状元表面的设计

IF 6.6 2区物理与天体物理 Q1 MATERIALS SCIENCE, MULTIDISCIPLINARY

Nanophotonics Pub Date : 2025-10-09 DOI:10.1515/nanoph-2025-0343

Huanshu Zhang, Lei Kang, Sawyer D. Campbell, Douglas H. Werner

{"title":"聊天到芯片：基于大语言模型的任意形状元表面的设计","authors":"Huanshu Zhang, Lei Kang, Sawyer D. Campbell, Douglas H. Werner","doi":"10.1515/nanoph-2025-0343","DOIUrl":null,"url":null,"abstract":"Traditional metasurface design is limited by the computational cost of full-wave simulations, preventing thorough exploration of complex configurations. Data-driven approaches have emerged as a solution to this bottleneck, replacing costly simulations with rapid neural network evaluations and enabling near-instant design for meta-atoms. Despite advances, implementing a new optical function still requires building and training a task-specific network, along with exhaustive searches for suitable architectures and hyperparameters. Pre-trained large language models (LLMs), by contrast, sidestep this laborious process with a simple fine-tuning technique. However, applying LLMs to the design of nanophotonic devices, particularly for arbitrarily shaped metasurfaces, is still in its early stages; as such tasks often require graphical networks. Here, we show that an LLM, fed with descriptive inputs of arbitrarily shaped metasurface geometries, can learn the physical relationships needed for spectral prediction and inverse design. We further benchmarked a range of open-weight LLMs and identified relationships between accuracy and model size at the billion-parameter level. We demonstrated that 1-D token-wise LLMs provide a practical tool for designing 2-D arbitrarily shaped metasurfaces. Linking natural-language interaction to electromagnetic modelling, this “chat-to-chip” workflow represents a step toward more user-friendly data-driven nanophotonics.","PeriodicalId":19027,"journal":{"name":"Nanophotonics","volume":"213 1","pages":""},"PeriodicalIF":6.6000,"publicationDate":"2025-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Chat to chip: large language model based design of arbitrarily shaped metasurfaces\",\"authors\":\"Huanshu Zhang, Lei Kang, Sawyer D. Campbell, Douglas H. Werner\",\"doi\":\"10.1515/nanoph-2025-0343\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Traditional metasurface design is limited by the computational cost of full-wave simulations, preventing thorough exploration of complex configurations. Data-driven approaches have emerged as a solution to this bottleneck, replacing costly simulations with rapid neural network evaluations and enabling near-instant design for meta-atoms. Despite advances, implementing a new optical function still requires building and training a task-specific network, along with exhaustive searches for suitable architectures and hyperparameters. Pre-trained large language models (LLMs), by contrast, sidestep this laborious process with a simple fine-tuning technique. However, applying LLMs to the design of nanophotonic devices, particularly for arbitrarily shaped metasurfaces, is still in its early stages; as such tasks often require graphical networks. Here, we show that an LLM, fed with descriptive inputs of arbitrarily shaped metasurface geometries, can learn the physical relationships needed for spectral prediction and inverse design. We further benchmarked a range of open-weight LLMs and identified relationships between accuracy and model size at the billion-parameter level. We demonstrated that 1-D token-wise LLMs provide a practical tool for designing 2-D arbitrarily shaped metasurfaces. Linking natural-language interaction to electromagnetic modelling, this “chat-to-chip” workflow represents a step toward more user-friendly data-driven nanophotonics.\",\"PeriodicalId\":19027,\"journal\":{\"name\":\"Nanophotonics\",\"volume\":\"213 1\",\"pages\":\"\"},\"PeriodicalIF\":6.6000,\"publicationDate\":\"2025-10-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Nanophotonics\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://doi.org/10.1515/nanoph-2025-0343\",\"RegionNum\":2,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MATERIALS SCIENCE, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nanophotonics","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.1515/nanoph-2025-0343","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATERIALS SCIENCE, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

摘要

传统的超表面设计受到全波模拟计算成本的限制，阻碍了对复杂结构的深入探索。数据驱动的方法已经成为解决这一瓶颈的方法，用快速的神经网络评估取代昂贵的模拟，并实现元原子的近乎即时设计。尽管取得了进步，但实现新的光学功能仍然需要构建和训练特定于任务的网络，以及对合适架构和超参数的详尽搜索。相比之下，预训练的大型语言模型（llm）通过简单的微调技术避开了这个费力的过程。然而，将llm应用于纳米光子器件的设计，特别是任意形状的超表面，仍处于早期阶段；这样的任务通常需要图形网络。在这里，我们展示了一个LLM，用任意形状的超表面几何形状的描述性输入，可以学习光谱预测和逆设计所需的物理关系。我们进一步对一系列开重llm进行了基准测试，并在十亿参数级别确定了精度与模型尺寸之间的关系。我们证明了一维标记智能llm为设计二维任意形状的元表面提供了一个实用的工具。将自然语言交互与电磁建模相结合，这种“从聊天到芯片”的工作流程代表了向更用户友好的数据驱动纳米光子学迈出的一步。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Chat to chip: large language model based design of arbitrarily shaped metasurfaces

Traditional metasurface design is limited by the computational cost of full-wave simulations, preventing thorough exploration of complex configurations. Data-driven approaches have emerged as a solution to this bottleneck, replacing costly simulations with rapid neural network evaluations and enabling near-instant design for meta-atoms. Despite advances, implementing a new optical function still requires building and training a task-specific network, along with exhaustive searches for suitable architectures and hyperparameters. Pre-trained large language models (LLMs), by contrast, sidestep this laborious process with a simple fine-tuning technique. However, applying LLMs to the design of nanophotonic devices, particularly for arbitrarily shaped metasurfaces, is still in its early stages; as such tasks often require graphical networks. Here, we show that an LLM, fed with descriptive inputs of arbitrarily shaped metasurface geometries, can learn the physical relationships needed for spectral prediction and inverse design. We further benchmarked a range of open-weight LLMs and identified relationships between accuracy and model size at the billion-parameter level. We demonstrated that 1-D token-wise LLMs provide a practical tool for designing 2-D arbitrarily shaped metasurfaces. Linking natural-language interaction to electromagnetic modelling, this “chat-to-chip” workflow represents a step toward more user-friendly data-driven nanophotonics.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Nanophotonics NANOSCIENCE & NANOTECHNOLOGY-MATERIALS SCIENCE, MULTIDISCIPLINARY

CiteScore

13.50

自引率

6.70%

发文量

358

审稿时长

7 weeks

期刊介绍： Nanophotonics, published in collaboration with Sciencewise, is a prestigious journal that showcases recent international research results, notable advancements in the field, and innovative applications. It is regarded as one of the leading publications in the realm of nanophotonics and encompasses a range of article types including research articles, selectively invited reviews, letters, and perspectives. The journal specifically delves into the study of photon interaction with nano-structures, such as carbon nano-tubes, nano metal particles, nano crystals, semiconductor nano dots, photonic crystals, tissue, and DNA. It offers comprehensive coverage of the most up-to-date discoveries, making it an essential resource for physicists, engineers, and material scientists.