Agentic metasurface design with self-correcting language-model systems

Bei Wu; Bo Xiong; Haiyao Luo; Hongsheng Chen; Li Zhang; Qiaolu Chen; Yaqi Li; Yihao Yang

arxiv: 2605.22647 · v2 · pith:5ORWFXSLnew · submitted 2026-05-21 · ⚛️ physics.optics

Agentic metasurface design with self-correcting language-model systems

Bei Wu , Bo Xiong , Haiyao Luo , Yaqi Li , Li Zhang , Qiaolu Chen , Hongsheng Chen , Yihao Yang This is my paper

Pith reviewed 2026-05-25 02:32 UTC · model grok-4.3

classification ⚛️ physics.optics

keywords metasurface designlanguage modelsagentic systemsself-correctionmetalenshologramoptical design

0 comments

The pith

A language-model system with a dedicated verifier autonomously plans, executes, and repairs long chains of metasurface design tasks.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents MetaDesigner, a language-model system that takes a natural-language optical objective and produces complete metasurface designs by planning routes, retrieving knowledge, invoking tools, generating code, and correcting its own errors. Demonstrations include an RGB metalens with three independent focal spots, a six-plane full-color hologram reaching 0.97 average SSIM, and an optoelectronic hybrid neural network for image style transfer. These tasks span 74 to 136 reasoning steps and involve self-correction of errors in frequency mapping, numerical aperture estimation, network-parameter counting, and loss-function description. The work shows that language-model systems can manage extended, error-prone optical design processes rather than being limited to simple or predefined pipelines.

Core claim

MetaDesigner establishes a self-correcting route to agentic metasurface design. From a natural-language optical objective, the system plans the design route, retrieves domain knowledge, invokes simulation and optimization tools, generates missing tool code, and identifies errors through a dedicated Verifier. It completes three tasks of increasing complexity—an RGB metalens, a six-plane full-color hologram, and an optoelectronic hybrid neural network—requiring 74, 136, and 90 reasoning steps respectively, while self-correcting errors in frequency mapping, numerical aperture estimation, network-parameter counting, and loss-function description.

What carries the argument

MetaDesigner, a language-model system equipped with a dedicated Verifier module that detects and corrects domain-specific errors across extended design chains.

If this is right

Language-model systems can carry out complete optical design workflows that include modeling, simulation, coding, optimization, and evaluation.
Self-correction allows reliable handling of errors that arise in design chains spanning dozens to over one hundred reasoning steps.
The approach supports metasurface applications such as metalenses, holography, and optical computing that exceed the scope of predefined pipelines or simple layout generation.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same self-correcting structure could be tested on design problems in adjacent fields such as photonic integrated circuits or metamaterial optimization.
If the verifier continues to scale without introducing new errors, the method might enable iterative design loops that currently require repeated human oversight.
Integration with additional real-time simulation feedback could further shorten the number of reasoning steps needed for convergence.

Load-bearing premise

The dedicated Verifier module can reliably detect and correct domain-specific errors across design chains of 70–140 reasoning steps without introducing new uncorrected mistakes.

What would settle it

Give the system a metasurface design task containing known injected errors in frequency mapping or network-parameter counting, then check whether the verifier corrects every injected error without adding new uncorrected mistakes in the final output.

read the original abstract

Automated metasurface design is increasingly important, and recent advances in language-model systems are opening a route toward agentic optical design. Yet modern metasurface applications, from metalenses and holography to optical computing, require long design chains spanning modeling, simulation, coding, optimization and evaluation. These chains are error-prone, whereas existing language-model-based metasurface tools remain largely limited to simple objectives, predefined pipelines or language-to-layout generation. Here we introduce MetaDesigner, a self-correcting language-model system for agentic metasurface design. From a natural-language optical objective, MetaDesigner plans the design route, retrieves domain knowledge, invokes simulation and optimization tools, generates missing tool code and identifies errors through a dedicated Verifier. We demonstrate three tasks of increasing complexity: an RGB metalens with three independent focal spots, a six-plane full-color hologram with an average structural similarity index measure (SSIM) of 0.97, and an optoelectronic hybrid neural network for image style transfer. These tasks require 74, 136 and 90 reasoning steps, respectively, and the system self-corrects errors in frequency mapping, numerical aperture estimation, network-parameter counting and loss-function description. These results establish MetaDesigner as a self-correcting route to agentic metasurface design, where language-model systems can not only execute optical design tasks but also extend, inspect and repair the design process itself.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

MetaDesigner adds self-correction to LLM-driven metasurface design chains, but the paper gives no counts or validation of how well the verifier actually works.

read the letter

The paper's core contribution is a system called MetaDesigner that lets a language model handle the full sequence of metasurface design steps—planning, tool calling, code generation, simulation, and error fixing—on tasks that run 74 to 136 steps long. It shows this on an RGB metalens, a six-plane hologram reaching SSIM 0.97, and a hybrid optoelectronic network for style transfer, with the model correcting specific mistakes in frequency mapping, NA estimation, parameter counts, and loss functions. That is the new piece relative to earlier LM optics tools that stayed inside fixed pipelines or simpler goals. The demonstrations are concrete enough to show the architecture can reach working designs on non-trivial problems. The soft spot is exactly where the stress-test note points: the abstract and description assert that the dedicated Verifier caught and repaired errors across those long chains, yet supply no totals for errors encountered, no success fraction, and no post-correction checks confirming the outputs were free of new or residual mistakes. Without those numbers or logs, the self-correction claim stays unmeasured. The rest of the work—tool integration, domain retrieval, and step counts—looks like standard agent scaffolding applied to optics. This paper is aimed at groups already building LLM agents for photonics or inverse design. A reader who wants to see whether agentic systems can close the loop on real optical workflows will find the examples useful to examine, even if the verifier performance still needs tighter data. It is worth sending to peer review so referees can check the methods section and any error logs that may exist beyond the abstract.

Referee Report

1 major / 2 minor

Summary. The manuscript introduces MetaDesigner, a self-correcting language-model agent system for metasurface design. Starting from natural-language objectives, the system plans design routes, retrieves domain knowledge, invokes simulation/optimization tools, generates code, and employs a dedicated Verifier module to detect and repair errors. Three demonstrations are presented: an RGB metalens with three focal spots, a six-plane full-color hologram (SSIM 0.97), and an optoelectronic hybrid neural network for style transfer. These require 74–136 reasoning steps and involve self-correction of specific errors (frequency mapping, NA estimation, parameter counting, loss-function description). The central claim is that this establishes a self-correcting route to agentic metasurface design.

Significance. If the Verifier's reliability across long chains is quantitatively established, the work would be significant for automating complex, multi-stage optical design tasks that exceed the scope of prior LM-based metasurface tools. The demonstrations of 70–140-step workflows and the reported SSIM=0.97 for the hologram constitute concrete progress; the explicit credit for machine-checked or reproducible elements is absent, but the task complexity itself is a strength.

major comments (1)

[Abstract / Results] Abstract and Results (demonstration sections): The central claim that the dedicated Verifier reliably detects and corrects domain-specific errors across 74–136-step chains without introducing new uncorrected mistakes is load-bearing, yet no quantitative metrics are supplied—neither total errors encountered, fraction successfully corrected, nor post-correction validation (e.g., residual-error audits or independent verification runs). This leaves the self-correction capability as an assertion rather than a measured outcome.

minor comments (2)

[Methods] Methods section: The description of how the Verifier interfaces with simulation tools and code-generation steps lacks sufficient detail on prompt templates, error taxonomy, or fallback mechanisms to allow reproduction.
[Figures] Figure captions: The hologram and metalens result figures would benefit from explicit annotation of which errors were corrected at which steps to link visuals directly to the self-correction narrative.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for their constructive review and recommendation of major revision. We address the single major comment point by point below.

read point-by-point responses

Referee: [Abstract / Results] Abstract and Results (demonstration sections): The central claim that the dedicated Verifier reliably detects and corrects domain-specific errors across 74–136-step chains without introducing new uncorrected mistakes is load-bearing, yet no quantitative metrics are supplied—neither total errors encountered, fraction successfully corrected, nor post-correction validation (e.g., residual-error audits or independent verification runs). This leaves the self-correction capability as an assertion rather than a measured outcome.

Authors: We agree that the manuscript presents the Verifier's operation through concrete, task-specific examples of error detection and correction rather than aggregate quantitative metrics such as total errors encountered or correction success rates. The demonstrations establish that the system completed long chains only after the described corrections, but this does not constitute a systematic audit. In the revised manuscript we will add a new subsection under Results that logs, for each demonstration, the errors flagged by the Verifier, the corrections implemented, and the outcome of subsequent validation runs (including any residual discrepancies). This will convert the current illustrative evidence into a measured evaluation. revision: yes

Circularity Check

0 steps flagged

No circularity: system demonstration without derivation chain

full rationale

The paper describes an implemented AI system (MetaDesigner) for metasurface design tasks and reports its performance on three specific examples (RGB metalens, hologram, hybrid neural network). No equations, fitted parameters, predictions, or first-principles derivations are present that could reduce to their own inputs. The self-correction capability is asserted via the Verifier module's behavior on the demonstrated chains, but this is an empirical outcome of the system run rather than a mathematical reduction or self-citation dependency. The reader's assessment of score 1.0 aligns with the absence of any load-bearing circular structure.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review supplies no explicit free parameters, axioms, or invented entities; the system is described as building on existing language models and simulation tools.

pith-pipeline@v0.9.0 · 5798 in / 1104 out tokens · 32806 ms · 2026-05-25T02:32:14.897123+00:00 · methodology

Review history (2 revisions) →

discussion (0)

Reference graph

Works this paper leans on

48 extracted references · 48 canonical work pages · 3 internal anchors

[1]

Enabling smart vision with metasurfaces

Neshev, D.N., & Miroshnichenko, A.E. Enabling smart vision with metasurfaces. Nat. Photon. 17, 26-35 (2023)

work page 2023
[2]

Programmable metasurfaces for future photonic artificial intelligence

Abou-Hamdan, L., et al . Programmable metasurfaces for future photonic artificial intelligence. Nat. Rev. Phys. 7, 331-347 (2025)

work page 2025
[3]

Switchable 2D -3D display through a metasurface lenticular lens

Moon, S., et al . Switchable 2D -3D display through a metasurface lenticular lens. Nature 652, 1181-1187 (2026)

work page 2026
[4]

3D metamaterials

Kadic, M., et al. 3D metamaterials. Nat. Rev. Phys. 1, 198-210 (2019)

work page 2019
[5]

Neural phase microscopy with metasurface optics for real- time and nanoscale quantitative phase imaging

Lee, GY., et al. Neural phase microscopy with metasurface optics for real- time and nanoscale quantitative phase imaging. Nat. Commun. 17, 1411 (2026)

work page 2026
[6]

Spatial light modulator via optically addressed metasurface

Fan, X., et al. Spatial light modulator via optically addressed metasurface. Nat. Nanotechnol. 21, 561-570 (2026)

work page 2026
[7]

Metasurface optics for on -demand polarization transformations along the optical path

Dorrah, A.H., et al. Metasurface optics for on -demand polarization transformations along the optical path. Nat. Photon. 15, 287-296 (2021)

work page 2021
[8]

Breaking the limitation of polarization multiplexing in optical metasurfaces with engineered noise

Xiong, B., et al. Breaking the limitation of polarization multiplexing in optical metasurfaces with engineered noise. Science 379, 294-299 (2023)

work page 2023
[9]

Complex-amplitude metasurface-based orbital angular momentum holography in momentum space

Ren, H., et al. Complex-amplitude metasurface-based orbital angular momentum holography in momentum space. Nat. Nanotechnol. 15, 948-955 (2020)

work page 2020
[10]

Discontinuous orbital angular momentum metasurface holography

Gao, X., et al. Discontinuous orbital angular momentum metasurface holography. Nat. Commun. 16, 10688 (2025)

work page 2025
[11]

Planar metasurface retroreflector

Arbabi, A., et al. Planar metasurface retroreflector. Nat. Photon. 11, 415-420 (2017)

work page 2017
[12]

A broadband achromatic metalens in the visible

Wang, S., et al. A broadband achromatic metalens in the visible. Nat. Nanotechnol. 13, 227-232 (2018)

work page 2018
[13]

Mid-infrared polarization-controlled broadband achromatic metadevice

Ou, K., et al. Mid-infrared polarization-controlled broadband achromatic metadevice. Sci. Adv. 6, eabc0711 (2020)

work page 2020
[14]

Towards real- time photorealistic 3D holography with deep neural networks

Shi, L., Li, B., Kim, C., Kellnhofer, P., & Matusik, W. Towards real- time photorealistic 3D holography with deep neural networks. Nature 591, 234-239 (2021). 16

work page 2021
[15]

Full-colour 3D holographic augmented -reality displays with metasurface waveguides

Gopakumar, M., et al. Full-colour 3D holographic augmented -reality displays with metasurface waveguides. Nature 629, 791-797 (2024)

work page 2024
[16]

Propagation -adaptive 4K computer -generated holography using physics -constrained spatial and Fourier neural operator

Liu, N., Liu, K., Yang, Y., Peng, Y., & Cao, L. Propagation -adaptive 4K computer -generated holography using physics -constrained spatial and Fourier neural operator. Nat . Commun. 16, 7761 (2025)

work page 2025
[17]

All-optical machine learning using diffractive deep neural networks

Lin, X., et al . All-optical machine learning using diffractive deep neural networks. Science 361, 1004-1008 (2018)

work page 2018
[18]

All- optical image transportation through a multimode fibre using a miniaturized diffractive neural network on the distal facet

Yu, H., et al. All- optical image transportation through a multimode fibre using a miniaturized diffractive neural network on the distal facet. Nat. Photon. 19, 486-493 (2025)

work page 2025
[19]

All-optical synthesis chip for large -scale intelligent semantic vision generation

Chen, Y., et al. All-optical synthesis chip for large -scale intelligent semantic vision generation. Science 390, 1259-1265 (2025)

work page 2025
[20]

& Huang, X

Li, W., Meng, F., Chen, Y., Li, Y. & Huang, X. Topology optimization of photonic and phononic crystals and metamaterials: a review. Adv. Theory Simul. 2, 1900017 (2019)

work page 2019
[21]

Hegde, R. S. Photonics inverse design: pairing deep neural networks with evolutionary algorithms. IEEE J. Sel. Top. Quant. Electron. 26, https://doi.org/10.1109/JSTQE.2019.2933796 (2019)

work page doi:10.1109/jstqe.2019.2933796 2019
[22]

Deep learning for the design of photonic structures

Ma, W., et al. Deep learning for the design of photonic structures. Nat. Photon. 15, 77-90 (2021)

work page 2021
[23]

MetaSeeker: sketching an open invisible space with self-play reinforcement learning

Wu, B., et al. MetaSeeker: sketching an open invisible space with self-play reinforcement learning. Light: Sci. Appl. 14, 211 (2025)

work page 2025
[24]

Towards generalizable AI in medicine via Generalist -Specialist Collaboration

He, S., et al. Towards generalizable AI in medicine via Generalist -Specialist Collaboration. Nat. Biomed. Eng. (2026)

work page 2026
[25]

An agentic framework for autonomous scientific discovery in cancer pathology

Trost, F., et al. An agentic framework for autonomous scientific discovery in cancer pathology. Nat. Med. (2026)

work page 2026
[26]

A multi-agent framework combining large language models with medical flowcharts for self-triage

Liu, Y., et al. A multi-agent framework combining large language models with medical flowcharts for self-triage. Nat. Health (2026)

work page 2026
[27]

Empowering AI data scientists using a multi-agent LLM framework with self-evolving capabilities for autonomous, tool-aware biomedical data analyses

Bu, D., et al. Empowering AI data scientists using a multi-agent LLM framework with self-evolving capabilities for autonomous, tool-aware biomedical data analyses. Nat. Biomed. Eng. (2026)

work page 2026
[28]

Pan, Y. , et al. Multi -agent artificial intelligence designs novel catalysts for ultrafast water purification. Nat. Water (2026)

work page 2026
[29]

A robot operating system framework for using large language models in embodied AI

Mower, C.E., et al. A robot operating system framework for using large language models in embodied AI. Nat. Mach. Intell. 8, 313-325 (2026)

work page 2026
[30]

A multi-agentic framework for real- time, autonomous freeform metasurface design

Lupoiu, R., et al. A multi-agentic framework for real- time, autonomous freeform metasurface design. Sci. Adv. 11, eadx8006 (2025)

work page 2025
[31]

End-to-end autonomous scientific discovery on a real optical platform

Yang, S., et al. End-to-end autonomous scientific discovery on a real optical platform. Preprint at arXiv https://doi.org/10.48550/arXiv.2604.27092 (2026)

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2604.27092 2026
[32]

Agent0 -vl: Exploring self -evolving agent for tool -integrated vision -language reasoning

Liu, J., et al. Agent0 -vl: Exploring self -evolving agent for tool -integrated vision -language reasoning. Preprint at arXiv https://doi.org/10.48550/arXiv.2511.19900 (2025)

work page doi:10.48550/arxiv.2511.19900 2025
[33]

DeepSeek-V3 Technical Report

Liu, A., et al. Deepseek -v3 technical report. Preprint at arXiv https://doi.org/10.48550/arXiv.2412.19437 (2024)

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2412.19437 2024
[34]

Fully forward mode training for optical neural networks

Xue, Z., et al. Fully forward mode training for optical neural networks. Nature 632, 280 -286 (2024)

work page 2024
[35]

Retrieval-augmented generation for knowledge-intensive NLP tasks

Lewis, P., et al. Retrieval-augmented generation for knowledge-intensive NLP tasks. Adv. Neural Inf. Process. Syst. 33, 9459-9474 (2020)

work page 2020
[36]

Optical generative models

Chen, S., Li, Y., Wang, Y., Chen, H., & Ozcan, A. Optical generative models. Nature 644, 903-911 (2025)

work page 2025
[37]

All-optical synthesis chip for large -scale intelligent semantic vision generation

Chen, Y., et al. All-optical synthesis chip for large -scale intelligent semantic vision generation. Science 390, 1259-1265 (2025). 17

work page 2025
[38]

Goodman, J. W. Introduction to Fourier Optics. 3rd edn. (Roberts & Company Publishers, Englewood, 2005)

work page 2005
[39]

Deep learning design for multiwavelength infrared image sensors based on dielectric freeform metasurface

Xiong, B., et al. Deep learning design for multiwavelength infrared image sensors based on dielectric freeform metasurface. Adv. Opt. Mater. 12, 2302200 (2024)

work page 2024
[40]

High-Efficiency Nanophotonic Spectral Router for Multispectral Imaging Enabled by Hierarchical Parametrization-Based Inverse Design

Wei, W., et al. High-Efficiency Nanophotonic Spectral Router for Multispectral Imaging Enabled by Hierarchical Parametrization-Based Inverse Design. ACS Photon. 13, 2712-2720 (2026)

work page 2026
[41]

A broadband achromatic metalens for focusing and imaging in the visible

Chen, W.T., et al. A broadband achromatic metalens for focusing and imaging in the visible. Nat. Nanotech. 13, 220-226 (2018)

work page 2018
[42]

Achromatic metalenses for full visible spectrum with extended group delay control via dispersion-matched layers

Chang, S., et al. Achromatic metalenses for full visible spectrum with extended group delay control via dispersion-matched layers. Nat. Commun. 15, 9627 (2024)

work page 2024
[43]

3D -printed aberration-free terahertz metalens for ultra -broadband achromatic super-resolution wide-angle imaging with high numerical aperture

Chen, J., et al. 3D -printed aberration-free terahertz metalens for ultra -broadband achromatic super-resolution wide-angle imaging with high numerical aperture. Nat. Commun. 16, 363 (2025)

work page 2025
[44]

Backpropagation-free training of deep physical neural networks

Momeni, A., Rahmani, B., Malléjac, M., Del Hougne, P., & Fleury, R. Backpropagation-free training of deep physical neural networks. Science 382, 1297-1303 (2023)

work page 2023
[45]

Nonlinear optical encoding enabled by recurrent linear scattering

Xia, F., et al. Nonlinear optical encoding enabled by recurrent linear scattering. Nat. Photon. 18, 1067-1075 (2024)

work page 2024
[46]

K., & Huang, C

Wang, D., Nie, Y., Hu, G., Tsang, H. K., & Huang, C. Ultrafast silicon photonic reservoir computing engine delivering over 200 TOPS. Nat. Commun. 15, 10841 (2024)

work page 2024
[47]

Time -synthetic optical neural networks with stable programmable gain

Wu, B., et al. Time -synthetic optical neural networks with stable programmable gain. Nat. Commun. (2026)

work page 2026
[48]

Very Deep Convolutional Networks for Large-Scale Image Recognition

Simonyan, K., & Zisserman, A. Very deep convolutional networks for large -scale image recognition. Preprint at arXiv https://doi.org/10.48550/arXiv.1409.1556 (2014). Acknowledgments Y.Y. discloses support for the research of this work from the Key Research and Development Program of the Ministry of Science and Technology [grant number s 2022YFA1405200 and...

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.1409.1556 2014

[1] [1]

Enabling smart vision with metasurfaces

Neshev, D.N., & Miroshnichenko, A.E. Enabling smart vision with metasurfaces. Nat. Photon. 17, 26-35 (2023)

work page 2023

[2] [2]

Programmable metasurfaces for future photonic artificial intelligence

Abou-Hamdan, L., et al . Programmable metasurfaces for future photonic artificial intelligence. Nat. Rev. Phys. 7, 331-347 (2025)

work page 2025

[3] [3]

Switchable 2D -3D display through a metasurface lenticular lens

Moon, S., et al . Switchable 2D -3D display through a metasurface lenticular lens. Nature 652, 1181-1187 (2026)

work page 2026

[4] [4]

3D metamaterials

Kadic, M., et al. 3D metamaterials. Nat. Rev. Phys. 1, 198-210 (2019)

work page 2019

[5] [5]

Neural phase microscopy with metasurface optics for real- time and nanoscale quantitative phase imaging

Lee, GY., et al. Neural phase microscopy with metasurface optics for real- time and nanoscale quantitative phase imaging. Nat. Commun. 17, 1411 (2026)

work page 2026

[6] [6]

Spatial light modulator via optically addressed metasurface

Fan, X., et al. Spatial light modulator via optically addressed metasurface. Nat. Nanotechnol. 21, 561-570 (2026)

work page 2026

[7] [7]

Metasurface optics for on -demand polarization transformations along the optical path

Dorrah, A.H., et al. Metasurface optics for on -demand polarization transformations along the optical path. Nat. Photon. 15, 287-296 (2021)

work page 2021

[8] [8]

Breaking the limitation of polarization multiplexing in optical metasurfaces with engineered noise

Xiong, B., et al. Breaking the limitation of polarization multiplexing in optical metasurfaces with engineered noise. Science 379, 294-299 (2023)

work page 2023

[9] [9]

Complex-amplitude metasurface-based orbital angular momentum holography in momentum space

Ren, H., et al. Complex-amplitude metasurface-based orbital angular momentum holography in momentum space. Nat. Nanotechnol. 15, 948-955 (2020)

work page 2020

[10] [10]

Discontinuous orbital angular momentum metasurface holography

Gao, X., et al. Discontinuous orbital angular momentum metasurface holography. Nat. Commun. 16, 10688 (2025)

work page 2025

[11] [11]

Planar metasurface retroreflector

Arbabi, A., et al. Planar metasurface retroreflector. Nat. Photon. 11, 415-420 (2017)

work page 2017

[12] [12]

A broadband achromatic metalens in the visible

Wang, S., et al. A broadband achromatic metalens in the visible. Nat. Nanotechnol. 13, 227-232 (2018)

work page 2018

[13] [13]

Mid-infrared polarization-controlled broadband achromatic metadevice

Ou, K., et al. Mid-infrared polarization-controlled broadband achromatic metadevice. Sci. Adv. 6, eabc0711 (2020)

work page 2020

[14] [14]

Towards real- time photorealistic 3D holography with deep neural networks

Shi, L., Li, B., Kim, C., Kellnhofer, P., & Matusik, W. Towards real- time photorealistic 3D holography with deep neural networks. Nature 591, 234-239 (2021). 16

work page 2021

[15] [15]

Full-colour 3D holographic augmented -reality displays with metasurface waveguides

Gopakumar, M., et al. Full-colour 3D holographic augmented -reality displays with metasurface waveguides. Nature 629, 791-797 (2024)

work page 2024

[16] [16]

Propagation -adaptive 4K computer -generated holography using physics -constrained spatial and Fourier neural operator

Liu, N., Liu, K., Yang, Y., Peng, Y., & Cao, L. Propagation -adaptive 4K computer -generated holography using physics -constrained spatial and Fourier neural operator. Nat . Commun. 16, 7761 (2025)

work page 2025

[17] [17]

All-optical machine learning using diffractive deep neural networks

Lin, X., et al . All-optical machine learning using diffractive deep neural networks. Science 361, 1004-1008 (2018)

work page 2018

[18] [18]

All- optical image transportation through a multimode fibre using a miniaturized diffractive neural network on the distal facet

Yu, H., et al. All- optical image transportation through a multimode fibre using a miniaturized diffractive neural network on the distal facet. Nat. Photon. 19, 486-493 (2025)

work page 2025

[19] [19]

All-optical synthesis chip for large -scale intelligent semantic vision generation

Chen, Y., et al. All-optical synthesis chip for large -scale intelligent semantic vision generation. Science 390, 1259-1265 (2025)

work page 2025

[20] [20]

& Huang, X

Li, W., Meng, F., Chen, Y., Li, Y. & Huang, X. Topology optimization of photonic and phononic crystals and metamaterials: a review. Adv. Theory Simul. 2, 1900017 (2019)

work page 2019

[21] [21]

Hegde, R. S. Photonics inverse design: pairing deep neural networks with evolutionary algorithms. IEEE J. Sel. Top. Quant. Electron. 26, https://doi.org/10.1109/JSTQE.2019.2933796 (2019)

work page doi:10.1109/jstqe.2019.2933796 2019

[22] [22]

Deep learning for the design of photonic structures

Ma, W., et al. Deep learning for the design of photonic structures. Nat. Photon. 15, 77-90 (2021)

work page 2021

[23] [23]

MetaSeeker: sketching an open invisible space with self-play reinforcement learning

Wu, B., et al. MetaSeeker: sketching an open invisible space with self-play reinforcement learning. Light: Sci. Appl. 14, 211 (2025)

work page 2025

[24] [24]

Towards generalizable AI in medicine via Generalist -Specialist Collaboration

He, S., et al. Towards generalizable AI in medicine via Generalist -Specialist Collaboration. Nat. Biomed. Eng. (2026)

work page 2026

[25] [25]

An agentic framework for autonomous scientific discovery in cancer pathology

Trost, F., et al. An agentic framework for autonomous scientific discovery in cancer pathology. Nat. Med. (2026)

work page 2026

[26] [26]

A multi-agent framework combining large language models with medical flowcharts for self-triage

Liu, Y., et al. A multi-agent framework combining large language models with medical flowcharts for self-triage. Nat. Health (2026)

work page 2026

[27] [27]

Empowering AI data scientists using a multi-agent LLM framework with self-evolving capabilities for autonomous, tool-aware biomedical data analyses

Bu, D., et al. Empowering AI data scientists using a multi-agent LLM framework with self-evolving capabilities for autonomous, tool-aware biomedical data analyses. Nat. Biomed. Eng. (2026)

work page 2026

[28] [28]

Pan, Y. , et al. Multi -agent artificial intelligence designs novel catalysts for ultrafast water purification. Nat. Water (2026)

work page 2026

[29] [29]

A robot operating system framework for using large language models in embodied AI

Mower, C.E., et al. A robot operating system framework for using large language models in embodied AI. Nat. Mach. Intell. 8, 313-325 (2026)

work page 2026

[30] [30]

A multi-agentic framework for real- time, autonomous freeform metasurface design

Lupoiu, R., et al. A multi-agentic framework for real- time, autonomous freeform metasurface design. Sci. Adv. 11, eadx8006 (2025)

work page 2025

[31] [31]

End-to-end autonomous scientific discovery on a real optical platform

Yang, S., et al. End-to-end autonomous scientific discovery on a real optical platform. Preprint at arXiv https://doi.org/10.48550/arXiv.2604.27092 (2026)

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2604.27092 2026

[32] [32]

Agent0 -vl: Exploring self -evolving agent for tool -integrated vision -language reasoning

Liu, J., et al. Agent0 -vl: Exploring self -evolving agent for tool -integrated vision -language reasoning. Preprint at arXiv https://doi.org/10.48550/arXiv.2511.19900 (2025)

work page doi:10.48550/arxiv.2511.19900 2025

[33] [33]

DeepSeek-V3 Technical Report

Liu, A., et al. Deepseek -v3 technical report. Preprint at arXiv https://doi.org/10.48550/arXiv.2412.19437 (2024)

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2412.19437 2024

[34] [34]

Fully forward mode training for optical neural networks

Xue, Z., et al. Fully forward mode training for optical neural networks. Nature 632, 280 -286 (2024)

work page 2024

[35] [35]

Retrieval-augmented generation for knowledge-intensive NLP tasks

Lewis, P., et al. Retrieval-augmented generation for knowledge-intensive NLP tasks. Adv. Neural Inf. Process. Syst. 33, 9459-9474 (2020)

work page 2020

[36] [36]

Optical generative models

Chen, S., Li, Y., Wang, Y., Chen, H., & Ozcan, A. Optical generative models. Nature 644, 903-911 (2025)

work page 2025

[37] [37]

All-optical synthesis chip for large -scale intelligent semantic vision generation

Chen, Y., et al. All-optical synthesis chip for large -scale intelligent semantic vision generation. Science 390, 1259-1265 (2025). 17

work page 2025

[38] [38]

Goodman, J. W. Introduction to Fourier Optics. 3rd edn. (Roberts & Company Publishers, Englewood, 2005)

work page 2005

[39] [39]

Deep learning design for multiwavelength infrared image sensors based on dielectric freeform metasurface

Xiong, B., et al. Deep learning design for multiwavelength infrared image sensors based on dielectric freeform metasurface. Adv. Opt. Mater. 12, 2302200 (2024)

work page 2024

[40] [40]

High-Efficiency Nanophotonic Spectral Router for Multispectral Imaging Enabled by Hierarchical Parametrization-Based Inverse Design

Wei, W., et al. High-Efficiency Nanophotonic Spectral Router for Multispectral Imaging Enabled by Hierarchical Parametrization-Based Inverse Design. ACS Photon. 13, 2712-2720 (2026)

work page 2026

[41] [41]

A broadband achromatic metalens for focusing and imaging in the visible

Chen, W.T., et al. A broadband achromatic metalens for focusing and imaging in the visible. Nat. Nanotech. 13, 220-226 (2018)

work page 2018

[42] [42]

Achromatic metalenses for full visible spectrum with extended group delay control via dispersion-matched layers

Chang, S., et al. Achromatic metalenses for full visible spectrum with extended group delay control via dispersion-matched layers. Nat. Commun. 15, 9627 (2024)

work page 2024

[43] [43]

3D -printed aberration-free terahertz metalens for ultra -broadband achromatic super-resolution wide-angle imaging with high numerical aperture

Chen, J., et al. 3D -printed aberration-free terahertz metalens for ultra -broadband achromatic super-resolution wide-angle imaging with high numerical aperture. Nat. Commun. 16, 363 (2025)

work page 2025

[44] [44]

Backpropagation-free training of deep physical neural networks

Momeni, A., Rahmani, B., Malléjac, M., Del Hougne, P., & Fleury, R. Backpropagation-free training of deep physical neural networks. Science 382, 1297-1303 (2023)

work page 2023

[45] [45]

Nonlinear optical encoding enabled by recurrent linear scattering

Xia, F., et al. Nonlinear optical encoding enabled by recurrent linear scattering. Nat. Photon. 18, 1067-1075 (2024)

work page 2024

[46] [46]

K., & Huang, C

Wang, D., Nie, Y., Hu, G., Tsang, H. K., & Huang, C. Ultrafast silicon photonic reservoir computing engine delivering over 200 TOPS. Nat. Commun. 15, 10841 (2024)

work page 2024

[47] [47]

Time -synthetic optical neural networks with stable programmable gain

Wu, B., et al. Time -synthetic optical neural networks with stable programmable gain. Nat. Commun. (2026)

work page 2026

[48] [48]

Very Deep Convolutional Networks for Large-Scale Image Recognition

Simonyan, K., & Zisserman, A. Very deep convolutional networks for large -scale image recognition. Preprint at arXiv https://doi.org/10.48550/arXiv.1409.1556 (2014). Acknowledgments Y.Y. discloses support for the research of this work from the Key Research and Development Program of the Ministry of Science and Technology [grant number s 2022YFA1405200 and...

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.1409.1556 2014