TY - JOUR
T1 - Photonic modes prediction via multi-modal diffusion model
AU - Sun, Jinyang
AU - Chen, Xi
AU - Wang, Xiumei
AU - Zhu, Dandan
AU - Zhou, Xingping
N1 - Publisher Copyright:
© 2024 The Author(s). Published by IOP Publishing Ltd.
PY - 2024/9/1
Y1 - 2024/9/1
N2 - The concept of photonic modes is the cornerstone in optics and photonics, which can describe the propagation of the light. The Maxwell’s equations play the role in calculating the mode field based on the structure information, while this process needs a great deal of computations, especially in the handle with a three-dimensional model. To overcome this obstacle, we introduce the multi-modal diffusion model to predict the photonic modes in one certain structure. The Contrastive Language-Image Pre-training (CLIP) model is used to build the connections between photonic structures and the corresponding modes. Then we exemplify Stable Diffusion (SD) model to realize the function of optical fields generation from structure information. Our work introduces multi-modal deep learning to construct complex mapping between structural information and optical field as high-dimensional vectors, and generates optical field images based on this mapping.
AB - The concept of photonic modes is the cornerstone in optics and photonics, which can describe the propagation of the light. The Maxwell’s equations play the role in calculating the mode field based on the structure information, while this process needs a great deal of computations, especially in the handle with a three-dimensional model. To overcome this obstacle, we introduce the multi-modal diffusion model to predict the photonic modes in one certain structure. The Contrastive Language-Image Pre-training (CLIP) model is used to build the connections between photonic structures and the corresponding modes. Then we exemplify Stable Diffusion (SD) model to realize the function of optical fields generation from structure information. Our work introduces multi-modal deep learning to construct complex mapping between structural information and optical field as high-dimensional vectors, and generates optical field images based on this mapping.
KW - contrastive language-image pre-training (CLIP)
KW - multi-modal diffusion model
KW - optical field generation
KW - photonic modes
KW - stable diffusion (SD) model
UR - https://www.scopus.com/pages/publications/85203258799
U2 - 10.1088/2632-2153/ad743f
DO - 10.1088/2632-2153/ad743f
M3 - 文章
AN - SCOPUS:85203258799
SN - 2632-2153
VL - 5
JO - Machine Learning: Science and Technology
JF - Machine Learning: Science and Technology
IS - 3
M1 - 035069
ER -