Skip to main navigation Skip to search Skip to main content

End-to-end hardware-in-the-loop temperature control for semiconductor laser based on deep reinforcement learning

  • East China Normal University
  • Nanjing University of Aeronautics and Astronautics

Research output: Contribution to journalArticlepeer-review

Abstract

This paper presents a temperature control system for semiconductor lasers, employing a novel end-to-end hardware-in-the-loop control strategy that integrates deep reinforcement learning control with mechanical structure design. Three mechanical structures were proposed and simulated to enhance temperature control flexibility and uniformity: single-stage (Fan only, thermoelectric cooler (TEC) only) and dual-stage (TEC + Fan). Simulations demonstrate that the dual-stage structure under the same driving conditions provides superior temperature control flexibility and uniformity. The key innovation lies in the end-to-end hardware-in-the-loop control strategy based on deep reinforcement learning. The End-to-end Deep Reinforcement Learning (E2EDRL) algorithm is capable of autonomously exploring optimal control policies without manual tuning. Simulation results demonstrate that, compared with the Proximal Policy Optimization (PPO) and Soft Actor-Critic (SAC) algorithms, E2EDRL not only achieves the highest performance metrics but also converges within a reasonable number of episodes (approximately 45). Compared with Proportional-Integral-Derivative (PID), E2EDRL achieves approximately 50% improvement in settling time, overshoot, and steady-state error. This approach achieves structure–algorithm synergy, thereby comprehensively accounting for system-level factors. Experimental results demonstrate substantial performance improvements, achieving temperature fluctuation control within ±0.8 °C, optical power fluctuations limited to 2.2%, and an 87% reduction in central wavelength redshift. Furthermore, the system demonstrates robust intelligent behavior and adaptability across a broad range of temperature control scenarios, underscoring its potential for advanced thermal management applications in semiconductor lasers.

Original languageEnglish
JournalEngineering Research Express
Volume8
Issue number9
DOIs
StatePublished - May 2026

Keywords

  • hardware-in-the-loop
  • online learning
  • reinforcement learning
  • semiconductor laser
  • temperature control

Fingerprint

Dive into the research topics of 'End-to-end hardware-in-the-loop temperature control for semiconductor laser based on deep reinforcement learning'. Together they form a unique fingerprint.

Cite this