Abstract
This paper presents a temperature control system for semiconductor lasers that employs a novel end-to-end hardware-in-the-loop control strategy integrating deep reinforcement learning with mechanical structure design. Three mechanical structures were proposed and simulated to enhance temperature control flexibility and uniformity: two single-stage designs (fan only and thermoelectric cooler (TEC) only) and one dual-stage design (TEC + fan). Simulations demonstrate that, under the same driving conditions, the dual-stage structure provides superior temperature control flexibility and uniformity. The key innovation is the end-to-end hardware-in-the-loop control strategy based on deep reinforcement learning: the End-to-end Deep Reinforcement Learning (E2EDRL) algorithm autonomously explores optimal control policies without manual tuning. Simulation results demonstrate that, compared with the Proximal Policy Optimization (PPO) and Soft Actor-Critic (SAC) algorithms, E2EDRL not only achieves the highest performance metrics but also converges within a reasonable number of episodes (approximately 45). Compared with Proportional-Integral-Derivative (PID) control, E2EDRL achieves approximately 50% improvement in settling time, overshoot, and steady-state error. By combining structure and algorithm design, the approach achieves structure–algorithm synergy and accounts for system-level factors. Experimental results demonstrate substantial performance improvements: temperature fluctuations are held within ±0.8 °C, optical power fluctuations are limited to 2.2%, and central wavelength redshift is reduced by 87%. Furthermore, the system demonstrates robust, adaptive behavior across a broad range of temperature control scenarios, underscoring its potential for advanced thermal management applications in semiconductor lasers.
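The PID comparison above is stated in terms of settling time, overshoot, and steady-state error. As a rough illustration of how such metrics might be computed from a recorded temperature trace, the following is a minimal sketch. The function name `step_response_metrics`, the 2% tolerance band, the tail-averaging window, and the sample trace are all assumptions made for illustration; the paper's exact metric definitions are not given in the abstract.

```python
from statistics import mean


def step_response_metrics(times, temps, setpoint, tolerance=0.02, tail_fraction=0.1):
    """Return (overshoot %, settling time, steady-state error) for one step response.

    Assumed definitions (not taken from the paper):
      - overshoot: peak excursion beyond the setpoint, as a percentage of the step size
      - settling time: first sample time after which the trace stays within
        tolerance * step size of the setpoint (None if it never settles)
      - steady-state error: |setpoint - mean of the last tail_fraction of samples|
    """
    if len(times) != len(temps) or len(temps) < 2:
        raise ValueError("times and temps must have equal length >= 2")
    start = temps[0]
    step = setpoint - start
    if step == 0:
        raise ValueError("setpoint equals the initial temperature")

    # Normalized progress: 0 at the start, 1 at the setpoint, > 1 means overshoot.
    # Normalizing makes the same code work for heating and cooling steps.
    progress = [(t - start) / step for t in temps]
    overshoot_pct = max(0.0, max(progress) - 1.0) * 100.0

    # Walk backwards to find the last sample outside the tolerance band.
    settling_time = times[0]
    for i in range(len(progress) - 1, -1, -1):
        if abs(progress[i] - 1.0) > tolerance:
            settling_time = times[i + 1] if i + 1 < len(times) else None
            break

    tail = max(1, int(len(temps) * tail_fraction))
    steady_state_error = abs(setpoint - mean(temps[-tail:]))
    return overshoot_pct, settling_time, steady_state_error


# Illustrative, made-up cooling trace (seconds, degrees Celsius); not measured data.
example_times = [0, 1, 2, 3, 4, 5, 6, 7, 8, 9]
example_temps = [40.0, 30.0, 24.0, 25.5, 25.1, 25.0, 25.0, 25.0, 25.0, 25.0]
overshoot, settling, sse = step_response_metrics(example_times, example_temps, setpoint=25.0)
```

Running the same function on traces from two controllers would give a like-for-like comparison, provided both traces use the same setpoint, sampling, and tolerance band.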
| Original language | English |
|---|---|
| Journal | Engineering Research Express |
| Volume | 8 |
| Issue number | 9 |
| DOIs | |
| State | Published - May 2026 |
Keywords
- hardware-in-the-loop
- online learning
- reinforcement learning
- semiconductor laser
- temperature control
Fingerprint
Dive into the research topics of 'End-to-end hardware-in-the-loop temperature control for semiconductor laser based on deep reinforcement learning'. Together they form a unique fingerprint.