TY - GEN
T1 - Safe Controller Synthesis for Nonlinear Systems via Reinforcement Learning and PAC Approximation
AU - Zeng, Xia
AU - Liu, Banglong
AU - Zeng, Zhenbing
AU - Liu, Zhiming
AU - Yang, Zhengfeng
N1 - Publisher Copyright:
© 2024 Copyright is held by the owner/author(s). Publication rights licensed to ACM.
PY - 2024/11/7
Y1 - 2024/11/7
N2 - Controller synthesis for nonlinear systems is an important research issue. Deep Neural Network (DNN) control policies obtained through reinforcement learning (RL), though exhibiting good performance in simulations, cannot be applied to safety-critical systems for lack of formal guarantee. To address this, this paper considers fully utilizing the advantages of RL for complex control tasks to obtain a well-performing DNN controller. Then, using PAC (Probably Approximately Correct) techniques, a polynomial surrogate controller with probabilistically controllable approximation error is obtained. Finally, the safety of the control system under the designed polynomial controller is verified using barrier certificate generation. Experiments demonstrate the effectiveness of our method in generating controllers with safety guarantees for systems with high dimensions and degrees.
AB - Controller synthesis for nonlinear systems is an important research issue. Deep Neural Network (DNN) control policies obtained through reinforcement learning (RL), though exhibiting good performance in simulations, cannot be applied to safety-critical systems for lack of formal guarantee. To address this, this paper considers fully utilizing the advantages of RL for complex control tasks to obtain a well-performing DNN controller. Then, using PAC (Probably Approximately Correct) techniques, a polynomial surrogate controller with probabilistically controllable approximation error is obtained. Finally, the safety of the control system under the designed polynomial controller is verified using barrier certificate generation. Experiments demonstrate the effectiveness of our method in generating controllers with safety guarantees for systems with high dimensions and degrees.
KW - Barrier certificate
KW - Controller synthesis
KW - Formal verification
KW - Probably Approximately Correct
KW - Reinforcement learning
UR - https://www.scopus.com/pages/publications/85211100864
U2 - 10.1145/3649329.3657332
DO - 10.1145/3649329.3657332
M3 - 会议稿件
AN - SCOPUS:85211100864
T3 - Proceedings - Design Automation Conference
BT - Proceedings of the 61st ACM/IEEE Design Automation Conference, DAC 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 61st ACM/IEEE Design Automation Conference, DAC 2024
Y2 - 23 June 2024 through 27 June 2024
ER -