TY - GEN
T1 - Hybrid Controller Synthesis for Nonlinear Systems Subject to Reach-Avoid Constraints
AU - Yang, Zhengfeng
AU - Zhang, Li
AU - Zeng, Xia
AU - Tang, Xiaochao
AU - Peng, Chao
AU - Zeng, Zhenbing
N1 - Publisher Copyright:
© 2023, The Author(s).
PY - 2023
Y1 - 2023
N2 - There is a pressing need for learning controllers to endow systems with properties of safety and goal-reaching, which are crucial for many safety-critical systems. Reinforcement learning (RL) has been deployed successfully to synthesize controllers from user-defined reward functions encoding desired system requirements. However, it remains a significant challenge in synthesizing provably correct controllers with safety and goal-reaching requirements. To address this issue, we try to design a special hybrid polynomial-DNN controller which is easy to verify without losing its expressiveness and flexibility. This paper proposes a novel method to synthesize such a hybrid controller based on RL, low-degree polynomial fitting and knowledge distillation. It also gives a computational approach, by building and solving a constrained optimization problem coming from verification conditions to produce barrier certificates and Lyapunov-like functions, which can guarantee every trajectory from the initial set of the system with the resulted controller satisfies the given safety and goal-reaching requirements. We evaluate the proposed hybrid controller synthesis method on a set of benchmark examples, including several high-dimensional systems. The results validate the effectiveness and applicability of our approach.
AB - There is a pressing need for learning controllers to endow systems with properties of safety and goal-reaching, which are crucial for many safety-critical systems. Reinforcement learning (RL) has been deployed successfully to synthesize controllers from user-defined reward functions encoding desired system requirements. However, it remains a significant challenge in synthesizing provably correct controllers with safety and goal-reaching requirements. To address this issue, we try to design a special hybrid polynomial-DNN controller which is easy to verify without losing its expressiveness and flexibility. This paper proposes a novel method to synthesize such a hybrid controller based on RL, low-degree polynomial fitting and knowledge distillation. It also gives a computational approach, by building and solving a constrained optimization problem coming from verification conditions to produce barrier certificates and Lyapunov-like functions, which can guarantee every trajectory from the initial set of the system with the resulted controller satisfies the given safety and goal-reaching requirements. We evaluate the proposed hybrid controller synthesis method on a set of benchmark examples, including several high-dimensional systems. The results validate the effectiveness and applicability of our approach.
KW - Barrier certificate
KW - Controller synthesis
KW - Formal verification
KW - Lyapunov-like function
KW - Reinforcement learning
UR - https://www.scopus.com/pages/publications/85169025232
U2 - 10.1007/978-3-031-37706-8_16
DO - 10.1007/978-3-031-37706-8_16
M3 - 会议稿件
AN - SCOPUS:85169025232
SN - 9783031377051
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 304
EP - 325
BT - Computer Aided Verification - 35th International Conference, CAV 2023, Proceedings
A2 - Enea, Constantin
A2 - Lal, Akash
PB - Springer Science and Business Media Deutschland GmbH
T2 - 35th International Conference on Computer Aided Verification, CAV 2023
Y2 - 17 July 2023 through 22 July 2023
ER -