跳到主要导航 跳到搜索 跳到主要内容

Safe Reinforcement Learning for NN-Controlled Systems with Neural Barrier Certificate Guidance

  • Hanrui Zhao
  • , Mengxin Ren
  • , Banglong Liu
  • , Niuniu Qi
  • , Xia Zeng
  • , Zhenbing Zeng
  • , Zhengfeng Yang*
  • *此作品的通讯作者
  • East China Normal University
  • Southwest University
  • Shanghai University

科研成果: 期刊稿件文章同行评审

摘要

Safe controller synthesis is crucial for safety-critical applications. This article presents a novel reinforcement learning (RL) approach to synthesize safe controllers for neural network (NN)-controlled systems. The core idea leverages an iterative scheme that combines controller learning with neural barrier certificate (BC) verification, ultimately producing a provably safe deep neural network (DNN) controller with formal safety guarantees. The process begins by pretraining a well-performing DNN controller as an 'oracle' via deep RL (DRL). To formally verify the safety properties of the closed-loop system under the base controller, we devise a formal verification procedure that approximates the DNN controller using polynomial inclusion, followed by synthesizing neural BCs via sum-of-squares (SOS) relaxation. In cases where the base controller is insufficient to yield a real BC, the current spurious BC is incorporated as an additional penalty term to reshape the RL reward function, guiding the iterative refinement for new controllers. We implement an automated tool, neural BC-guided safe RL NBCRL, and experimental results demonstrate the benefits of our method in terms of efficiency and scalability even for a nonlinear system with a dimension up to 12.

源语言英语
页(从-至)2460-2473
页数14
期刊IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
45
5
DOI
出版状态已出版 - 1 5月 2026

指纹

探究 'Safe Reinforcement Learning for NN-Controlled Systems with Neural Barrier Certificate Guidance' 的科研主题。它们共同构成独一无二的指纹。

引用此