跳到主要导航 跳到搜索 跳到主要内容

Safe Reinforcement Learning for CPSs via Formal Modeling and Verification

  • Chenchen Yang
  • , Jing Liu*
  • , Haiying Sun
  • , Junfeng Sun
  • , Xiang Chen
  • , Lipeng Zhang*
  • *此作品的通讯作者
  • East China Normal University
  • Casco Signal Ltd

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Reinforcement learning (RL) can be defined as the process of learning policies that maximize the expectation of the rewards. It has shown success in solving complex decision-making tasks. However, reinforcement learning-based controllers do not provide guarantees of safety of physical models in Cyber-physical systems (CPSs). In this paper, we propose a framework, which allows implementing RL to the safe control system by transforming formal analysis to learned policy. For satisfaction verification and quantitative analysis, we propose an uncertainty modeling language CSML to describe behaviors of the system, and transform CSML design into networks of probabilistic timed automata (NPTA). For safe learning, we present an algorithm called Safe Control with Formal Methods (SCFM). SCFM constructs a state set that obeys the constraint described by probabilistic computation tree logic (PCTL) via exploring state space before the learning process. The monitor monitors the system, determines whether the chosen action is safe and corrects unsafe decisions. We validate our method through experiments of lane-change control for autonomous cars.

源语言英语
主期刊名IJCNN 2021 - International Joint Conference on Neural Networks, Proceedings
出版商Institute of Electrical and Electronics Engineers Inc.
ISBN(电子版)9780738133669
DOI
出版状态已出版 - 18 7月 2021
活动2021 International Joint Conference on Neural Networks, IJCNN 2021 - Virtual, Online, 中国
期限: 18 7月 202122 7月 2021

出版系列

姓名Proceedings of the International Joint Conference on Neural Networks
2021-July
ISSN(印刷版)2161-4393
ISSN(电子版)2161-4407

会议

会议2021 International Joint Conference on Neural Networks, IJCNN 2021
国家/地区中国
Virtual, Online
时期18/07/2122/07/21

指纹

探究 'Safe Reinforcement Learning for CPSs via Formal Modeling and Verification' 的科研主题。它们共同构成独一无二的指纹。

引用此