TY - JOUR
T1 - Robust Motion Planning for Multi-Robot Systems Against Position Deception Attacks
AU - Tang, Wenbing
AU - Zhou, Yuan
AU - Liu, Yang
AU - Ding, Zuohua
AU - Liu, Jing
N1 - Publisher Copyright:
© 2005-2012 IEEE.
PY - 2024
Y1 - 2024
N2 - Deep reinforcement learning (DRL) is widely applied in motion planning for multi-robot systems as DRL leverages the offline training process to improve the real-time computation efficiency. In DRL-based methods, the DRL models compute an action for a robot based on the states of its surrounding obstacles, including other robots in the system. They always assume that the number of obstacles is fixed and the obtained obstacles' states are reliable. However, in the real world, a multi-robot system may suffer from various attacks, such as remote control attacks and network attacks, that cause wrong positions of the surrounding obstacles received by a robot. In this paper, we propose a robust motion planning method DAE-Crit-LSTM, integrating a denoising autoencoder (DAE) with DRL models, to mitigate such position deception attacks in environments with a different number of obstacles. DAE-Crit-LSTM shows the following two advantages. First, DAE-Crit-LSTM can be applied in benign and attacked scenarios and thus does not require any detector. It learns an encoder and a decoder to approximate the accurate positions of the obstacles, no matter under attack or not. Second, DAE-Crit-LSTM applies an LSTM (Long Short-Term Memory)-based DRL model to deal with a variable number of obstacles in the environment. It is worth noting that DAE-Crit-LSTM is method-agnostic and can be easily implemented in state-of-the-art motion planning methods. Comprehensive experiments show that DAE-Crit-LSTM can mitigate position deception attacks and guarantee safe motion. We also demonstrate the effectiveness and generalization of DAE-Crit-LSTM.
AB - Deep reinforcement learning (DRL) is widely applied in motion planning for multi-robot systems as DRL leverages the offline training process to improve the real-time computation efficiency. In DRL-based methods, the DRL models compute an action for a robot based on the states of its surrounding obstacles, including other robots in the system. They always assume that the number of obstacles is fixed and the obtained obstacles' states are reliable. However, in the real world, a multi-robot system may suffer from various attacks, such as remote control attacks and network attacks, that cause wrong positions of the surrounding obstacles received by a robot. In this paper, we propose a robust motion planning method DAE-Crit-LSTM, integrating a denoising autoencoder (DAE) with DRL models, to mitigate such position deception attacks in environments with a different number of obstacles. DAE-Crit-LSTM shows the following two advantages. First, DAE-Crit-LSTM can be applied in benign and attacked scenarios and thus does not require any detector. It learns an encoder and a decoder to approximate the accurate positions of the obstacles, no matter under attack or not. Second, DAE-Crit-LSTM applies an LSTM (Long Short-Term Memory)-based DRL model to deal with a variable number of obstacles in the environment. It is worth noting that DAE-Crit-LSTM is method-agnostic and can be easily implemented in state-of-the-art motion planning methods. Comprehensive experiments show that DAE-Crit-LSTM can mitigate position deception attacks and guarantee safe motion. We also demonstrate the effectiveness and generalization of DAE-Crit-LSTM.
KW - Deep reinforcement learning
KW - denoising autoencoder
KW - motion planning
KW - multi-robot systems
KW - position deception attacks
UR - https://www.scopus.com/pages/publications/85181563211
U2 - 10.1109/TIFS.2023.3346647
DO - 10.1109/TIFS.2023.3346647
M3 - 文章
AN - SCOPUS:85181563211
SN - 1556-6013
VL - 19
SP - 2157
EP - 2170
JO - IEEE Transactions on Information Forensics and Security
JF - IEEE Transactions on Information Forensics and Security
ER -