摘要
Deep neural networks are vulnerable to adversarial examples, which has been threatening their application in safety-critical scenarios. Based on the explanation that adversarial examples arise from the highly linear behavior of neural networks, a model robustness enhancement algorithm based on scale-invariant condition number constraint is proposed. Firstly, all weight matrices are used to calculate their norms during the adversarial training process, and the scale-invariant constraint term is obtained through the logarithmic function. Secondly, the scale-invariant condition number constraint item is incorporated into the outer framework of adversarial training optimization, and the condition number value of all weight matrices are iteratively reduced through backpropagation, thereby performing linear transformation of the neural network in a well-conditioned high-dimensional weight space, to improve robustness against adversarial perturbations. This algorithm is suitable for visual models of both convolution and Transformer architectures. It can not only significantly improve the robust accuracy against white-box attacks such as PGD and AutoAttack, but also effectively enhance the adversarial robustness of defending against black-box attack algorithms including square attack. Incorporating the proposed constraint during adversarial training on Transformer-based image classification model, the condition number value of weight matrices drops by 20.7% on average, the robust accuracy can be increased by 1.16 percentage points when defending against PGD attacks. Compared with similar methods such as Lipschitz constraints, the proposed method can also improve the accuracy of clean examples and alleviate the problem of low generalization caused by adversarial training.
| 投稿的翻译标题 | Model Robustness Enhancement Algorithm with Scale Invariant Condition Number Constraint |
|---|---|
| 源语言 | 繁体中文 |
| 页(从-至) | 140-147 |
| 页数 | 8 |
| 期刊 | Computer Engineering and Applications |
| 卷 | 60 |
| 期 | 8 |
| DOI | |
| 出版状态 | 已出版 - 15 4月 2024 |
| 已对外发布 | 是 |
关键词
- adversarial robustness
- adversarial training
- condition number
- image classification
- scale-invariance
指纹
探究 '尺度不变的条件数约束的模型鲁棒性增强算法' 的科研主题。它们共同构成独一无二的指纹。引用此
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver