Abstraction-Based Training for Robust Classification Models via Image Pixelation

  • Yang Chen
  • , Min Wu*
  • , Min Zhang*
  • *Corresponding author for this work

Research output: Contribution to journalConference articlepeer-review

Abstract

Deep Neural Networks (DNNs) are vulnerable to specially designed attacks due to their limited robustness. Abstraction methods can help extract critical features for learning, thereby reducing the disturbance caused by insignificant information. In this paper, we propose a pixelation-based abstraction method to enhance the empirical robustness of DNNs. The method partitions image pixels into superpixels and assigns each an appropriate colour from a continuously updated palette. Two hyperparameters control the abstraction level, allowing for resolution adjustment. Training and evaluation are conducted on pixelated datasets. Extensive experiments across benchmarks and loss landscape analysis demonstrate that our method (i) reduces attack success rates by up to 26.37% while maintaining high accuracy; (ii) exhibits a significant defense against diverse attack methods; and (iii) achieves smoother loss landscapes, underscoring its potential to enhance model robustness.

Keywords

  • Abstraction
  • Adversarial Defense
  • Image Classification
  • Neural Network
  • Pixelation
  • Robustness

Fingerprint

Dive into the research topics of 'Abstraction-Based Training for Robust Classification Models via Image Pixelation'. Together they form a unique fingerprint.

Cite this