TY - GEN
T1 - Enhancing Visual Understanding by Removing Dithering with Global and Self-Conditioned Transformation
AU - Huang, Yifei
AU - Li, Chenhui
AU - Liu, Risheng
AU - Liang, Tianyi
AU - Wang, Changbo
N1 - Publisher Copyright:
© 2023 ACM.
PY - 2023/9/22
Y1 - 2023/9/22
N2 - PNG-8 images are commonly used on the web due to their small size, but their limited color palette often leads to dithering artifacts. Unfortunately, restoring these images using a conventional convolutional neural network (CNN) often results in suboptimal performance since the spatial distribution of dithering is not uniform across the image. This is because the convolutional operator is spatially consistent, meaning it applies the same kernel to all pixels, which we refer to as a global transformation. To address this issue, we propose PNG8IRNet, one approach that combines global and self-conditioned transformations to remove dithering artifacts. Our method incorporates a multilayer perceptron (MLP) to generate diverse kernels for each pixel, taking into account the spatial non-uniformity of dithering, which we define as a self-conditioned transformation. PNG8IRNet demonstrates its performance on multiple datasets, substantially enhancing visual comprehension through a comprehensive set of experiments.
AB - PNG-8 images are commonly used on the web due to their small size, but their limited color palette often leads to dithering artifacts. Unfortunately, restoring these images using a conventional convolutional neural network (CNN) often results in suboptimal performance since the spatial distribution of dithering is not uniform across the image. This is because the convolutional operator is spatially consistent, meaning it applies the same kernel to all pixels, which we refer to as a global transformation. To address this issue, we propose PNG8IRNet, one approach that combines global and self-conditioned transformations to remove dithering artifacts. Our method incorporates a multilayer perceptron (MLP) to generate diverse kernels for each pixel, taking into account the spatial non-uniformity of dithering, which we define as a self-conditioned transformation. PNG8IRNet demonstrates its performance on multiple datasets, substantially enhancing visual comprehension through a comprehensive set of experiments.
KW - dithering removing
KW - neural networks
KW - spatial non-uniformity
KW - visual understanding
UR - https://www.scopus.com/pages/publications/85178326082
U2 - 10.1145/3615522.3615549
DO - 10.1145/3615522.3615549
M3 - 会议稿件
AN - SCOPUS:85178326082
T3 - ACM International Conference Proceeding Series
BT - 16th International Symposium on Visual Information Communication and Interaction, VINCI 2023
PB - Association for Computing Machinery
T2 - 16th International Symposium on Visual Information Communication and Interaction, VINCI 2023
Y2 - 22 September 2023 through 24 September 2023
ER -