Abstract
Dynamic resolution network is proved to be crucial in reducing computational redundancy by automatically assigning satisfactory resolution for each input image. However, it is observed that resolution choices are often collapsed, where prior works tend to assign images to the resolution routes whose computational cost is close to the required FLOPs. In this paper, we propose a novel optimal transport dynamic resolution network (OTD-Net) by establishing an intrinsic connection between resolution assignment and optimal transport problem. In this framework, each sample owns a resolution assignment choice viewed as supplier, and each resolution requires unallocated images considered as demander. With two assignment priors, OTD-Net benefits from the non-collapse division under theoretical support, and produces the desired assignment policy by balancing the computation budget and prediction accuracy. On that basis, a multi-resolution inference is proposed to ensemble low-resolution predictions. Extensive experiments including image classification, object detection and depth estimation, show our approach is both efficient and effective for both ResNet and Transformer, achieving state-of-the-art performance on various benchmarks.
| Original language | English |
|---|---|
| Pages (from-to) | 6187-6200 |
| Number of pages | 14 |
| Journal | International Journal of Computer Vision |
| Volume | 133 |
| Issue number | 9 |
| DOIs | |
| State | Published - Sep 2025 |
Keywords
- Computational Redundancy
- Dynamic Inference
- Dynamic Resolution Network
- Model Compression
Fingerprint
Dive into the research topics of 'Optimal Transport with Arbitrary Prior for Dynamic Resolution Network'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver