TY - JOUR
T1 - OIPF
T2 - An Orthogonal Inputs Perception Fusion Framework for Infrared Small Target Detection
AU - Ma, Qianwen
AU - Li, Xiaobo
AU - Wang, Shaowei
AU - Zhai, Jingsheng
AU - Zhao, Xingye
AU - Hu, Haofeng
N1 - Publisher Copyright:
© 1965-2011 IEEE.
PY - 2025
Y1 - 2025
N2 - In infrared small target detection (ISTD), a common challenge for neural network methods is that as the network deepens, the sparse information of small targets becomes even more diffuse in the deeper layers, limiting the ability to extract high-level semantic features. To address this issue, we propose the Orthogonal Inputs Perception Fusion (OIPF) framework. The key idea is to enhance the learnable target features in deeper layers without increasing network depth. This is achieved by creating orthogonal data pairs through intensity inversion and enhancement, which are then fed into a dual-input framework to provide incremental information flow. In addition, we introduce a relationally aware module (RAM) that generates spatial weight maps by leveraging the relationships between data pairs across layers. This module helps the framework focus on target edges and complex background regions, ensuring that rich target information is maintained throughout the network. Through extensive testing on three datasets and ablation study, we validated the OIPF framework and RAM's superiority, as well as their low dependence on the dataset scale. By integrating these into existing models, we significantly enhance ISTD performance, proving our solution's effectiveness and robustness.
AB - In infrared small target detection (ISTD), a common challenge for neural network methods is that as the network deepens, the sparse information of small targets becomes even more diffuse in the deeper layers, limiting the ability to extract high-level semantic features. To address this issue, we propose the Orthogonal Inputs Perception Fusion (OIPF) framework. The key idea is to enhance the learnable target features in deeper layers without increasing network depth. This is achieved by creating orthogonal data pairs through intensity inversion and enhancement, which are then fed into a dual-input framework to provide incremental information flow. In addition, we introduce a relationally aware module (RAM) that generates spatial weight maps by leveraging the relationships between data pairs across layers. This module helps the framework focus on target edges and complex background regions, ensuring that rich target information is maintained throughout the network. Through extensive testing on three datasets and ablation study, we validated the OIPF framework and RAM's superiority, as well as their low dependence on the dataset scale. By integrating these into existing models, we significantly enhance ISTD performance, proving our solution's effectiveness and robustness.
KW - Deep learning
KW - feature fusion
KW - infrared small target
KW - orthogonal input model
KW - relationally aware
KW - target detection
UR - https://www.scopus.com/pages/publications/105002299542
U2 - 10.1109/TAES.2025.3558181
DO - 10.1109/TAES.2025.3558181
M3 - 文章
AN - SCOPUS:105002299542
SN - 0018-9251
VL - 61
SP - 9686
EP - 9701
JO - IEEE Transactions on Aerospace and Electronic Systems
JF - IEEE Transactions on Aerospace and Electronic Systems
IS - 4
ER -