TY - GEN
T1 - Visual-to-Semantic Hashing for Zero Shot Learning
AU - Li, Xin
AU - Wen, Xiaoyue
AU - Jin, Bo
AU - Wang, Xiangfeng
AU - Wang, Junjie
AU - Cai, Jinghui
N1 - Publisher Copyright:
© 2020 IEEE.
PY - 2020/7
Y1 - 2020/7
N2 - Hashing-based multimedia retrieval are facing the problem of the dramatic increase of data, especially new unseen categories. It is time-consuming, expensive, and sometimes impractical to label new samples and retrain the hashing model. Recently, several zero-shot hashing methods are proposed to generate the hash function with good generalization for unseen classes, via exploring semantic information and similarity relationship. However, the performance of existing solutions is still not satisfying. Therefore, we propose a modified two-stage framework, called Visual-to-Semantic Hashing (VSH). To fully exploit the semantic information, visual feature is firstly mapped to the semantic space, and then generate its hash codes. To transfer supervised knowledge from seen classes to unseen classes, a margin-based ranking loss is employed to learn the semantic structure. To boost the discriminability of semantic mapping, a classification module is adopted to distinguish between different semantic mapping vectors. Plenty of experiments demonstrate that the proposed VSH is superior to state-of-the-art methods.
AB - Hashing-based multimedia retrieval are facing the problem of the dramatic increase of data, especially new unseen categories. It is time-consuming, expensive, and sometimes impractical to label new samples and retrain the hashing model. Recently, several zero-shot hashing methods are proposed to generate the hash function with good generalization for unseen classes, via exploring semantic information and similarity relationship. However, the performance of existing solutions is still not satisfying. Therefore, we propose a modified two-stage framework, called Visual-to-Semantic Hashing (VSH). To fully exploit the semantic information, visual feature is firstly mapped to the semantic space, and then generate its hash codes. To transfer supervised knowledge from seen classes to unseen classes, a margin-based ranking loss is employed to learn the semantic structure. To boost the discriminability of semantic mapping, a classification module is adopted to distinguish between different semantic mapping vectors. Plenty of experiments demonstrate that the proposed VSH is superior to state-of-the-art methods.
KW - Hashing
KW - cross-domain
KW - multimedia retrieval
KW - zero shot
UR - https://www.scopus.com/pages/publications/85093834320
U2 - 10.1109/IJCNN48605.2020.9207198
DO - 10.1109/IJCNN48605.2020.9207198
M3 - 会议稿件
AN - SCOPUS:85093834320
T3 - Proceedings of the International Joint Conference on Neural Networks
BT - 2020 International Joint Conference on Neural Networks, IJCNN 2020 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2020 International Joint Conference on Neural Networks, IJCNN 2020
Y2 - 19 July 2020 through 24 July 2020
ER -