TY - JOUR
T1 - Reliable Multimodal Semantic Communication for Audio-Visual Event Localization
AU - Li, Yuandi
AU - Xiang, Zhe
AU - Yu, Fei
AU - Zhang, Zhuoran
AU - Wang, Yanhao
AU - Guan, Zhangshuang
AU - Ji, Hui
AU - Wan, Zhiguo
N1 - Publisher Copyright:
© 1997-2012 IEEE.
PY - 2026
Y1 - 2026
N2 - The widespread adoption of smart mobile devices and applications has driven an exponential growth in wireless data traffic, posing significant challenges to modern communication systems. Ensuring reliable task-oriented multimodal semantic communication has become increasingly critical. In this letter, we propose RMMSC, a novel framework designed to enhance the effectiveness and reliability of Audio-Visual Event (AVE) localization-driven multimodal semantic communication. Specifically, RMMSC improves the accuracy of multimodal semantic information through advanced semantic encoding and cross-modal feature integration. It employs a two-level coding scheme that combines error-correcting codes with semantic encoders to enhance the reliability of multimodal semantic transmission. As an optional design choice, RMMSC supports a hybrid encryption mechanism to protect transmitted data if required by the application context. Simulation results validate the effectiveness of RMMSC, demonstrating significant improvements in accuracy and reliability for the AVE task.
AB - The widespread adoption of smart mobile devices and applications has driven an exponential growth in wireless data traffic, posing significant challenges to modern communication systems. Ensuring reliable task-oriented multimodal semantic communication has become increasingly critical. In this letter, we propose RMMSC, a novel framework designed to enhance the effectiveness and reliability of Audio-Visual Event (AVE) localization-driven multimodal semantic communication. Specifically, RMMSC improves the accuracy of multimodal semantic information through advanced semantic encoding and cross-modal feature integration. It employs a two-level coding scheme that combines error-correcting codes with semantic encoders to enhance the reliability of multimodal semantic transmission. As an optional design choice, RMMSC supports a hybrid encryption mechanism to protect transmitted data if required by the application context. Simulation results validate the effectiveness of RMMSC, demonstrating significant improvements in accuracy and reliability for the AVE task.
KW - Semantic communication
KW - audio-visual event localization
KW - multimodal semantic communication
UR - https://www.scopus.com/pages/publications/105022688997
U2 - 10.1109/LCOMM.2025.3635906
DO - 10.1109/LCOMM.2025.3635906
M3 - 文章
AN - SCOPUS:105022688997
SN - 1089-7798
VL - 30
SP - 317
EP - 321
JO - IEEE Communications Letters
JF - IEEE Communications Letters
ER -