TY - JOUR
T1 - Noisy Multi-Label Aggregation with Self-Supervised Graph Transformer in Mobile Crowdsourcing
AU - Liu, Jiacheng
AU - Tang, Feilong
AU - Liu, Hao
AU - Chen, Long
AU - Zhu, Yanmin
AU - Yu, Jiadi
AU - Yu, Yichuan
AU - Hou, Xiaofeng
N1 - Publisher Copyright:
© 2002-2012 IEEE.
PY - 2025
Y1 - 2025
N2 - Aggregating noisy labels from mobile crowdsourcing (MCS) to recover true labels is a fundamental yet challenging problem, especially due to the sparsity and unreliability of crowd-contributed data. While most prior work addresses only single-label scenarios, real-world MCS applications often require robust solutions for both single-label and multi-label tasks, where each instance may be associated with multiple categories. In this paper, we propose ATHENA, a novel approach that leverages self-supervision signals inherent in MCS data for effective label aggregation. Firstly, we propose a graph transformer model that can learn from the MCS topology and features. Then, we propose self-supervision signals inherently included in the dataset to help aggregate the labels. To address the unique challenges of multi-label aggregation, we further extend our approach to ATHENA+, introducing a label message passing (LMP) module that explicitly models correlations and dependencies among labels. We conducted extensive experiments on multiple single-label and multi-label classification datasets, comparing the proposed models with state-of-the-art methods. Our results demonstrate that ATHENA and ATHENA+ are highly effective in aggregating labels and obtain much better performance than existing methods.
AB - Aggregating noisy labels from mobile crowdsourcing (MCS) to recover true labels is a fundamental yet challenging problem, especially due to the sparsity and unreliability of crowd-contributed data. While most prior work addresses only single-label scenarios, real-world MCS applications often require robust solutions for both single-label and multi-label tasks, where each instance may be associated with multiple categories. In this paper, we propose ATHENA, a novel approach that leverages self-supervision signals inherent in MCS data for effective label aggregation. Firstly, we propose a graph transformer model that can learn from the MCS topology and features. Then, we propose self-supervision signals inherently included in the dataset to help aggregate the labels. To address the unique challenges of multi-label aggregation, we further extend our approach to ATHENA+, introducing a label message passing (LMP) module that explicitly models correlations and dependencies among labels. We conducted extensive experiments on multiple single-label and multi-label classification datasets, comparing the proposed models with state-of-the-art methods. Our results demonstrate that ATHENA and ATHENA+ are highly effective in aggregating labels and obtain much better performance than existing methods.
KW - Graph Transformers
KW - Label Aggregation
KW - Mobile Crowdsourcing
KW - Multi-label Classification
UR - https://www.scopus.com/pages/publications/105025003443
U2 - 10.1109/TMC.2025.3642535
DO - 10.1109/TMC.2025.3642535
M3 - 文章
AN - SCOPUS:105025003443
SN - 1536-1233
JO - IEEE Transactions on Mobile Computing
JF - IEEE Transactions on Mobile Computing
ER -