TY - GEN
T1 - Classify Sentence from Multiple Perspectives with Category Expert Attention Network
AU - Chen, Shiyun
AU - Wang, Maoquan
AU - Zhang, Jiacheng
AU - He, Liang
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/10/10
Y1 - 2018/10/10
N2 - Attention mechanisms achieve promising performance in text classification. When classifying the sentence into a certain category among multiple candidates, the existing attention utilizes a unified attention weight vector to determine contribution of each word. We observe that it is not accurate enough for a single vector to capture diverse category features of a sentence. In this paper, we propose a Category Expert Attention matrix to extract sentence features from different category perspectives. Each row of the attention matrix represents the unique feature of each category. We evaluate our model on four large scale datasets. Empirical results show that our model outperforms previous baseline methods by a significant margin. Especially on the NLPCC dataset, we achieve new state-of-the-art results. We also conduct visualization experiments to interpret how category expert information analyzes sentences.
AB - Attention mechanisms achieve promising performance in text classification. When classifying the sentence into a certain category among multiple candidates, the existing attention utilizes a unified attention weight vector to determine contribution of each word. We observe that it is not accurate enough for a single vector to capture diverse category features of a sentence. In this paper, we propose a Category Expert Attention matrix to extract sentence features from different category perspectives. Each row of the attention matrix represents the unique feature of each category. We evaluate our model on four large scale datasets. Empirical results show that our model outperforms previous baseline methods by a significant margin. Especially on the NLPCC dataset, we achieve new state-of-the-art results. We also conduct visualization experiments to interpret how category expert information analyzes sentences.
UR - https://www.scopus.com/pages/publications/85056488504
U2 - 10.1109/IJCNN.2018.8489556
DO - 10.1109/IJCNN.2018.8489556
M3 - 会议稿件
AN - SCOPUS:85056488504
T3 - Proceedings of the International Joint Conference on Neural Networks
BT - 2018 International Joint Conference on Neural Networks, IJCNN 2018 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2018 International Joint Conference on Neural Networks, IJCNN 2018
Y2 - 8 July 2018 through 13 July 2018
ER -