TY - GEN
T1 - Informative Prompt Learning for Low-Shot Commonsense Question Answering via Fine-Grained Redundancy Reduction
AU - Lei, Zhikai
AU - Zhou, Jie
AU - Chen, Qin
AU - Zhang, Qi
AU - He, Liang
N1 - Publisher Copyright:
© 2024, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
PY - 2024
Y1 - 2024
N2 - Low-shot commonsense question answering (CQA) poses a significant challenge due to the scarcity of labeled data and commonsense knowledge. Recent work focuses on leveraging the commonsense reasoning potential of pre-trained language models (PLMs) for low-shot CQA. In addition, various prompt learning methods have been studied to elicit implicit knowledge from PLMs and improve performance. However, it has been shown that PLMs suffer from a redundancy problem in which many neurons encode similar information, especially in small-sample regimes, rendering prompt learning less informative in low-shot scenarios. In this paper, we propose an informative prompt learning approach that elicits more diverse and useful knowledge from PLMs for low-shot CQA via fine-grained redundancy reduction. Specifically, our redundancy-reduction method imposes constraints at the fine-grained neuron level to encourage each dimension to model different knowledge or clues. Experiments on three benchmark datasets demonstrate the clear advantages of our approach in low-shot settings. Moreover, we conduct both quantitative and qualitative analyses that shed light on why our approach leads to such improvements.
AB - Low-shot commonsense question answering (CQA) poses a significant challenge due to the scarcity of labeled data and commonsense knowledge. Recent work focuses on leveraging the commonsense reasoning potential of pre-trained language models (PLMs) for low-shot CQA. In addition, various prompt learning methods have been studied to elicit implicit knowledge from PLMs and improve performance. However, it has been shown that PLMs suffer from a redundancy problem in which many neurons encode similar information, especially in small-sample regimes, rendering prompt learning less informative in low-shot scenarios. In this paper, we propose an informative prompt learning approach that elicits more diverse and useful knowledge from PLMs for low-shot CQA via fine-grained redundancy reduction. Specifically, our redundancy-reduction method imposes constraints at the fine-grained neuron level to encourage each dimension to model different knowledge or clues. Experiments on three benchmark datasets demonstrate the clear advantages of our approach in low-shot settings. Moreover, we conduct both quantitative and qualitative analyses that shed light on why our approach leads to such improvements.
KW - Commonsense question answering
KW - Low-shot learning
KW - Redundancy in Pre-trained language models
UR - https://www.scopus.com/pages/publications/85178557952
U2 - 10.1007/978-981-99-8181-6_29
DO - 10.1007/978-981-99-8181-6_29
M3 - Conference contribution
AN - SCOPUS:85178557952
SN - 9789819981809
T3 - Communications in Computer and Information Science
SP - 377
EP - 390
BT - Neural Information Processing - 30th International Conference, ICONIP 2023, Proceedings
A2 - Luo, Biao
A2 - Cheng, Long
A2 - Wu, Zheng-Guang
A2 - Li, Hongyi
A2 - Li, Chaojie
PB - Springer Science and Business Media Deutschland GmbH
T2 - 30th International Conference on Neural Information Processing, ICONIP 2023
Y2 - 20 November 2023 through 23 November 2023
ER -