跳到主要导航 跳到搜索 跳到主要内容

Reducing unknown unknowns with guidance in image caption

  • East China Normal University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Deep recurrent models applied in Image Caption, which link up computer vision and natural language processing, have achieved excellent results enabling automatically generating natural sentences describing an image. However, the mismatch of sample distribution between training data and the open world may leads to tons of hiding-in-dark Unknown Unknowns (UUs). And such errors may greatly harm the correctness of generated captions. In this paper, we present a framework targeting on UUs reduction and model optimization based on recurrently training with small amounts of external data detected under assistance of crowd commonsense. We demonstrate and analyze our method with currently state-of-the-art image-to-text model. Aiming at reducing the number of UUs in generated captions, we obtain over 12% of UUs reduction and reinforcement of model cognition on these scenes.

源语言英语
主期刊名Artificial Neural Networks and Machine Learning – ICANN 2017 - 26th International Conference on Artificial Neural Networks, Proceedings
编辑Alessandra Lintas, Alessandro E. Villa, Stefano Rovetta, Paul F. Verschure
出版商Springer Verlag
547-555
页数9
ISBN(印刷版)9783319686110
DOI
出版状态已出版 - 2017
活动26th International Conference on Artificial Neural Networks, ICANN 2017 - Alghero, 意大利
期限: 11 9月 201714 9月 2017

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
10614 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议26th International Conference on Artificial Neural Networks, ICANN 2017
国家/地区意大利
Alghero
时期11/09/1714/09/17

指纹

探究 'Reducing unknown unknowns with guidance in image caption' 的科研主题。它们共同构成独一无二的指纹。

引用此