跳到主要导航 跳到搜索 跳到主要内容

Relationship Prompt Learning is Enough for Open-Vocabulary Semantic Segmentation

  • Jiahao Li
  • , Yang Lu
  • , Yuan Xie*
  • , Yanyun Qu*
  • *此作品的通讯作者
  • Xiamen University
  • Normal University

科研成果: 期刊稿件会议文章同行评审

摘要

Open-vocabulary semantic segmentation (OVSS) aims to segment unseen classes without corresponding labels. Existing Vision-Language Model (VLM)based methods leverage VLM's rich knowledge to enhance additional explicit segmentation-specific networks, yielding competitive results, but at the cost of extensive training cost. To reduce the cost, we attempt to enable VLM to directly produce the segmentation results without any segmentation-specific networks. Prompt learning offers a direct and parameter-efficient approach, yet it falls short in guiding VLM for pixel-level visual classification. Therefore, we propose the Relationship Prompt Module (RPM), which generates the relationship prompt that directs VLM to extract pixel-level semantic embeddings suitable for OVSS. Moreover, RPM integrates with VLM to construct the Relationship Prompt Network (RPN), achieving OVSS without any segmentation-specific networks. RPN attains state-of-the-art performance with merely about 3M trainable parameters (2% of total parameters).

源语言英语
期刊Advances in Neural Information Processing Systems
37
出版状态已出版 - 2024
活动38th Conference on Neural Information Processing Systems, NeurIPS 2024 - Vancouver, 加拿大
期限: 9 12月 202415 12月 2024

指纹

探究 'Relationship Prompt Learning is Enough for Open-Vocabulary Semantic Segmentation' 的科研主题。它们共同构成独一无二的指纹。

引用此