
FocusPatch AD: Few-Shot Multi-Class Anomaly Detection With Unified Keywords Patch Prompts

  • Xicheng Ding
  • Xiaofan Li
  • Mingang Chen
  • Jingyu Gong*
  • Yuan Xie
  • *Corresponding author for this work
  • East China Normal University
  • Macau University of Science and Technology
  • Shanghai Key Laboratory of Computer Software Evaluating and Testing

Research output: Contribution to journal, Article, peer-reviewed

Abstract

Industrial few-shot anomaly detection (FSAD) requires identifying various abnormal states from as few normal samples as possible, since abnormal samples are unavailable during training. Current methods often train a separate model for each category, which increases computation and storage overhead. Designing a single unified anomaly detection model that supports multiple categories remains challenging, because such a model must recognize anomalous patterns across diverse objects and domains. To tackle these challenges, this paper introduces FocusPatch AD, a unified anomaly detection framework based on vision-language models that performs anomaly detection under few-shot multi-class settings. FocusPatch AD links anomaly state keywords to highly relevant discrete local regions within the image, guiding the model to focus on cross-category anomalies while filtering out background interference. This approach mitigates the false detections caused by global semantic alignment in vision-language models. We evaluate the proposed method on the MVTec, VisA, and Real-IAD datasets, comparing it against several prevailing anomaly detection methods. In both image-level and pixel-level anomaly detection tasks, FocusPatch AD achieves significant gains in classification and localization performance, demonstrating excellent generalization and adaptability.
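
The idea of linking anomaly state keywords to a few highly relevant local regions can be illustrated with a small sketch. This is not the authors' implementation: it assumes that patch embeddings and keyword embeddings have already been produced by some vision-language model, and the function names (`cosine`, `keyword_patch_scores`) and the `top_k` parameter are hypothetical. It only shows the general pattern of scoring discrete patches against keywords rather than aligning the whole image with a text prompt.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def keyword_patch_scores(patch_embs, keyword_embs, top_k=2):
    """Score patches by their similarity to anomaly keywords.

    Assumption (hypothetical, for illustration): each keyword attends only to
    its top_k most similar patches; every other patch keeps a score of 0.0,
    which stands in for "filtering out background interference".
    Returns (per-patch scores, image-level score).
    """
    scores = [0.0] * len(patch_embs)
    for kw in keyword_embs:
        sims = [cosine(p, kw) for p in patch_embs]
        # Indices of the top_k patches most similar to this keyword.
        top = sorted(range(len(sims)), key=lambda i: sims[i], reverse=True)[:top_k]
        for i in top:
            scores[i] = max(scores[i], sims[i])
    # Image-level score: the strongest keyword-patch match.
    image_score = max(scores) if scores else 0.0
    return scores, image_score


# Toy example: three 2-D "patch embeddings" and one "keyword embedding".
patches = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
keywords = [[1.0, 0.0]]
patch_scores, image_score = keyword_patch_scores(patches, keywords, top_k=1)
```

In this toy setup the per-patch scores play the role of a pixel-level (localization) map and the maximum plays the role of an image-level (classification) score; how FocusPatch AD actually derives either is described in the paper, not here.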

Original language: English
Pages (from-to): 112-123
Number of pages: 12
Journal: IEEE Transactions on Image Processing
Volume: 35
DOI
Publication status: Published - 2026
