Abstract
This paper presents a summary of the VQualA 2025 Challenge on Visual Quality Comparison for Large Multimodal Models (LMMs), hosted as part of the ICCV 2025 Workshop on Visual Quality Assessment. The challenge aims to evaluate and enhance the ability of state-of-the-art LMMs to perform open-ended and detailed reasoning about visual quality differences across multiple images. To this end, the competition introduces a novel benchmark comprising thousands of coarse-to-fine-grained visual quality comparison tasks, spanning single images, pairs, and multi-image groups. Each task requires models to provide accurate quality judgments. The competition emphasizes holistic evaluation protocols, including 2AFC-based binary preference and multi-choice questions (MCQs). Around 100 participants submitted entries, with five models demonstrating the emerging capabilities of instruction-tuned LMMs on quality assessment. This challenge marks a significant step toward open-domain visual quality reasoning and comparison and serves as a catalyst for future research on interpretable and human-aligned quality evaluation systems.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2025 IEEE/CVF International Conference on Computer Vision Workshops, ICCV-W 2025 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 3383-3393 |
| Number of pages | 11 |
| ISBN (electronic) | 9798331589882 |
| DOI | |
| Publication status | Published - 2025 |
| Event | 2025 IEEE/CVF International Conference on Computer Vision Workshops, ICCV-W 2025 - Honolulu, United States. Duration: 19 Oct 2025 → 20 Oct 2025 |
Publication series
| Name | Proceedings - 2025 IEEE/CVF International Conference on Computer Vision Workshops, ICCV-W 2025 |
|---|---|
Conference
| Conference | 2025 IEEE/CVF International Conference on Computer Vision Workshops, ICCV-W 2025 |
|---|---|
| Country/Region | United States |
| City | Honolulu |
| Period | 19/10/25 → 20/10/25 |
Fingerprint
Explore the research topics of 'VQualA 2025 Challenge on Visual Quality Comparison for Large Multimodal Models: Methods and Results'. Together they form a unique fingerprint.