跳到主要导航 跳到搜索 跳到主要内容

CompressedVQA-HDR: Generalized Full-reference and No-reference Quality Assessment Models for Compressed High Dynamic Range Videos

  • Wei Sun
  • , Linhan Cao
  • , Kang Fu
  • , Dandan Zhu*
  • , Jun Jia
  • , Menghan Hu
  • , Xiongkuo Min
  • , Guangtao Zhai*
  • *此作品的通讯作者
  • East China Normal University
  • Shanghai Jiao Tong University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Video compression is a standard procedure applied to all videos to minimize storage and transmission demands while preserving visual quality as much as possible. Therefore, evaluating the visual quality of compressed videos is crucial for guiding the practical usage and further development of video compression algorithms. Although numerous compressed video quality assessment (VQA) methods have been proposed, they often lack the generalization capability needed to handle the increasing diversity of video types, particularly high dynamic range (HDR) content. In this paper, we introduce CompressedVQA-HDR, an effective VQA framework designed to address the challenges of HDR video quality assessment. Specifically, we adopt the Swin Transformer and SigLip 2 as the backbone networks for the proposed full-reference (FR) and no-reference (NR) VQA models, respectively. For the FR model, we compute deep structural and textural similarities between reference and distorted frames using intermediate-layer features extracted from the Swin Transformer as its quality-aware feature representation. For the NR model, we extract the global mean of the final-layer feature maps from SigLip 2 as its quality-aware representation. To mitigate the issue of limited HDR training data, we pre-train the FR model on a large-scale standard dynamic range (SDR) VQA dataset and fine-tune it on the HDRSDR-VQA dataset. For the NR model, we employ an iterative mixed-dataset training strategy across multiple compressed VQA datasets, followed by fine-tuning on the HDRSDR-VQA dataset. Experimental results show that our models achieve state-of-the-art performance compared to existing FR and NR VQA models. The code is available at https://github.com/sunwei925/CompressedVQA-HDR.

源语言英语
主期刊名IEEE International Conference on Multimedia and Expo Workshops
主期刊副标题Journey to the Center of Machine Imagination, ICMEW 2025 - Proceedings
出版商Institute of Electrical and Electronics Engineers Inc.
ISBN(电子版)9798331587437
DOI
出版状态已出版 - 2025
活动2025 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2025 - Nantes, 法国
期限: 30 6月 20254 7月 2025

出版系列

姓名IEEE International Conference on Multimedia and Expo Workshops: Journey to the Center of Machine Imagination, ICMEW 2025 - Proceedings

会议

会议2025 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2025
国家/地区法国
Nantes
时期30/06/254/07/25

指纹

探究 'CompressedVQA-HDR: Generalized Full-reference and No-reference Quality Assessment Models for Compressed High Dynamic Range Videos' 的科研主题。它们共同构成独一无二的指纹。

引用此