How Accurate are Video Quality Models for Diffusion-Based Video Super-Resolution? 文章

ArXiv CS.CV2026-05-26NEWSen作者: Benjamin Herb, Steve G\"oring, Alexander Raake, Rakesh Rao Ramachandra Rao

摘要

arXiv:2605.25940v1 Announce Type: cross Abstract: Recent video super-resolution (VSR) approaches use deep neural networks to enhance low-quality input videos and recover visual detail, with diffusion-based methods in particular showing promising results. In this paper, we investigate whether existing video quality models can be used to assess the performance of these diffusion-based VSR methods, by comparing model predictions with results from a subjective test. The study compares six upscaling methods (Lanczos, Rhea, SCST, DOVE, SeedVR2, Starlight Mini) applied to both compressed (AV1 and DCVC-RT) and uncompressed low-resolution videos considering the play-out on a UHD-1/4K screen. A range of full- and no-reference quality models are used to assess their applicability to this new type of quality degradation, focusing on within-sequence performance.