Extreme Super-Resolution via Scale Autoregression and Preference Alignment

[Submitted on 24 May 2025 (v1), last revised 27 May 2025 (this version, v2)]

Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment, by Bryan Sangwoo Kim and 2 other authors

Abstract: Modern single-image super-resolution (SISR) models deliver photo-realistic results at the scale factors on which they are trained, but collapse when asked to magnify far beyond that regime. We address this scalability bottleneck with Chain-of-Zoom (CoZ), a model-agnostic framework that factorizes SISR into an autoregressive chain of intermediate scale-states with multi-scale-aware prompts. CoZ repeatedly re-uses a backbone SR model, decomposing the conditional probability into tractable sub-problems to achieve extreme resolutions without additional training. Because visual cues diminish at high magnifications, we augment each zoom step with multi-scale-aware text prompts generated by a vision-language model (VLM). The prompt extractor itself is fine-tuned using Group Relative Policy Optimization (GRPO) with a critic VLM, aligning text guidance towards human preference. Experiments show that a standard 4x diffusion SR model wrapped in CoZ attains beyond 256x enlargement with high perceptual quality and fidelity. Project Page: this https URL.
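
The chaining idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: `center_crop`, `nearest_upscale`, `dummy_prompt`, and `chain_of_zoom` are hypothetical names, the backbone is replaced by plain nearest-neighbour upsampling, and the prompt extractor is a stub standing in for a VLM. The only point shown is that re-applying a fixed 4x step to the previous scale-state compounds the magnification, so four steps give 4^4 = 256x.

```python
from typing import Callable, List, Tuple

Image = List[List[float]]  # toy grayscale image as a 2D grid


def center_crop(img: Image, factor: int) -> Image:
    """Keep the central 1/factor region along each axis."""
    h, w = len(img), len(img[0])
    ch, cw = h // factor, w // factor
    top, left = (h - ch) // 2, (w - cw) // 2
    return [row[left:left + cw] for row in img[top:top + ch]]


def nearest_upscale(img: Image, factor: int, prompt: str) -> Image:
    """Placeholder backbone (assumption): nearest-neighbour upsampling.
    A real SR model would use the prompt; this stand-in ignores it."""
    return [[v for v in row for _ in range(factor)]
            for row in img for _ in range(factor)]


def dummy_prompt(states: List[Image]) -> str:
    """Placeholder prompt extractor (assumption): a VLM would describe
    the current and earlier scale-states here."""
    return f"zoom level {len(states)}"


def chain_of_zoom(img: Image,
                  sr_model: Callable[[Image, int, str], Image],
                  prompt_fn: Callable[[List[Image]], str],
                  factor: int = 4,
                  steps: int = 4) -> Tuple[List[Image], int]:
    """Apply the same SR step repeatedly, each time on the previous output."""
    states = [img]
    for _ in range(steps):
        prompt = prompt_fn(states)               # sees earlier scale-states
        crop = center_crop(states[-1], factor)   # region to zoom into
        states.append(sr_model(crop, factor, prompt))
    return states, factor ** steps               # cumulative magnification


image = [[float(r * 8 + c) for c in range(8)] for r in range(8)]
states, magnification = chain_of_zoom(image, nearest_upscale, dummy_prompt)
print(magnification)  # 256
```

Because each step crops by the same factor it upsamples by, every scale-state keeps the input's pixel dimensions while covering a region 4x smaller per side than the one before it.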

Submission history

From: Jong Chul Ye
[v1] Sat, 24 May 2025 08:50:08 UTC (6,599 KB)
[v2] Tue, 27 May 2025 16:02:29 UTC (6,599 KB)
