Discover and explore top open-source AI tools and projects—updated daily.
taco-groupControllable video super-resolution via sparse keyframe propagation
New!
Top 82.4% on SourcePulse
SparkVSR introduces an interactive Video Super-Resolution (VSR) framework, addressing the limitations of black-box VSR models by enabling users to control output quality via sparse keyframes. This project targets researchers and practitioners seeking controllable VSR solutions, offering improved temporal consistency and restoration quality, with potential applications beyond VSR.
How It Works
SparkVSR employs a two-stage training pipeline: first, a keyframe-conditioned latent-pixel approach fuses low-resolution video latents with sparsely encoded high-resolution keyframe latents for robust cross-space propagation. The second stage refines perceptual details in pixel space. At inference, it supports flexible keyframe selection (manual, codec I-frame, random sampling) and a reference-free guidance mechanism that balances adherence to keyframes with blind restoration, ensuring robust performance even with imperfect references.
Quick Start & Requirements
conda create -n sparkvsr python=3.10), activate it, and install dependencies (pip install -r requirements.txt). A specific PyTorch installation for CUDA 12.4 is provided: pip install torch==2.5.0 torchvision==0.20.0 torchaudio==2.5.0 --index-url https://download.pytorch.org/whl/cu124.prepare_dataset.py. Inference modes may require external API keys or separate model installations (e.g., PiSA-SR).Highlighted Details
Maintenance & Community
The project was released on March 17, 2026. Key contributors are listed as authors from Texas A&M University and YouTube/Google. Several items are marked as TODO, including releasing inference code, pre-trained models, project page, and ComfyUI integration. No community channels (Discord, Slack) or social handles are provided.
Licensing & Compatibility
The license type is not explicitly stated in the README. Compatibility for commercial use or linking with closed-source projects is therefore undetermined.
Limitations & Caveats
As a newly released project (March 2026), SparkVSR has several outstanding TODO items, indicating it is likely in an early development or alpha stage. Training demands significant hardware resources (4x A100 GPUs), and certain inference modes require external dependencies and API keys, adding complexity to setup. The absence of a stated license is a critical adoption blocker for many use cases.
1 week ago
Inactive
hao-ai-lab
SkyworkAI