ComfyUI 节点式AI图像生成 是 AI Skill Hub 本期精选AI工具之一。已获得 2.4k 颗 GitHub Star,综合评分 7.5 分,整体质量较高。我们推荐使用将其纳入你的 AI 工具库,帮助提升工作效率。
ComfyUI 节点式AI图像生成 是一款基于 Python 开发的开源工具,专注于 ai、comfyui、comfyui-nodes 等核心功能。作为 GitHub 开源项目,它拥有活跃的社区支持和持续的版本迭代,代码完全透明可审计,支持本地部署以保护数据隐私。无论是个人使用还是集成到企业工作流,都能提供稳定可靠的解决方案。
ComfyUI 节点式AI图像生成 是一款基于 Python 开发的开源工具,专注于 ai、comfyui、comfyui-nodes 等核心功能。作为 GitHub 开源项目,它拥有活跃的社区支持和持续的版本迭代,代码完全透明可审计,支持本地部署以保护数据隐私。无论是个人使用还是集成到企业工作流,都能提供稳定可靠的解决方案。
# 方式一:pip 安装(推荐)
pip install comfyui-seedvr2_videoupscaler
# 方式二:虚拟环境安装(推荐生产环境)
python -m venv .venv
source .venv/bin/activate # Windows: .venv\Scripts\activate
pip install comfyui-seedvr2_videoupscaler
# 方式三:从源码安装(获取最新功能)
git clone https://github.com/numz/ComfyUI-SeedVR2_VideoUpscaler
cd ComfyUI-SeedVR2_VideoUpscaler
pip install -e .
# 验证安装
python -c "import comfyui_seedvr2_videoupscaler; print('安装成功')"
# 命令行使用
comfyui-seedvr2_videoupscaler --help
# 基本用法
comfyui-seedvr2_videoupscaler input_file -o output_file
# Python 代码中调用
import comfyui_seedvr2_videoupscaler
# 示例
result = comfyui_seedvr2_videoupscaler.process("input")
print(result)
# comfyui-seedvr2_videoupscaler 配置文件示例(config.yml) app: name: "comfyui-seedvr2_videoupscaler" debug: false log_level: "INFO" # 运行时指定配置文件 comfyui-seedvr2_videoupscaler --config config.yml # 或通过环境变量配置 export COMFYUI_SEEDVR2_VIDEOUPSCALER_API_KEY="your-key" export COMFYUI_SEEDVR2_VIDEOUPSCALER_OUTPUT_DIR="./output"
Official release of SeedVR2 for ComfyUI that enables high-quality video and image upscaling.
Can run as Multi-GPU standalone CLI too, see 🖥️ Run as Standalone section.


Choose the appropriate setup based on your installation:
If you've already installed SeedVR2 as part of ComfyUI (via ComfyUI installation), you can use the CLI directly:
```bash
uv pip install -r requirements.txt
1. Clone the repository into your ComfyUI custom nodes directory:
cd ComfyUI
git clone https://github.com/numz/ComfyUI-SeedVR2_VideoUpscaler.git custom_nodes/seedvr2_videoupscaler
2. Install dependencies using standalone Python: ```bash
Models will be automatically downloaded on first use and saved to ComfyUI/models/SEEDVR2.
You can also manually download models from: - Main models available at numz/SeedVR2_comfyUI and AInVFX/SeedVR2_comfyUI - Additional GGUF models available at cmeka/SeedVR2-GGUF
SeedVR2 uses a modular node architecture with four specialized nodes:

Configure the DiT (Diffusion Transformer) model for video upscaling.
Parameters:
seedvr2_ema_3b_fp16.safetensors: FP16 (best quality)seedvr2_ema_3b_fp8_e4m3fn.safetensors: FP8 8-bit (good quality)seedvr2_ema_3b-Q4_K_M.gguf: GGUF 4-bit quantized (acceptable quality)seedvr2_ema_3b-Q8_0.gguf: GGUF 8-bit quantized (good quality)seedvr2_ema_7b_fp16.safetensors: FP16 (best quality)seedvr2_ema_7b_fp8_e4m3fn_mixed_block35_fp16.safetensors: FP8 with last block in FP16 to reduce artifacts (good quality)seedvr2_ema_7b-Q4_K_M.gguf: GGUF 4-bit quantized (acceptable quality)seedvr2_ema_7b_sharp_*: Sharp variants for enhanced detailcuda:0)none: Keep model on inference device (fastest, highest VRAM)cpu: Offload to system RAM (reduces VRAM)cuda:X: Offload to another GPU (good balance if available)0: Disabled (default)1-32: Number of transformer blocks to swap for 3B model1-36: Number of transformer blocks to swap for 7B modelsdpa: PyTorch scaled_dot_product_attention (default, always available)flash_attn_2: Flash Attention 2 (Ampere+, requires flash-attn package)flash_attn_3: Flash Attention 3 (Hopper+, requires flash-attn with FA3 support)sageattn_2: SageAttention 2 (requires sageattention package)sageattn_3: SageAttention 3 (Blackwell/RTX 50xx, requires sageattn3 package)BlockSwap Explained:
BlockSwap enables running large models on GPUs with limited VRAM by dynamically swapping transformer blocks between GPU and CPU memory during inference.
Note: BlockSwap is not available on macOS. Apple Silicon Macs use unified memory architecture where GPU and CPU share the same memory pool, making BlockSwap meaningless. The option will be automatically disabled with a warning if requested on macOS.
Here's how it works:
- What it does: Keeps only the currently-needed transformer blocks on the GPU, while storing the rest on CPU or another device - When to use it: When you get OOM (Out of Memory) errors during the upscaling phase - How to configure: 1. Set offload_device to cpu or another GPU 2. Start with blocks_to_swap=16 (half the blocks) 3. If still getting OOM, increase to 24 or 32 (3B) / 36 (7B) 4. Enable swap_io_components for maximum VRAM savings 5. If you have plenty of VRAM, decrease or set to 0 for faster processing
Example Configuration for Low VRAM (8GB): - model: seedvr2_ema_3b-Q8_0.gguf - device: cuda:0 - offload_device: cpu - blocks_to_swap: 32 - swap_io_components: True

Configure the VAE (Variational Autoencoder) model for encoding/decoding video frames.
Parameters:
ema_vae_fp16.safetensors: Default and recommendedcuda:0)none: Keep model on inference device (default, fastest)cpu: Offload to system RAM (reduces VRAM)cuda:X: Offload to another GPU (good balance if available)VAE Tiling Explained:
VAE tiling processes large resolutions in smaller tiles to reduce VRAM requirements. Here's how to use it:
1. Run without tiling first and monitor the debug logs (enable enable_debug on main node) 2. If OOM during "Encoding" phase: - Enable encode_tiled - If still OOM, reduce encode_tile_size (try 768, 512, etc.) 3. If OOM during "Decoding" phase: - Enable decode_tiled - If still OOM, reduce decode_tile_size 4. Adjust overlap (default 128) if you see visible seams in output (increase it) or processing times are too slow (decrease it).
Example Configuration for High Resolution (4K): - encode_tiled: True - encode_tile_size: 1024 - encode_tile_overlap: 128 - decode_tiled: True - decode_tile_size: 1024 - decode_tile_overlap: 128

Configure torch.compile optimization for 20-40% DiT speedup and 15-25% VAE speedup.
Requirements: - PyTorch 2.0+ - Triton (for inductor backend)
Parameters:
inductor: Full optimization with Triton kernel generation and fusion (recommended)cudagraphs: Lightweight wrapper using CUDA graphs, no kernel optimizationdefault: Fast compilation with good speedup (recommended for development)reduce-overhead: Lower overhead, optimized for smaller modelsmax-autotune: Slowest compilation, best runtime performance (recommended for production)max-autotune-no-cudagraphs: Like max-autotune but without CUDA graphsFalse: Allow graph breaks for better compatibility (default, recommended)True: Enforce no breaks for maximum optimization (may fail with dynamic shapes)False: Specialize for exact input shapes (default)True: Create dynamic kernels that adapt to shape variations (enable when processing different resolutions or batch sizes)Usage: 1. Add this node to your workflow 2. Connect its output to the torch_compile_args input of DiT and/or VAE loader nodes 3. First run will be slow (compilation), subsequent runs will be much faster
When to use: - torch.compile only makes sense when processing multiple batches, long videos, or many tiles - For single images or short clips, the compilation time outweighs the speed improvement - Best suited for batch processing workflows or long videos
Recommended Settings: - For development/testing: mode=default, backend=inductor, fullgraph=False - For production: mode=max-autotune, backend=inductor, fullgraph=False

Main upscaling node that processes video frames using DiT and VAE models.
Required Inputs:
Parameters:
uniform_batch_size (default: False) - Pads the final batch to match batch_size for uniform processing - Prevents temporal artifacts when the last batch is significantly smaller than others - Example: 45 frames with batch_size=33 creates [33, 33] instead of [33, 12] - Recommended when using large batch sizes and video length is not a multiple of batch_size - Increases VRAM usage slightly but ensures consistent temporal coherence across all batches
lab: Full perceptual color matching with detail preservation (recommended for highest fidelity to original)wavelet: Frequency-based natural colors, preserves details wellwavelet_adaptive: Wavelet base + targeted saturation correctionhsv: Hue-conditional saturation matchingadain: Statistical style transfernone: No color correctionnone: Keep all tensors on inference device (fastest but highest VRAM)cpu: Offload to system RAM (recommended for long videos, slower transfers)cuda:X: Offload to another GPU (good balance if available, faster than CPU)Output: - Upscaled video frames with color correction applied - Format (RGB/RGBA) matches input - Range [0, 1] normalized for ComfyUI compatibility
Basic Workflow (High VRAM - 24GB+):
Load Video Frames
↓
SeedVR2 Load DiT Model
├─ model: seedvr2_ema_3b_fp16.safetensors
└─ device: cuda:0
↓
SeedVR2 Load VAE Model
├─ model: ema_vae_fp16.safetensors
└─ device: cuda:0
↓
SeedVR2 Video Upscaler
├─ batch_size: 21
└─ resolution: 1080
↓
Save Video/Frames
Low VRAM Workflow (8-12GB):
Load Video Frames
↓
SeedVR2 Load DiT Model
├─ model: seedvr2_ema_3b-Q8_0.gguf
├─ device: cuda:0
├─ offload_device: cpu
├─ blocks_to_swap: 32
└─ swap_io_components: True
↓
SeedVR2 Load VAE Model
├─ model: ema_vae_fp16.safetensors
├─ device: cuda:0
├─ encode_tiled: True
└─ decode_tiled: True
↓
SeedVR2 Video Upscaler
├─ batch_size: 5
└─ resolution: 720
↓
Save Video/Frames
High Performance Workflow (24GB+ with torch.compile):
Load Video Frames
↓
SeedVR2 Torch Compile Settings
├─ mode: max-autotune
└─ backend: inductor
↓
SeedVR2 Load DiT Model
├─ model: seedvr2_ema_7b_sharp_fp16.safetensors
├─ device: cuda:0
└─ torch_compile_args: connected
↓
SeedVR2 Load VAE Model
├─ model: ema_vae_fp16.safetensors
├─ device: cuda:0
└─ torch_compile_args: connected
↓
SeedVR2 Video Upscaler
├─ batch_size: 81
└─ resolution: 1080
↓
Save Video/Frames
Complete walkthrough of version 2.5 by Adrien from AInVFX, covering the new 4-node architecture, GGUF support, memory optimizations, and production workflows:
This comprehensive tutorial covers: - Installing v2.5 through ComfyUI Manager and troubleshooting conflicts - Understanding the new 4-node modular architecture and why we rebuilt it - Running 7B models on 8GB VRAM with GGUF quantization - Configuring BlockSwap, VAE tiling, and torch.compile for your hardware - Image and video upscaling workflows with alpha channel support - CLI for batch processing and multi-GPU rendering - Memory optimization strategies for different VRAM levels - Real production tips and the critical batch_size formula (4n+1)
For reference, here's the original tutorial covering the initial release:
Note: This tutorial covers the previous single-node architecture. While the UI has changed significantly in v2.5, the core concepts about BlockSwap and memory management remain valuable.
The CLI provides comprehensive options for single-GPU, multi-GPU, and batch processing workflows.
Basic Usage Examples:
```bash
Registry Link: ComfyUI Registry - SeedVR2 Video Upscaler
uv venv --python 3.13
uv pip install --pre torch torchvision torchaudio --index-url https://download.pytorch.org/whl/nightly/cu130
该工具使用AI技术,提高视频质量,适用于ComfyUI用户,评分7.5/10
AI Skill Hub 为第三方内容聚合平台,本页面信息基于公开数据整理,不对工具功能和质量作任何法律背书。
建议在沙箱或测试环境中充分验证后,再部署至生产环境,并做好必要的安全评估。
✅ Apache 2.0 — 宽松开源协议,可商用,需保留版权声明和 NOTICE 文件,含专利授权条款。
经综合评估,ComfyUI 节点式AI图像生成 在AI工具赛道中表现稳健,质量良好。如果你已有明确的使用需求,可以直接上手体验;如果还在评估阶段,建议对比同类工具后再做决策。
| 原始名称 | ComfyUI-SeedVR2_VideoUpscaler |
| 原始描述 | 开源AI工具:Official SeedVR2 Video Upscaler for ComfyUI。⭐2.4k · Python |
| Topics | aicomfyuicomfyui-nodesupscalervideo-processing |
| GitHub | https://github.com/numz/ComfyUI-SeedVR2_VideoUpscaler |
| License | Apache-2.0 |
| 语言 | Python |
收录时间:2026-05-14 · 更新时间:2026-05-16 · License:Apache-2.0 · AI Skill Hub 不对第三方内容的准确性作法律背书。