MergeTok: Unified Continuous and Discrete Visual Tokenization via Token Merging
Event: PRODUCT_LAUNCH | 2026-06-01 | Impact: MEDIUM
arXiv:2605.30904v1
Abstract: Most visual tokenizers for image generation are bifurcated into two families with complementary limitations: continuous VAEs offer high-fidelity reconstruction but suffer from dense, entangled latents that are poorly suited for semantic control, whereas discrete VQ-based models enable autoregressive generation yet struggle with gradient sparsity, unstable training, and
Related coverage
MergeTok: Unified Continuous and Discrete Visual Tokenization via Token Merging
ArXiv CS.CV, 2026-06-01