Learning from Semantic Dictionaries: Discriminative Codebook Contrastive Learning for Unified Visual Representation and Generation 文章

ArXiv CS.CV2026-05-26NEWSen作者: Imanol G. Estepa, Jes\'us M Rodr\'iguez-de-Vera, Bhalaji Nagarajan, Petia Radeva

摘要

arXiv:2605.25012v1 Announce Type: new Abstract: Discriminative and generative vision models excel in their respective domains but remain semantically misaligned, hindering progress toward unified visual learning. We introduce LEASE (LEArning from SEmantic Dictionaries), a self-supervised framework that bridges this gap using a paired generative-discriminative codebook design. LEASE operates entirely in a discrete token space produced through a one-time precomputation step, enabling efficient training without data augmentations, teacher models, or online tokenizers. LEASE integrates two complementary objectives: a masked token reconstruction loss that captures fine-grained generative detail, and a codebook contrast loss that aligns encoder features with discriminative semantics via adaptive centroid weighting. This dual supervision yields a unified latent space that supports both high-quality generation and strong representation learning.

相关公司

暂无数据

相关人物

暂无数据

相关技术

暂无数据