RadJEPA: Radiology Encoder for Chest X-Rays via Joint Embedding Predictive Architecture 文章

ArXiv CS.CV2026-05-27NEWSen作者: Anas Anwarul Haq Khan, Mariam Husain, Pratik Jalan, Kshitij Jadhav

摘要

arXiv:2601.15891v3 Announce Type: replace Abstract: Vision-language pretraining has driven much of the recent progress in medical image representation learning, but this paradigm is constrained by the availability of paired image-text data and by the reporting bias of clinical narratives. We ask whether competitive radiology encoders can be learned without any language supervision. We introduce RadJEPA, a self-supervised framework built on a Joint Embedding Predictive Architecture and pretrained on approximately 840K unlabeled chest X-ray images. The model learns to predict latent representations of masked target regions from a visible context region, an objective that differs from both image-text contrastive pretraining and DINO-style self-distillation by explicitly modelling conditional structure in representation space.

RadJEPA: Radiology Encoder for Chest X-Rays via Joint Embedding Predictive Architecture 文章

摘要

相关事件查看全部 (1)

相关公司查看全部 (3)

相关人物

相关产品查看全部 (9)

相关技术查看全部 (23)