TRACER: Persistent Regularization for Robust Multimodal Finetuning 文章

ArXiv CS.CV2026-05-29NEWSen作者: Hesam Asadollahzadeh, Feng Liu, Christopher Leckie, Sarah M. Erfani

摘要

arXiv:2605.29380v1 Announce Type: cross Abstract: Mainstream strategies for finetuning pretrained multimodal models often degrade out-of-distribution (OOD) robustness, a phenomenon known as catastrophic forgetting. In this paper, we develop a theoretical framework for multimodal contrastive finetuning, yielding closed-form solutions and a geometric decomposition for each strategy. This framework shows that self-distillation is more effective than other regularization approaches to retain the knowledge of the pretrained model. Our analysis reveals a largely overlooked limitation: standard Exponential Moving Average (EMA) teachers, widely used in robust finetuning, suffer from collapse. To solve this, we prove that a Weighted Moving Average (WMA) teacher maintains a persistent regularizing force over finite horizons and yields bias-free convergence in the task subspace while preserving orthogonal knowledge.

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据