Mask the Target: A Plug-and-Play Regularizer Against LoRA Forgetting 文章

ArXiv CS.CV2026-05-29NEWSen作者: Runze Xu, Arpit Garg, Hemanth Saratchandran, Simon Lucey

摘要

arXiv:2605.29498v1 Announce Type: cross Abstract: Low-Rank Adaptation (LoRA) has become one of the most widely used fine-tuning mechanisms for adapting large language models to new domains, tasks, and users. Yet adaptation performance alone can obscure an important failure mode: LoRA updates may improve performance on the target distribution while degrading prior capabilities learned during pretraining and alignment. We show that this forgetting becomes especially severe when the adaptation distribution differs substantially from the models original training or alignment distributions. The challenge is amplified in practical settings, where the original training and alignment data are typically unavailable. Motivated by this constraint, we study how LoRA based adaptation balances new learning against forgetting in a replay-free setting, and introduce a simple output space regularizer that can be added directly to existing training pipelines.

Mask the Target: A Plug-and-Play Regularizer Against LoRA Forgetting 文章

摘要

相关事件查看全部 (1)

相关公司

相关人物

相关产品

相关技术查看全部 (1)