Mask the Target: A Plug-and-Play Regularizer Against LoRA Forgetting 文章

ArXiv CS.CV2026-05-29NEWSen作者: Runze Xu, Arpit Garg, Hemanth Saratchandran, Simon Lucey

摘要

arXiv:2605.29498v1 Announce Type: cross Abstract: Low-Rank Adaptation (LoRA) has become one of the most widely used fine-tuning mechanisms for adapting large language models to new domains, tasks, and users. Yet adaptation performance alone can obscure an important failure mode: LoRA updates may improve performance on the target distribution while degrading prior capabilities learned during pretraining and alignment. We show that this forgetting becomes especially severe when the adaptation distribution differs substantially from the models original training or alignment distributions. The challenge is amplified in practical settings, where the original training and alignment data are typically unavailable. Motivated by this constraint, we study how LoRA based adaptation balances new learning against forgetting in a replay-free setting, and introduce a simple output space regularizer that can be added directly to existing training pipelines.

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据