Unraveling LoRA Interference: Orthogonal Subspaces for Robust Model Merging 文章

ArXiv CS.CL2026-06-01NEWSen作者: Haobo Zhang, Jiayu Zhou

摘要

arXiv:2505.22934v2 Announce Type: replace Abstract: Fine-tuning large language models (LMs) for individual tasks yields strong performance but is expensive for deployment and storage. Recent works explore model merging to combine multiple task-specific models into a single multi-task model without additional training. However, existing merging methods often fail for models fine-tuned with low-rank adaptation (LoRA), due to significant performance degradation. In this paper, we show that this issue arises from a previously overlooked interplay between model parameters and data distributions. We propose Orthogonal Subspaces for Robust model Merging (OSRM) to constrain the LoRA subspace *prior* to fine-tuning, ensuring that updates relevant to one task do not adversely shift outputs for others. Our approach can seamlessly integrate with most existing merging algorithms, reducing the unintended interference among tasks.

Unraveling LoRA Interference: Orthogonal Subspaces for Robust Model Merging 文章

摘要

相关事件查看全部 (1)

相关公司

相关人物

相关产品

相关技术查看全部 (2)