PACZero: PAC-Private Fine-Tuning of Language Models via Sign Quantization 文章

ArXiv CS.AI2026-05-26NEWSen作者: Murat Bilgehan Ertan, Xiaochen Zhu, Phuong Ha Nguyen, Marten van Dijk, Srinivas Devadas

详细信息

来源站点
ArXiv CS.AI
作者
Murat Bilgehan Ertan, Xiaochen Zhu, Phuong Ha Nguyen, Marten van Dijk, Srinivas Devadas
文章类型
NEWS
语言
en
发布日期
2026-05-26

摘要

arXiv:2605.06505v2 Announce Type: replace-cross Abstract: We introduce PACZero, a family of PAC-private zeroth-order mechanisms for fine-tuning large language models that delivers usable utility at $I(S^*; Y_{1:T})=0$. This privacy regime bounds the membership-inference attack (MIA) posterior success rate at the prior, an MIA-resistance level the DP framework matches only at $\varepsilon=0$ and infinite noise. All DP-ZO comparisons below are matched at the MIA posterior level. The key insight is that PAC Privacy charges mutual information only when the release depends on which candidate subset is the secret. Sign-quantizing subset-aggregated zeroth-order gradients creates frequent unanimity, steps at which every candidate subset agrees on the update direction; at these steps the released sign costs zero conditional mutual information.

相关事件

暂无数据

相关公司查看全部 (4)

A
ANDINONPROFIT
A
ACTNONPROFIT

相关人物

暂无数据