When prompt perturbations break your A/B test: A valid statistical test for generative surveying 文章

ArXiv CS.AI2026-05-28NEWSen作者: Hayden Helm, Carey Priebe

When prompt perturbations break your A/B test: A valid statistical test for generative surveying · 相关技术