When prompt perturbations break your A/B test: A valid statistical test for generative surveying · Event

PRODUCT_LAUNCH · 2026-05-28 · Impact: MEDIUM

arXiv:2605.27463v1 · Announce Type: cross

Abstract: Generative surveying -- where collections of LLM-based personas provide feedback on messages -- has emerged as a cheap and scalable alternative to traditional market research. However, LLMs are sensitive to small variations in prompt design, and conclusions drawn from generative surveys may depend on arbitrary phrasing choices. Controlling for this se
