When prompt perturbations break your A/B test: A valid statistical test for generative surveying 事件

Name: When prompt perturbations break your A/B test: A valid statistical test for generative surveying
Start: 2026-05-28

PRODUCT_LAUNCH2026-05-28影响: MEDIUM

When prompt perturbations break your A/B test: A valid statistical test for generative surveying arXiv:2605.27463v1 Announce Type: cross Abstract: Generative surveying -- where collections of LLM-based personas provide feedback on messages -- has emerged as a cheap and scalable alternative to traditional market research. However, LLMs are sensitive to small variations in prompt design and conclusions drawn from generative surveys may depend on arbitrary phrasing choices. Controlling for this se

人工智能

关系图谱

When prompt perturbations break your A/B test: A valid statistical test for generative surveying · 相关人物

暂无数据