Dual Mechanisms of Value Expression: Intrinsic vs. Prompted Values in Large Language Models 事件

PRODUCT_LAUNCH2026-06-01影响: MEDIUM

Dual Mechanisms of Value Expression: Intrinsic vs. Prompted Values in Large Language Models arXiv:2509.24319v4 Announce Type: replace Abstract: Large language models can express values in two main ways: (1) intrinsic expression, reflecting the model's inherent values learned during training, and (2) prompted expression, elicited by explicit prompts. Given their widespread use in value alignment, it is paramount to clearly understand their underlying mechanisms, particularly whether they mostly