Cultural Value Alignment Via Latent Activation Steering in Large Language Models 事件
PRODUCT_LAUNCH2026-05-27影响: MEDIUM
Cultural Value Alignment Via Latent Activation Steering in Large Language Models arXiv:2605.26365v1 Announce Type: new Abstract: Large Language Models (LLMs) often exhibit homogenized cultural perspectives. While the World Values Survey (WVS) provides a gold standard for mapping human values, traditional direct prompting of LLMs on WVS often fails to access the model's latent cultural depth, leading to safety-aligned refusals or neutral responses. Here, we propose a generalizable framework for
Cultural Value Alignment Via Latent Activation Steering in Large Language Models · 相关公司
W
World LabsRESEARCH_INSTITUTE
A
arXivNONPROFIT
I
IRECNONPROFIT
T
TRANSITIONSRESEARCH_INSTITUTE
H
HuMANONPROFIT
F
FrameworkCOMPANY
A
ACTNONPROFIT
I
ITUNONPROFIT
C
CulturaGOVERNMENT
V
VIACOMPANY