Concept Heterogeneity-aware Representation Steering 事件
PRODUCT_LAUNCH2026-06-02影响: MEDIUM
Concept Heterogeneity-aware Representation Steering arXiv:2603.02237v2 Announce Type: replace-cross Abstract: Representation steering offers a lightweight mechanism for controlling the behavior of large language models (LLMs) by intervening on internal activations at inference time. Most existing methods rely on a single global steering direction, typically obtained via difference-in-means over contrastive datasets. This approach implicitly assumes that the target concept is homogeneously repre