Chiaroscuro Attention: Spending Compute in the Dark 事件

Name: Chiaroscuro Attention: Spending Compute in the Dark
Start: 2026-06-09

PRODUCT_LAUNCH2026-06-09影响: MEDIUM

Chiaroscuro Attention: Spending Compute in the Dark arXiv:2606.08327v1 Announce Type: cross Abstract: Standard transformers apply self-attention uniformly at every layer and token, regardless of whether the input requires dynamic cross-token interaction. We propose CHIAR-Former (Chiaroscuro Attention), a 4-layer hybrid transformer that routes each token to one of three operators - DCT spectral mixing, RBF kernel mixing, or full self-attention - based on per-token spectral entropy, a theoretical

人工智能

关系图谱

Chiaroscuro Attention: Spending Compute in the Dark 事件

Chiaroscuro Attention: Spending Compute in the Dark · 相关报道

相关报道