Event: Triplet-Block Diffusion RWKV
PRODUCT_LAUNCH | 2026-05-26 | Impact: MEDIUM
arXiv:2605.25969v1 | Announce Type: new
Abstract: Causal Transformer language models suffer from strictly sequential decoding and a quadratic per-step attention cost. While linear-time causal models and discrete diffusion models each address these weaknesses, their integration remains inherently inconsistent: diffusion requires bidirectional attention, while causal models are unidirectional. To unify these architectures, we propose $B^3D-RWKV$, a diffusion RWKV varian