Prototype Transformer: Towards Language Model Architectures Interpretable by Design [Event]

BREAKTHROUGH | 2026-06-02 | Impact: HIGH

arXiv:2602.11852v2
Announce Type: replace-cross

Abstract: While state-of-the-art language models (LMs) surpass most humans in certain domains, their reasoning remains largely opaque, reducing trust and increasing the risk of deception and hallucination. We introduce the Prototype Transformer (ProtoT), an autoregressive LM architecture that replaces the quadratic-cost self-attention module of the Transformer with