Probability Distributions Computed by Autoregressive Transformers 事件

PRODUCT_LAUNCH2026-05-26影响: MEDIUM

Probability Distributions Computed by Autoregressive Transformers arXiv:2510.27118v4 Announce Type: replace Abstract: Most expressivity results for transformers treat them as language recognizers -- devices that accept or reject strings -- rather than as they are used in practice: as language models that generate strings autoregressively and probabilistically. We characterize the probability distributions that transformer language models can express. We show that making transformer language rec

Probability Distributions Computed by Autoregressive Transformers · 相关人物