Probability Distributions Computed by Autoregressive Transformers 事件

PRODUCT_LAUNCH2026-05-26影响: MEDIUM

Probability Distributions Computed by Autoregressive Transformers arXiv:2510.27118v4 Announce Type: replace Abstract: Most expressivity results for transformers treat them as language recognizers -- devices that accept or reject strings -- rather than as they are used in practice: as language models that generate strings autoregressively and probabilistically. We characterize the probability distributions that transformer language models can express. We show that making transformer language rec