OrpQuant: Geometric Orthogonal Residual Projection for Multiplier-Free Power-of-Two Transformer Quantization 事件

PRODUCT_LAUNCH2026-05-26影响: MEDIUM

OrpQuant: Geometric Orthogonal Residual Projection for Multiplier-Free Power-of-Two Transformer Quantization arXiv:2605.26092v1 Announce Type: cross Abstract: The deployment of Large Language Models (LLMs) and Vision Transformers (ViTs) on edge devices is significantly constrained by memory limitations and the critical timing bottlenecks introduced by dense Multiply-Accumulate (MAC) arrays. In the ultra-low bit regime, logarithmic Power-of-Two (PoT) quantization provides a hardware-efficient al

OrpQuant: Geometric Orthogonal Residual Projection for Multiplier-Free Power-of-Two Transformer Quantization · 相关公司

A
arXivNONPROFIT
E
EnsionCOMPANY
F
FrameworkCOMPANY
E
EATNONPROFIT
A
ACTNONPROFIT
T
ThresholdsNONPROFIT
E
EGINONPROFIT
U
UniforNONPROFIT
R
RatioRESEARCH_INSTITUTE