Profiling-Driven Adaptive Distributed Transformer Inference on Embedded Edge Deployment 事件

PRODUCT_LAUNCH2026-05-26影响: MEDIUM

Profiling-Driven Adaptive Distributed Transformer Inference on Embedded Edge Deployment arXiv:2605.25682v1 Announce Type: cross Abstract: Distributing Transformer inference across embedded edge devices can alleviate individual memory and compute constraints, yet practical benefits on real hardware remain unclear: prior work relies largely on simulations that overlook hardware-specific communication overheads. We present a hardware prototype study on NVIDIA Jetson Orin Nano devices connected ove