DPU or GPU for Accelerating Neural Networks Inference -- Why not both? Split CNN Inference

Event: PRODUCT_LAUNCH | 2026-06-05 | Impact: MEDIUM

arXiv:2605.00174v2 (announce type: replace-cross)

Abstract: Video and image streaming on edge devices requires low latency. To address this, Neural Networks (NNs) are widely used, and prior work mainly focuses on accelerating them with single hardware units such as Graphics Processing Units (GPUs), Field Programmable Gate Arrays (FPGAs), and Deep Learning Processing Units (DPUs). However, further reductions