When NPUs Are Not Always Faster: A Stage-Level Analysis of Mobile LLM Inference

ArXiv CS.AI · 2026-05-28 · News · en · Authors: Pu Li, Jiawen Qi, Qinyu Chen

Related Technologies