FairyFuse: Multiplication-Free LLM Inference on CPUs via Fused Ternary Kernels 文章

news.ycombinator.com2026-05-12NEWSen作者: PaulHoule

FairyFuse: Multiplication-Free LLM Inference on CPUs via Fused Ternary Kernels · 相关技术

暂无数据