FairyFuse: Multiplication-Free LLM Inference on CPUs via Fused Ternary Kernels 文章

news.ycombinator.com2026-05-12NEWSen作者: PaulHoule