inference 产品
来源: githubOPEN_SOURCE开源PythonApache-2.0发布于 2023-06-14
Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.
9346
Stars
835
Forks
4
技术栈
0
替代方案
50
相关事件
inference · 相关文章
相关文章
Show HN: BonzAI – self-sovereign, local LLM inference in the browser
news.ycombinator.com2026-05-22
Cutting inference cold starts by 40x with LP, FUSE, C/R, and CUDA-checkpoint
news.ycombinator.com2026-05-18
UK sovereign LLM inference
news.ycombinator.com2026-05-15
Elasticsearch 9.4.1 发布
开源中国2026-05-13
FairyFuse: Multiplication-Free LLM Inference on CPUs via Fused Ternary Kernels
news.ycombinator.com2026-05-12
Building Blocks for Foundation Model Training and Inference on AWS
Hugging Face Blog2026-05-11
DeepInfra on Hugging Face Inference Providers 🔥
Hugging Face Blog2026-04-29
OVHcloud on Hugging Face Inference Providers 🔥
Hugging Face Blog2025-11-24
Scaleway on Hugging Face Inference Providers 🔥
Hugging Face Blog2025-09-19
Public AI on Hugging Face Inference Providers 🔥
Hugging Face Blog2025-09-17