inference 产品

来源: githubOPEN_SOURCE开源PythonApache-2.0发布于 2023-06-14

Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.

9346

Stars

835

Forks

4

技术栈

0

替代方案

50

相关事件

inference · 相关事件

相关事件

GITCO: Gated Inference-Time Context Optimization in TSFMs
2026-06-06PRODUCT_LAUNCH影响: MEDIUM
AdaMEM: Test-Time Adaptive Memory for Language Agents
2026-06-06PRODUCT_LAUNCH影响: MEDIUM
IR3DE: A Linear Router for Large Language Models
2026-06-05PRODUCT_LAUNCH影响: MEDIUM
On Advantage Estimates for Max@K Policy Gradients
2026-06-05PRODUCT_LAUNCH影响: MEDIUM
Unsupervised Skill Discovery for Agentic Data Analysis
2026-06-05PRODUCT_LAUNCH影响: MEDIUM
A Survey on Diffusion Language Models
2026-06-05PRODUCT_LAUNCH影响: MEDIUM
A Survey on Diffusion Language Models
2026-06-05BREAKTHROUGH影响: HIGH