Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference

Hugging Face Blog, 2025-01-16