Faster Text Generation with Self-Speculative Decoding 文章

Hugging Face Blog2024-11-20BLOGen