Smaller is better: Q8-Chat, an efficient generative AI experience on Xeon

Hugging Face Blog, 2023-05-16