Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Hugging Face Blog, 2024-09-18