TransformerEngine 产品

来源: githubOPEN_SOURCE开源PythonApache-2.0发布于 2022-09-20

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.

3334

Stars

721

Forks

4

技术栈

0

替代方案

0

相关事件

TransformerEngine · 替代方案

暂无数据