TransformerEngine (Product)
Source: GitHub · Open source · Python · Apache-2.0 · Released 2022-09-20
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.
Stars: 3334
Forks: 721
Tech stack: 4
Alternatives: 0
Related events: 0
TransformerEngine · Alternatives
No data available