DeepSpeed- Inference: Enabling Efficient Inference of Transformer Models at Unprecedented Scale 论文

2022引用 218
Advanced Neural Network ApplicationsTopic ModelingAdvanced Graph Neural Networks

DeepSpeed- Inference: Enabling Efficient Inference of Transformer Models at Unprecedented Scale · 相关技术