FlexFlow: A Flexible Dataflow Accelerator Architecture for Convolutional Neural Networks 论文

2017引用 335
Advanced Neural Network ApplicationsAdversarial Robustness in Machine LearningAdvanced Memory and Neural Computing

详细信息

发表日期
2017-02-01
发表年份
2017

关键词

Advanced Neural Network ApplicationsAdversarial Robustness in Machine LearningAdvanced Memory and Neural Computing

摘要

Convolutional Neural Networks (CNN) are very computation-intensive. Recently, a lot of CNN accelerators based on the CNN intrinsic parallelism are proposed. However, we observed that there is a big mismatch between the parallel types supported by computing engine and the dominant parallel types of CNN workloads. This mismatch seriously degrades resource utilization of existing accelerators. In this paper, we propose a flexible dataflow architecture (FlexFlow) that can leverage the complementary effects among feature map, neuron, and synapse parallelism to mitigate the mismatch. We evaluated our design with six typical practical workloads, it acquires 2-10x performance speedup and 2.5-10x power efficiency improvement compared with three state-of-the-art accelerator architectures. Meanwhile, FlexFlow is highly scalable with growing computing engine scale.