VLSI Implementation of Deep Neural Network Using Integral Stochastic Computing 论文

2017IEEE Transactions on Very Large Scale Integration (VLSI) Systems引用 255
Error Correcting Code TechniquesStochastic Gradient Optimization TechniquesWireless Communication Security Techniques

详细信息

发表期刊/会议
IEEE Transactions on Very Large Scale Integration (VLSI) Systems
发表日期
2017-02-01
发表年份
2017

关键词

Error Correcting Code TechniquesStochastic Gradient Optimization TechniquesWireless Communication Security Techniques

摘要

The hardware implementation of deep neural networks (DNNs) has recently received tremendous attention: many applications in fact require high-speed operations that suit a hardware implementation. However, numerous elements and complex interconnections are usually required, leading to a large area occupation and copious power consumption. Stochastic computing (SC) has shown promising results for low-power area-efficient hardware implementations, even though existing stochastic algorithms require long streams that cause long latencies. In this paper, we propose an integer form of stochastic computation and introduce some elementary circuits. We then propose an efficient implementation of a DNN based on integral SC. The proposed architecture has been implemented on a Virtex7 field-programmable gate array, resulting in 45% and 62% average reductions in area and latency compared with the best reported architecture in the literature. We also synthesize the circuits in a 65-nm CMOS technology, and we show that the proposed integral stochastic architecture results in up to 21% reduction in energy consumption compared with the binary radix implementation at the same misclassification rate. Due to fault-tolerant nature of stochastic architectures, we also consider a quasi-synchronous implementation that yields 33% reduction in energy consumption with respect to the binary radix implementation without any compromise on performance.