The Reversible Residual Network: Backpropagation Without Storing Activations 论文

2017arXiv (Cornell University)引用 228

Advanced Neural Network ApplicationsAdversarial Robustness in Machine LearningMedical Imaging and Analysis

人工智能 Advanced Neural Network Applications Adversarial Robustness in Machine Learning Medical Imaging and Analysis

作者

摘要

Residual Networks (ResNets) have demonstrated significant improvement over traditional Convolutional Neural Networks (CNNs) on image classification, increasing in performance as networks grow both deeper and wider. However, memory consumption becomes a bottleneck as one needs to store all the intermediate activations for calculating gradients using backpropagation. In this work, we present the Reversible Residual Network (RevNet), a variant of ResNets where each layer's activations can be reconstructed exactly from the next layer's. Therefore, the activations for most layers need not be stored in memory during backprop. We demonstrate the effectiveness of RevNets on CIFAR and ImageNet, establishing nearly identical performance to equally-sized ResNets, with activation storage requirements independent of depth.

作者查看全部 (4)

Roger Grosse

Raquel Urtasun

Mengye Ren

Aidan N. Gomez

The Reversible Residual Network: Backpropagation Without Storing Activations 论文

摘要

作者查看全部 (4)

相关技术查看全部 (2)

相关事件

相关文章