Gradient descent optimizes over-parameterized deep ReLU networks (paper)

2019 | Machine Learning | Cited by 226
Advanced Neural Network Applications | Stochastic Gradient Optimization Techniques | Domain Adaptation and Few-Shot Learning