
Deep Layers as Stochastic Solvers


We provide a novel perspective on the forward pass through a block of layers in a deep network. In particular, we show that a forward pass through a standard dropout layer followed by a linear layer and a non-linear activation is equivalent to optimizing a convex optimization objective with a single iteration of a $\tau$-nice Proximal Stochastic Gradient method. We further show that replacing standard Bernoulli dropout with additive dropout is equivalent to optimizing the same convex objective with a variance-reduced proximal method. By expressing both fully-connected and convolutional layers as special cases of a high-order tensor product, we unify the underlying convex optimization problem in the tensor setting and derive a formula for the Lipschitz constant $L$ used to determine the optimal step size of the above proximal methods. We conduct experiments with standard convolutional networks applied to the CIFAR-10 and CIFAR-100 datasets, and show that replacing a block of layers with multiple iterations of the corresponding solver, with step size set via $L$, consistently improves classification accuracy.
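To make the stated equivalence concrete, here is a minimal numerical sketch under simplifying assumptions that are not the paper's construction: take the convex surrogate $f(y) = \tfrac{1}{2}\|y\|^2 - \langle Wx, y\rangle$ with $g$ the indicator of the non-negative orthant, so that ReLU is exactly $\mathrm{prox}_g$ and the Lipschitz constant of $\nabla f$ is $L = 1$. Using an inverted-dropout estimate of $Wx$ as the stochastic gradient, a single proximal stochastic gradient step from $y = 0$ with step size $1/L$ reproduces the dropout → linear → ReLU forward pass. All variable names below (`W`, `x`, `p_keep`) are illustrative.

```python
import numpy as np

# Sketch: one proximal stochastic gradient step on the simplified convex
# surrogate f(y) = 0.5*||y||^2 - <W x, y>, with g(y) the indicator of the
# non-negative orthant (so prox_g is ReLU and L = 1).  Dropout supplies an
# unbiased (inverted-dropout) estimate of W x, i.e. a stochastic gradient.

rng = np.random.default_rng(0)
d_in, d_out, p_keep = 8, 4, 0.5

W = rng.standard_normal((d_out, d_in))
x = rng.standard_normal(d_in)

# --- standard forward pass: dropout -> linear -> ReLU ----------------------
mask = rng.binomial(1, p_keep, size=d_in)      # Bernoulli dropout mask
x_drop = mask * x / p_keep                     # inverted-dropout scaling
forward = np.maximum(W @ x_drop, 0.0)          # ReLU activation

# --- one proximal stochastic gradient step on the surrogate objective ------
L = 1.0                                        # Lipschitz constant of grad f
eta = 1.0 / L                                  # step size 1/L
y0 = np.zeros(d_out)                           # initial point
stoch_grad = y0 - W @ x_drop                   # unbiased estimate of grad f(y0)
y1 = np.maximum(y0 - eta * stoch_grad, 0.0)    # prox_g = projection onto y >= 0

assert np.allclose(forward, y1)                # the two computations coincide
```

With this toy surrogate $L = 1$, so the step size $1/L$ prescribed above reduces to the unit step implicit in an ordinary forward pass; the paper's actual objective, $\tau$-nice sampling analysis, and Lipschitz-constant formula for tensor (fully-connected and convolutional) layers are more general.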
