What is: Stochastic Depth?
Source | Deep Networks with Stochastic Depth |
Year | 2000 |
Data Source | CC BY-SA - https://paperswithcode.com |
Stochastic Depth aims to shrink the depth of a network during training, while keeping it unchanged during testing. This is achieved by randomly dropping entire ResBlocks during training and bypassing their transformations through skip connections.
Let {} denote a Bernoulli random variable, which indicates whether the th ResBlock is active () or inactive (). Further, let us denote the “survival” probability of ResBlock as . With this definition we can bypass the th ResBlock by multiplying its function with and we extend the update rule to:
If , this reduces to the original ResNet update and this ResBlock remains unchanged. If , the ResBlock reduces to the identity function, .