What is: AutoDropout?
Source | AutoDropout: Learning Dropout Patterns to Regularize Deep Networks |
Year | 2000 |
Data Source | CC BY-SA - https://paperswithcode.com |
AutoDropout automates the process of designing dropout patterns using a Transformer based controller. In this method, a controller learns to generate a dropout pattern at every channel and layer of a target network, such as a ConvNet or a Transformer. The target network is then trained with the dropped-out pattern, and its resulting validation performance is used as a signal for the controller to learn from. The resulting pattern is applied to a convolutional output channel, which is a common building block of image recognition models.
The controller network generates the tokens to describe the configurations of the dropout pattern. The tokens are generated like words in a language model. For every layer in a ConvNet, a group of 8 tokens need to be made to create a dropout pattern. These 8 tokens are generated sequentially. In the figure above, size, stride, and repeat indicate the size and the tiling of the pattern; rotate, shear_x, and shear_y specify the geometric transformations of the pattern; share_c is a binary deciding whether a pattern is applied to all channels; and residual is a binary deciding whether the pattern is applied to the residual branch as well. If we need dropout patterns, the controller will generate decisions.