What is: Modulated Residual Network?

Source: Modulating early visual processing by language
Year: 2017
Data Source: CC BY-SA - https://paperswithcode.com

MODERN, or Modulated Residual Network, is an architecture for visual question answering (VQA). It uses conditional batch normalization: a linguistic embedding produced by an LSTM modulates the batch normalization scale and shift parameters of a ResNet. Because these parameters act on each channel as a whole, the linguistic embedding can manipulate entire feature maps by scaling them up or down, negating them, or shutting them off.
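
The per-channel modulation described above can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the function names `conditional_batch_norm` and `predict_deltas`, the one-layer predictor, and the toy numbers are all assumptions made for illustration. It also simplifies by handling a single image-question pair and computing normalization statistics over each channel's own values. The language-dependent changes are added to the ResNet's existing scale (`gamma`) and shift (`beta`).

```python
import math


def predict_deltas(embedding, weights, bias):
    """Hypothetical one-layer predictor: one linear output per channel,
    computed from the question embedding (e.g. an LSTM's final state)."""
    return [sum(w * e for w, e in zip(row, embedding)) + b
            for row, b in zip(weights, bias)]


def conditional_batch_norm(features, gamma, beta, delta_gamma, delta_beta, eps=1e-5):
    """Normalize each channel, then apply language-conditioned scale and shift.

    features: list of channels, each a flat list of activations
              (spatial positions flattened; a simplification for this sketch).
    gamma, beta: the network's own per-channel scale and shift.
    delta_gamma, delta_beta: per-channel changes predicted from the question.
    """
    out = []
    for c, channel in enumerate(features):
        mean = sum(channel) / len(channel)
        var = sum((x - mean) ** 2 for x in channel) / len(channel)
        scale = gamma[c] + delta_gamma[c]
        shift = beta[c] + delta_beta[c]
        out.append([scale * (x - mean) / math.sqrt(var + eps) + shift
                    for x in channel])
    return out


# Toy example: three identical channels, modulated in three different ways.
features = [[1.0, 2.0, 3.0, 4.0] for _ in range(3)]
gamma = [1.0, 1.0, 1.0]
beta = [0.0, 0.0, 0.0]
delta_gamma = [0.0, -2.0, -1.0]  # keep as-is, negate (1 - 2 = -1), shut off (1 - 1 = 0)
delta_beta = [0.0, 0.0, 0.0]

modulated = conditional_batch_norm(features, gamma, beta, delta_gamma, delta_beta)
```

In this toy setup the first channel is left as plain batch normalization, the second comes out as its mirror image, and the third is zeroed. In a full model the deltas would come from something like `predict_deltas` applied to the question embedding rather than being set by hand.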