What is: TrIVD-GAN?
Source | Transformation-based Adversarial Video Prediction on Large-Scale Data |
Year | 2000 |
Data Source | CC BY-SA - https://paperswithcode.com |
TrIVD-GAN, or Transformation-based & TrIple Video Discriminator GAN, is a type of generative adversarial network for video generation that builds upon DVD-GAN. Improvements include a novel transformation-based recurrent unit (the TSRU) that makes the generator more expressive, and an improved discriminator architecture.
In contrast with DVD-GAN, TrIVD-GAN has an alternative split for the roles of the discriminators, with judging per-frame global structure, while critiques local spatiotemporal structure. This is achieved by downsampling the randomly sampled frames fed to by a factor , and cropping clips inside the high resolution video fed to , where correspond to time, height, width and channel dimension of the input. This further reduces the number of pixels to process per video, from to .