What is MoViNet?
Source | MoViNets: Mobile Video Networks for Efficient Video Recognition |
Year | 2021 |
Data Source | CC BY-SA - https://paperswithcode.com |
Mobile Video Network, or MoViNet, is a computation- and memory-efficient video network that can operate on streaming video for online inference. Three techniques improve efficiency while reducing the peak memory usage of 3D CNNs. First, a video network search space is designed, and neural architecture search is employed to generate efficient and diverse 3D CNN architectures. Second, a Stream Buffer technique decouples memory from video clip duration, allowing 3D CNNs to embed arbitrary-length streaming video sequences for both training and inference with a small constant memory footprint. Third, a simple ensembling technique improves accuracy further without sacrificing efficiency.
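The Stream Buffer idea can be illustrated with a toy example. The sketch below is not the MoViNet implementation: it assumes a single causal temporal convolution over scalar per-frame features, and the names `causal_temporal_conv` and `stream` are hypothetical. It shows how caching the last few frames of each sub-clip lets a long video be processed in short pieces, with memory that depends on the kernel size rather than on the video length.

```python
from typing import List, Optional, Tuple


def causal_temporal_conv(
    frames: List[float],
    kernel: List[float],
    buffer: Optional[List[float]] = None,
) -> Tuple[List[float], List[float]]:
    """Apply a causal temporal convolution to one sub-clip (toy, 1 feature per frame).

    `buffer` holds the last len(kernel) - 1 frames seen before this sub-clip
    (zeros for the first sub-clip). Returns (outputs, new_buffer).
    """
    pad = len(kernel) - 1
    if buffer is None:
        buffer = [0.0] * pad  # assumption: zero-initialised buffer at stream start
    padded = buffer + frames  # prepend cached frames along the time axis
    outputs = [
        sum(w * padded[t + i] for i, w in enumerate(kernel))
        for t in range(len(frames))
    ]
    # Keep only the trailing frames the next sub-clip will need.
    new_buffer = padded[len(padded) - pad:]
    return outputs, new_buffer


def stream(video: List[float], kernel: List[float], clip_len: int) -> List[float]:
    """Process `video` in sub-clips of `clip_len` frames, carrying the buffer along."""
    buffer = None
    outputs: List[float] = []
    for start in range(0, len(video), clip_len):
        clip = video[start:start + clip_len]
        clip_out, buffer = causal_temporal_conv(clip, kernel, buffer)
        outputs.extend(clip_out)
    return outputs


if __name__ == "__main__":
    video = [float(i) for i in range(1, 11)]  # 10 toy "frames"
    kernel = [0.5, 0.25, 0.25]
    whole, _ = causal_temporal_conv(video, kernel)
    # Streaming in 3-frame sub-clips matches processing the whole video at once.
    print(stream(video, kernel, 3) == whole)
```

In this toy setting the buffer never holds more than `len(kernel) - 1` frames, however long the video is. A real 3D CNN would presumably keep one such buffer of feature maps per temporal layer, which is the sense in which memory is decoupled from clip duration.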