Viet-Anh on Software Logo

What is: (2+1)D Convolution?

SourceA Closer Look at Spatiotemporal Convolutions for Action Recognition
Year2000
Data SourceCC BY-SA - https://paperswithcode.com

A (2+1)D Convolution is a type of convolution used for action recognition convolutional neural networks, with a spatiotemporal volume. As opposed to applying a 3D Convolution over the entire volume, which can be computationally expensive and lead to overfitting, a (2+1)D convolution splits computation into two convolutions: a spatial 2D convolution followed by a temporal 1D convolution.