What is: VideoBERT?
Source | VideoBERT: A Joint Model for Video and Language Representation Learning |
Year | 2000 |
Data Source | CC BY-SA - https://paperswithcode.com |
VideoBERT adapts the powerful BERT model to learn a joint visual-linguistic representation for video. It is used in numerous tasks, including action classification and video captioning.