What is: Simple Visual Language Model?
Source | SimVLM: Simple Visual Language Model Pretraining with Weak Supervision |
Year | 2000 |
Data Source | CC BY-SA - https://paperswithcode.com |
SimVLM is a minimalist pretraining framework to reduce training complexity by exploiting large-scale weak supervision. It is trained end-to-end with a single prefix language modeling (PrefixLM) objective. PrefixLM enables bidirectional attention within the prefix sequence, and thus it is applicable for both decoder-only and encoder-decoder sequence-to-sequence language models.