What is: DALL·E 2?
Source | Hierarchical Text-Conditional Image Generation with CLIP Latents |
Year | 2022 |
Data Source | CC BY-SA - https://paperswithcode.com |
DALL·E 2 is a generative text-to-image model made up of two main components: a prior that generates a CLIP image embedding given a text caption, and a decoder that generates an image conditioned on the image embedding.
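The two-stage structure can be illustrated with a toy sketch. This is not the actual DALL·E 2 implementation: `toy_prior`, `toy_decoder`, `generate`, and `EMBED_DIM` are hypothetical names, and simple pseudo-random functions stand in for the learned models. It only shows the data flow: caption to image embedding (prior), then image embedding to image (decoder).

```python
import hashlib
import random
from typing import List

EMBED_DIM = 8  # toy size chosen for illustration; not the real embedding width


def toy_prior(caption: str, rng: random.Random) -> List[float]:
    """Hypothetical stand-in for the prior: caption -> image embedding.

    A real prior is a learned generative model; here we derive a
    deterministic pseudo-embedding from the caption and add sampling noise.
    """
    digest = hashlib.sha256(caption.encode("utf-8")).digest()
    caption_rng = random.Random(int.from_bytes(digest[:8], "big"))
    base = [caption_rng.uniform(-1.0, 1.0) for _ in range(EMBED_DIM)]
    return [x + rng.gauss(0.0, 0.1) for x in base]


def toy_decoder(image_embedding: List[float], height: int, width: int,
                rng: random.Random) -> List[List[float]]:
    """Hypothetical stand-in for the decoder: image embedding -> image.

    Returns a height x width grid of grayscale values in [0, 1] whose
    overall brightness depends on the conditioning embedding.
    """
    bias = sum(image_embedding) / len(image_embedding)
    return [
        [min(1.0, max(0.0, 0.5 + 0.5 * bias + rng.gauss(0.0, 0.05)))
         for _ in range(width)]
        for _ in range(height)
    ]


def generate(caption: str, height: int = 4, width: int = 4,
             seed: int = 0) -> List[List[float]]:
    """Run the two stages in sequence: prior first, then decoder."""
    rng = random.Random(seed)
    image_embedding = toy_prior(caption, rng)
    return toy_decoder(image_embedding, height, width, rng)


if __name__ == "__main__":
    for row in generate("a corgi playing a trumpet"):
        print(" ".join(f"{v:.2f}" for v in row))
```

Keeping the prior and decoder as separate functions mirrors the description above: the decoder never sees the caption in this sketch, only the image embedding that the prior produced.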