What is: Guided Language to Image Diffusion for Generation and Editing?
Source | GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models |
Year | 2021 |
Data Source | CC BY-SA - https://paperswithcode.com |
GLIDE (Guided Language to Image Diffusion for Generation and Editing) is a generative model that applies guided diffusion to text-conditional image synthesis, with the goal of more photorealistic image generation. A text encoder conditions the diffusion model on natural language descriptions, so the model can handle free-form prompts. Beyond zero-shot generation, the model also supports editing: it is fine-tuned to perform image inpainting, which allows model samples to be improved iteratively until they match more complex prompts.
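The description above does not say which form of guidance is used. As an illustration only, the following minimal sketch assumes classifier-free guidance, one common way to guide a text-conditional diffusion model: the noise prediction made with the text prompt is pushed away from the prediction made without it. The function name, the flat-list representation of the predictions, and the example numbers are hypothetical and are not taken from any GLIDE codebase.

```python
from typing import List


def classifier_free_guidance(
    eps_cond: List[float],
    eps_uncond: List[float],
    scale: float,
) -> List[float]:
    """Combine text-conditional and unconditional noise predictions.

    Assumed rule: eps_hat = eps_uncond + scale * (eps_cond - eps_uncond).
    scale = 0 gives the unconditional prediction, scale = 1 gives the
    conditional prediction, and scale > 1 strengthens the prompt's effect.
    """
    if len(eps_cond) != len(eps_uncond):
        raise ValueError("predictions must have the same length")
    return [u + scale * (c - u) for c, u in zip(eps_cond, eps_uncond)]


# Hypothetical per-pixel noise predictions, flattened to short lists.
eps_with_prompt = [1.0, 2.0]
eps_without_prompt = [0.0, 1.0]
guided = classifier_free_guidance(eps_with_prompt, eps_without_prompt, 3.0)
print(guided)
```

In a real sampler the two predictions would come from the same network, evaluated once with the text encoding and once with an empty prompt, and the guided prediction would replace the plain conditional one at every denoising step.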