Document Details


Clip: Under review as a conference paper at ICLR 2024. SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis. Anonymous authors. Paper under double-blind review. Abstract: We present Stable Diffusion XL (SDXL), a latent diffusion model for text-to-image synthesis. Compared to previous versions of Stable Diffusion, SDXL leverages a three times larger UNet backbone, achieved by significantly increasing the number of attention blocks and including a second text encoder. Further, we design multiple novel conditioning schemes and train SDXL on multiple aspect ratios. To ensure highest quality results, we also introduce a refinement model which is used to improve the visual fidelity of samples generated by SDXL using a post-hoc image-to-image technique. We demonstrate that SDXL improves dramatically over previous versions of Stable Diffusion and achieves results competitive with those
Filename: pdf
Filetype: application/pdf
Size: 16271160 bytes
Uploaded On: 2024-01-22
Visible: 1
Status: Parsed
ModDate: 2023-11-23T12:46:22+01:00
Creator: pdftk-java 3.2.2
CreationDate: 2023-11-23T12:46:22+01:00
Producer: itext-paulo-155 (itextpdf.sf.net-lowagie.com)
Pages: 13
