Document Details

pdf

Download View Text Delete

Clip: Under review as a conference paper at ICLR 2024 SDXL:IMPROVINGLATENTDIFFUSIONMODELS FOR HIGH-RESOLUTIONIMAGESYNTHESIS Anonymous authors Paper under double-blind review ABSTRACT We presentStable Diffusion XL(SDXL), a latent diffusion model for text-to-image synthesis. Compared to previous versions ofStable Diffusion,SDXLleverages a three times larger UNet backbone, achieved by significantly increasing the number of attention blocks and including a second text encoder. Further, we design multiple novel conditioning schemes and trainSDXLon multiple aspect ratios. To ensure highest quality results, we also introduce arefinement modelwhich is used to improve the visual fidelity of samples generated bySDXLusing a post-hoc image-to-imagetechnique. We demonstrate thatSDXLimproves dramatically over previous versions ofStable Diffusionand achieves results competitive with those

Filename: pdf

Filetype: application/pdf

Size: 16271160 bytes

Uploaded On: 2024-01-22

Abstract:

Summary:

Tags:

Notes:

Visible: 1

Status: Parsed

ModDate: 2023-11-23T12:46:22+01:00

Creator: pdftk-java 3.2.2

CreationDate: 2023-11-23T12:46:22+01:00

Producer: itext-paulo-155 (itextpdf.sf.net-lowagie.com)

Pages: 13

Return to Document Library