SDXL IRs and Scripts

SDXL end-to-end benchmarking

Checkout and compile IREE with release build and export PATH=/path/to/iree/build/release/tools:$PATH
Compile the full SDXL model: ./compile-txt2img.sh gfx942 (where gfx942 is the target for MI300X)
Run the benchmark: ./benchmark-txt2img.sh N /path/to/weights/irpa (where N is the GPU index)

Model IRs and weights

Caution

IRs in the following table might be stale. Use the ones in the base_ir/ directory instead.

Note

SDXL-turbo is only different from SDXL in its usage and training/weights. The model architecture (and therefore the weights-stripped MLIR) are equivalent.

Variant	Submodel	MLIR (No Weights) (Config A)	safetensors	Splat IRPA	MLIR (No Weights) (Config B)
SDXL1.0 1024x1024 (f16, BS1, len64)
	UNet + attn	Torch - Linalg	-	-	Azure
	UNet + PNDMScheduler	Azure
	Clip1	Azure	-	-
	Clip2	Azure	-	-
	VAE decode + attn	Azure	-	=	Azure
	VAE encode + attn	[GCloud][sdxl-1-1024x1024-f16-stripped-weight-vae-encode]	Same as decode	-	-
SDXL1.0 1024x1024 (f32, BS1, len64)
	UNet + attn	Azure	Azure	Azure	Azure
	Clip1	Azure	Azure	Azure	-
	Clip2	Azure	Azure	Azure	-
	VAE decode + attn	Azure	Azure	Azure	Azure
SDXL compiled pipeline IRPAs (f16)
	UNet	scheduled_unet_f16.irpa
	Prompt Encoder (CLIP1 + CLIP2)	prompt_encoder_f16.irpa
	VAE	vae_decode_f16.irpa

Name		Name	Last commit message	Last commit date
Latest commit History 261 Commits
.github/workflows		.github/workflows
bitcode-2024-03-07		bitcode-2024-03-07
bitcode-6.1.2		bitcode-6.1.2
docker		docker
fp16-model		fp16-model
int8-model		int8-model
tuning		tuning
validating_accuracy		validating_accuracy
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md
correlator.py		correlator.py
gpu_ids.py		gpu_ids.py
power_trace.sh		power_trace.sh
requirements-dev.txt		requirements-dev.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SDXL IRs and Scripts

SDXL end-to-end benchmarking

Model IRs and weights

About

Releases

Packages

Contributors 17

Languages

License

nod-ai/sdxl-scripts

Folders and files

Latest commit

History

Repository files navigation

SDXL IRs and Scripts

SDXL end-to-end benchmarking

Model IRs and weights

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 17

Languages

Packages