Implementation of the Diffusion Transformer (DiT) model from the paper "Scalable Diffusion Models with Transformers".
See here for the official PyTorch implementation.
Requirements:
- Python 3.8
- TensorFlow 2.12
Use --train_file_pattern=<file_pattern> and --test_file_pattern=<file_pattern> to specify the paths of the training and test datasets. For example:
python ae_train.py --train_file_pattern='./train_dataset_path/*.png' --test_file_pattern='./test_dataset_path/*.png'
Use --file_pattern=<file_pattern> to specify the dataset path.
python ldt_train.py --file_pattern='./dataset_path/*.png'
Note: training the DiT requires a pretrained AutoencoderKL. Set ae_dir and ae_name in the ldt_config.py file to point to it.
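The relevant entries in ldt_config.py might look like the following sketch. The variable names are taken from the note above, but the values are placeholders; check the actual config file for the exact layout.

```python
# ldt_config.py (illustrative fragment -- values are assumptions, not the repo's defaults)
ae_dir = "./ae"        # directory containing the pretrained AutoencoderKL checkpoint
ae_name = "model_1"    # checkpoint name of the AutoencoderKL to load
```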
Use --model_dir=<model_dir> and --ldt_name=<ldt_name> to specify the pre-trained model. For example:
python sample.py --model_dir=ldt --ldt_name=model_1 --diffusion_steps=40
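The sampler combines the cosine schedule and DDIM listed in the implementation notes below. A minimal NumPy sketch of that combination is shown here; the model call is stubbed out, and all function and argument names are illustrative rather than the repo's actual API.

```python
import numpy as np

def cosine_alpha_bar(t, s=0.008):
    """Cumulative signal level at continuous time t in [0, 1] (cosine schedule)."""
    return np.cos((t + s) / (1 + s) * np.pi / 2) ** 2

def ddim_sample(denoise_fn, shape, diffusion_steps=40, seed=0):
    """Deterministic DDIM sampling: walk t from 1 down to 0 in `diffusion_steps` steps."""
    rng = np.random.default_rng(seed)
    x = rng.standard_normal(shape)                     # start from pure noise
    times = np.linspace(1.0, 0.0, diffusion_steps + 1)
    for t_now, t_next in zip(times[:-1], times[1:]):
        # clip to avoid dividing by zero at t = 1, where alpha_bar -> 0
        ab_now = np.clip(cosine_alpha_bar(t_now), 1e-5, 1.0)
        ab_next = np.clip(cosine_alpha_bar(t_next), 1e-5, 1.0)
        eps = denoise_fn(x, t_now)                     # model's noise prediction
        # recover the clean-sample estimate, then re-noise it to the next level
        x0 = (x - np.sqrt(1.0 - ab_now) * eps) / np.sqrt(ab_now)
        x = np.sqrt(ab_next) * x0 + np.sqrt(1.0 - ab_next) * eps
    return x
```

With --diffusion_steps=40 the sampler takes 40 denoising steps from pure noise; fewer steps trade sample quality for speed.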
Adjust hyperparameters in the ae_config.py and ldt_config.py files.
Implementation notes:
- LDT is designed to offer reasonable performance using a single GPU (RTX 3080 Ti).
- LDT largely follows the original DiT model.
- DiT Block with adaLN-Zero.
- Diffusion Transformer with Linformer attention.
- Cosine schedule.
- DDIM sampler.
- FID evaluation.
- AutoencoderKL with PatchGAN discriminator and hinge loss.
- This implementation uses code from the beresandras repo, released under the MIT Licence.
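The adaLN-Zero block listed above conditions each transformer block on the timestep embedding through a learned shift, scale, and gate, with the gate projection zero-initialized so every residual branch starts as the identity. A minimal NumPy sketch of that mechanism (names illustrative, not the repo's API):

```python
import numpy as np

def layer_norm(x, eps=1e-6):
    """Plain layer norm without learned affine parameters."""
    mu = x.mean(axis=-1, keepdims=True)
    var = x.var(axis=-1, keepdims=True)
    return (x - mu) / np.sqrt(var + eps)

def adaln_zero_block(x, cond, sublayer, w_mod):
    """One adaLN-Zero residual branch.

    w_mod maps the conditioning vector to (shift, scale, gate); the columns
    producing `gate` are zero-initialized, so the block computes the identity
    at the start of training.
    """
    shift, scale, gate = np.split(cond @ w_mod, 3, axis=-1)
    h = layer_norm(x) * (1 + scale) + shift    # adaptive layer norm
    return x + gate * sublayer(h)              # zero-initialized gating

# At initialization the gate columns of w_mod are zero,
# so the block output equals its input.
d = 8
w_mod = np.concatenate(
    [np.random.default_rng(0).normal(scale=0.02, size=(d, 2 * d)),
     np.zeros((d, d))], axis=-1)
rng = np.random.default_rng(1)
x, cond = rng.standard_normal((2, d)), rng.standard_normal((2, d))
y = adaln_zero_block(x, cond, sublayer=np.tanh, w_mod=w_mod)
```

Zero-initializing the gate is what makes deep DiT stacks trainable from scratch: every block is initially a no-op, and the network gradually learns how much of each sublayer's output to admit.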
Curated samples from FFHQ
License: MIT

