This is the implementation of STDS, a spatiotemporal diffusion model for satellite imagery.
This repository is a fork of Latte, a transformer-based video diffusion model.
First, download and set up the repo:

```
git clone https://github.com/dfki-av/STDS
cd STDS
```

We provide an `environment.yml` file that can be used to create a Conda environment. If you only want to run pre-trained models locally on CPU, you can remove the `cudatoolkit` and `pytorch-cuda` requirements from the file.

```
conda env create -f environment.yml
conda activate stds
```

You can sample from our pre-trained Latte models with `sample.py`.
To get the best results from the STDS model, we recommend training it on your own dataset.
This model was trained on a dataset collected from Google Earth Engine. Weights for our pre-trained Latte model can be found here. The script has various arguments to adjust the number of sampling steps, change the classifier-free guidance scale, etc.
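For readers unfamiliar with the classifier-free guidance scale mentioned above: it blends the model's conditional and unconditional noise predictions at each sampling step. The following is a minimal illustrative sketch of that formula; the function name and toy inputs are ours, not the repo's actual API.

```python
def classifier_free_guidance(eps_cond, eps_uncond, scale):
    """Blend conditional and unconditional noise predictions.

    scale = 1.0 reproduces the purely conditional prediction; larger
    values push samples more strongly toward the conditioning signal.
    (Illustrative sketch, not the repository's sampling code.)
    """
    return [u + scale * (c - u) for c, u in zip(eps_cond, eps_uncond)]

# Toy example with dummy per-element noise predictions.
eps_cond = [1.0, 2.0]
eps_uncond = [0.0, 0.0]
print(classifier_free_guidance(eps_cond, eps_uncond, 1.0))  # -> [1.0, 2.0]
print(classifier_free_guidance(eps_cond, eps_uncond, 4.0))  # -> [4.0, 8.0]
```

Raising the scale trades sample diversity for stronger adherence to the conditioning input.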
If you would like to measure the quantitative metrics of your generated results, please refer to here.
We provide a training script for Latte in `train.py`. The structure of the datasets can be found here. This script can be used to train class-conditional and unconditional Latte models. To launch Latte (256x256) training with N GPUs on the FaceForensics dataset:
```
torchrun --nnodes=1 --nproc_per_node=N train.py --config ./configs/ffs/ffs_train.yaml
```

Alternatively, if you have a cluster that uses Slurm, you can train Latte's model with the following script:

```
sbatch slurm_scripts/ffs.slurm
```

We also provide a video-image joint training script, `train_with_img.py`. Like `train.py`, it can be used to train class-conditional and unconditional Latte models.
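As background on the `torchrun` launch command above: `torchrun` starts one worker process per GPU and sets environment variables such as `RANK`, `LOCAL_RANK`, and `WORLD_SIZE`, which the training script reads to set up distributed training. The sketch below shows how such variables are typically read; it is illustrative only, not the repo's actual initialization code.

```python
import os

def get_dist_info():
    """Read the environment variables torchrun sets for each worker.

    The defaults fall back to single-process values so the same code
    also runs without torchrun (e.g. a plain `python train.py`).
    (Illustrative sketch, not the repository's initialization code.)
    """
    rank = int(os.environ.get("RANK", 0))              # global rank across all nodes
    local_rank = int(os.environ.get("LOCAL_RANK", 0))  # GPU index on this node
    world_size = int(os.environ.get("WORLD_SIZE", 1))  # total number of worker processes
    return rank, local_rank, world_size

rank, local_rank, world_size = get_dist_info()
print(f"rank={rank} local_rank={local_rank} world_size={world_size}")
```

With `--nproc_per_node=N`, each of the N workers sees a distinct `LOCAL_RANK`, which is typically used to pin the process to one GPU.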
If you are familiar with PyTorch Lightning, you can also use the training scripts `train_pl.py` and `train_with_img_pl.py` provided by @zhang.haojie:

```
python train_pl.py --config ./configs/ffs/ffs_train.yaml
```

or

```
python train_with_img_pl.py --config ./configs/ffs/ffs_img_train.yaml
```

These scripts automatically detect available GPUs and use distributed training.
Contact: Prathap Kashyap (prathapnkashyap@gmail.com)
If you find this work useful for your research, please consider citing it.
@inproceedings{kashyap2025spatiotemporal,
title={Spatiotemporal diffusion model for satellite imagery},
author={Kashyap, Prathap Nagaraj and Javanmardi, Alireza and Jaiswal, Pragati and Reis, Gerd and Pagani, Alain and Stricker, Didier},
booktitle={Eleventh International Conference on Remote Sensing and Geoinformation of the Environment (RSCy2025)},
volume={13816},
pages={376--384},
year={2025},
organization={SPIE}
}

STDS has been greatly inspired by the following amazing works and teams: Latte, DiT, and PixArt-α. We thank all the contributors for open-sourcing their work.
The code and model weights are licensed under LICENSE.