GenAI Technology
A Survey on Generative Modeling with Limited Data, Few Shots, and Zero Shot
Controllable Data Generation by Deep Learning: A Review
Diffusion Models: A Comprehensive Survey of Methods and Applications
Making Images Real Again: A Comprehensive Survey on Deep Image Composition
Synthetic Aperture Radar
Application of deep generative networks for SAR/ISAR: a review
Microwave Vision and Intelligent Perception of Radar Imagery
Language-Guided Diffusion Models for Remote Sensing
Diffusion Models Meet Remote Sensing: Principles, Methods, and Perspectives
Generative Artificial Intelligence Meets Synthetic Aperture Radar: A survey
SARCASTIC v2.0—High-Performance SAR Simulation for Next-Generation ATR Systems
Ray-Tracing Simulation Techniques for Understanding High-Resolution SAR Images
RaySAR - 3D SAR simulator: Now open source
Numerical Simulation of SAR Image for Sea Surface
Synthetic Aperture Radar Image Statistical Modeling: Part One-Single-Pixel Statistical Models
Statistical Modeling of Polarimetric SAR Data: A Survey and Challenges
A Physical Analysis of Polarimetric SAR Data Statistical Models
NeRF + Radar
Radar Fields: An Extension of Radiance Fields to SAR
DART: Implicit Doppler Tomography for Radar Novel View Synthesis
Radar Fields: Frequency-Space Neural Scene Representations for FMCW Radar
ISAR-NeRF: Neural Radiance Fields for 3-D Imaging of Space Target From Multiview ISAR Images
Circular SAR Incoherent 3D Imaging with a NeRF-Inspired Method
RaNeRF: Neural 3-D Reconstruction of Space Targets From ISAR Image Sequences
Physics Meets GenAI in Computer Vision
Physics-Informed Guided Disentanglement in Generative Networks
PhyRecon: Physically Plausible Neural Scene Reconstruction
Physically-aware Generative Network for 3D Shape Modeling
Dynamic ocean inverse modeling based on differentiable rendering
Differentiable Rendering for Synthetic Aperture Radar Imagery
Learning Surface Scattering Parameters From SAR Images Using Differentiable Ray Tracing
Reinforcement Learning for SAR View Angle Inversion with Differentiable SAR Renderer
Differentiable SAR Renderer and Image-Based Target Reconstruction
Model-Based Information Extraction From SAR Images Using Deep Learning
A SAR Target Image Simulation Method With DNN Embedded to Calculate Electromagnetic Reflection
Parameter Extraction Based on Deep Neural Network for SAR Target Simulation
Diffusion models have demonstrated significant potential for remote sensing image generation, spanning both optical and SAR imagery. Existing methods fall broadly into two categories. The first fine-tunes pretrained models: existing diffusion models are adapted to the remote sensing domain through transfer learning on domain-specific data. The second trains end to end on image-text paired data: diffusion models are trained from scratch, without leveraging general-purpose pretrained models, to learn cross-modal generation directly from remote sensing imagery and the corresponding textual descriptions.
This category of methods leverages pretrained diffusion models (such as Stable Diffusion) as the foundation, adapting them to remote sensing image generation through fine-tuning. The generation targets include optical image synthesis and cross-modal generation from optical to SAR imagery. Compared to models trained from scratch, these approaches use efficient fine-tuning techniques (such as LoRA or ControlNet) to adapt quickly to remote sensing data, offering better generalizability and computational efficiency (a minimal LoRA fine-tuning sketch follows the list below):
- Some methods employ LoRA for fine-tuning on text-image paired datasets, enabling text-controlled optical image generation;
- Others incorporate ControlNet, using conditional inputs such as optical images, edge maps, or semantic segmentation maps to achieve cross-modal generation (e.g., SAR images) or structured optical image synthesis;
- Certain methods further fine-tune adapters on task-specific datasets to enhance generation accuracy for particular applications.
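A minimal sketch of the LoRA fine-tuning recipe, assuming the Hugging Face diffusers and peft libraries with an illustrative Stable Diffusion v1.5 checkpoint; the dataset handling, LoRA rank, learning rate, and target modules are assumptions for illustration, not settings taken from the papers below:

```python
import torch
import torch.nn.functional as F
from diffusers import StableDiffusionPipeline, DDPMScheduler
from peft import LoraConfig

# Load a pretrained text-to-image pipeline (illustrative checkpoint).
model_id = "runwayml/stable-diffusion-v1-5"
pipe = StableDiffusionPipeline.from_pretrained(model_id)
unet, vae, text_encoder = pipe.unet, pipe.vae, pipe.text_encoder
noise_scheduler = DDPMScheduler.from_pretrained(model_id, subfolder="scheduler")

# Freeze the base UNet and inject low-rank adapters into its attention
# projections; only the adapter weights are trained.
unet.requires_grad_(False)
unet.add_adapter(LoraConfig(r=8, lora_alpha=8,
                            target_modules=["to_q", "to_k", "to_v", "to_out.0"]))
optimizer = torch.optim.AdamW(
    [p for p in unet.parameters() if p.requires_grad], lr=1e-4)

def training_step(pixel_values, input_ids):
    """One denoising step on a batch of (remote sensing image, caption) pairs."""
    latents = vae.encode(pixel_values).latent_dist.sample()
    latents = latents * vae.config.scaling_factor
    noise = torch.randn_like(latents)
    t = torch.randint(0, noise_scheduler.config.num_train_timesteps,
                      (latents.shape[0],), device=latents.device)
    noisy_latents = noise_scheduler.add_noise(latents, noise, t)
    text_emb = text_encoder(input_ids)[0]
    # Predict the added noise and regress it with an MSE loss.
    pred = unet(noisy_latents, t, encoder_hidden_states=text_emb).sample
    loss = F.mse_loss(pred, noise)
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()
    return loss
```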
Diffusion-Geo: A Two-Stage Controllable Text-To-Image Generative Model for Remote Sensing Scenarios
CRS-Diff: Controllable Remote Sensing Image Generation With Diffusion Model
DiffusionSat: A Generative Foundation Model for Satellite Imagery
This category of methods trains diffusion models directly on large-scale image-text paired datasets, primarily enabling text-controlled optical image generation, with some approaches further supporting cross-modal generation from optical to SAR images. Compared to fine-tuning pretrained models, these methods emphasize data-driven model construction and deep fusion of textual and visual content (a sketch of the metadata-conditioning idea follows the list below):
- Utilizing large-scale image-text paired datasets to train diffusion models from scratch, effectively integrating textual information with metadata or temporal embeddings to achieve highly controllable optical image generation;
- A few methods incorporate techniques such as ControlNet to enable cross-modal generation based on text control, extending to SAR image synthesis.
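For illustration, a hedged PyTorch sketch of the metadata/temporal embedding idea mentioned above: scalar metadata fields (e.g., timestamp, latitude, longitude, ground sample distance) are embedded and added to the diffusion timestep embedding. The module name, choice of fields, and dimensions are assumptions, not the design of any specific model cited here.

```python
import torch
import torch.nn as nn

class MetadataEmbedder(nn.Module):
    """Embed normalized scalar metadata into the timestep-embedding space."""

    def __init__(self, n_fields: int = 4, dim: int = 256):
        super().__init__()
        # One small MLP per scalar field; the outputs are summed.
        self.mlps = nn.ModuleList(
            nn.Sequential(nn.Linear(1, dim), nn.SiLU(), nn.Linear(dim, dim))
            for _ in range(n_fields))

    def forward(self, metadata: torch.Tensor) -> torch.Tensor:
        # metadata: (batch, n_fields), each field normalized to [0, 1].
        return sum(mlp(metadata[:, i:i + 1]) for i, mlp in enumerate(self.mlps))

# During training the result is added to the sinusoidal timestep embedding
# before it is broadcast into the UNet residual blocks:
#   emb = timestep_embedding(t) + metadata_embedder(metadata)
```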
MetaEarth: A Generative Foundation Model for Global-Scale Remote Sensing Image Generation
The moving and stationary target acquisition and recognition (MSTAR) dataset
The Synthetic and Measured Paired and Labeled Experiment (SAMPLE) dataset
SEN1-2: The SEN1-2 Dataset for Deep Learning in SAR-Optical Data Fusion
SAR2Opt: A Comparative Analysis of GAN-Based Methods for SAR-to-Optical Image Translation
QXS-SAROPT: The QXS-SAROPT Dataset for Deep Learning in SAR-Optical Data Fusion
We provide several GAN-based baseline models for multi-view SAR target image generation under limited observation angles. The source code can be found in ./GAN.
The baseline models are based on ACGAN (Auxiliary Classifier GAN), utilizing class labels and azimuth angle information as conditional inputs (a sketch of this conditioning follows the list below):
- SNGAN (Spectral Normalization for Generative Adversarial Networks)
- LSGAN (Least Squares Generative Adversarial Networks)
- DRAGAN (On Convergence and Stability of GANs)
- WGAN-GP (Improved Training of Wasserstein GANs)
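A minimal PyTorch sketch of the conditional input construction described above: the generator consumes a noise vector concatenated with a class embedding and a cyclic encoding of the azimuth angle. The module name, dimensions, and the (sin, cos) angle encoding are illustrative assumptions, not the repository's exact implementation.

```python
import torch
import torch.nn as nn

class ConditionalInput(nn.Module):
    """Build the ACGAN-style generator input from noise, label, and angle."""

    def __init__(self, n_classes: int = 10, noise_dim: int = 100, embed_dim: int = 16):
        super().__init__()
        self.class_embed = nn.Embedding(n_classes, embed_dim)
        self.out_dim = noise_dim + embed_dim + 2  # +2 for (sin, cos) of azimuth

    def forward(self, noise, labels, azimuth_deg):
        # Encode the azimuth cyclically so that 0° and 360° coincide.
        rad = torch.deg2rad(azimuth_deg)
        angle = torch.stack([torch.sin(rad), torch.cos(rad)], dim=1)
        return torch.cat([noise, self.class_embed(labels), angle], dim=1)

# Example: a batch of 32 inputs for a 10-class generator.
# z = torch.randn(32, 100)
# y = torch.randint(0, 10, (32,))
# az = torch.rand(32) * 360
# g_in = ConditionalInput()(z, y, az)  # shape (32, 118)
```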
The MSTAR dataset is used in the experiments. It contains ten classes of vehicles, with azimuth angles ranging from 0° to 360°.
To train a GAN model, run the following command:
python train.py \
    --bs 32 \
    --lrg 0.0001 \
    --lrd 0.0001 \
    --num_epochs 500 \
    --save_dir ${SAVE_PATH}
- bs is the batch size; lrg and lrd are the learning rates of the generator and the discriminator, respectively
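For reference, a hedged sketch of the gradient penalty behind the WGAN-GP baseline listed above (Improved Training of Wasserstein GANs), which is added to the discriminator (critic) loss at each discriminator update. The discriminator interface is an assumption: here it maps images to one adversarial score per image; an ACGAN-style discriminator would additionally return class logits, which the penalty ignores.

```python
import torch

def gradient_penalty(discriminator, real, fake, lambda_gp=10.0):
    """WGAN-GP penalty: keep the critic's gradient norm near 1."""
    # Random interpolation between real and generated samples.
    eps = torch.rand(real.size(0), 1, 1, 1, device=real.device)
    interp = (eps * real + (1.0 - eps) * fake).requires_grad_(True)
    scores = discriminator(interp)
    # Gradient of the critic output w.r.t. the interpolated images.
    grads = torch.autograd.grad(outputs=scores.sum(), inputs=interp,
                                create_graph=True)[0]
    grad_norm = grads.flatten(1).norm(2, dim=1)
    return lambda_gp * ((grad_norm - 1.0) ** 2).mean()
```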
After the training stage, run the following command to generate SAR target images with the given class label and azimuth angle information, corresponding to a 15° depression angle.
python generate.py