CelebAMask-HQ++ and SemanticStyleGAN inversion manipulation

^{This is the main repository for the IBB project on FRI. The repository was created for a project that was part of Image Based Biometry course at the University of Ljubljana, Faculty for computer and information science.}

To use this repository as intended you should have a NVIDIA GPU with appropriate NVIDIA driver and CUDA versions that are compatible with PyTorch and Tensorflow.

More instructions on how to use the repository can be found in the main branch of the repository.

Abstract

In this work, we present the extended version of the already existing CelebAMask-HQ dataset with images and corresponding segmentation masks, which allow for even more fine-grained control of the structure and texture of the facial region, more specifically, glasses. The extended version of the dataset, called CelebAMask-HQ++, adds manually annotated semantic masks of glasses lenses, glasses types, and glasses landmarks. In total, 1548 images of people with glasses have been updated with a segmentation mask, where the previous ‘eyeglasses’ has now been extended to ‘glasses frames’ and ‘glasses lenses’. Additionally, all the images of glasses were annotated with glasses landmarks and glasses types. Finally, we explored and found better optimization schemes for embedding in SemanticStyleGAN latent space with the help of segmentation masks to get noticeably better segmentation masks and image embeddings, which yielded better results for downstream tasks like style transfer.

New annotations

Example of new landmark annotations, blended with segmentation masks and the original image Inversion results of an image, with regular optimization and with added segmentation mask, as well as the generator trained only with glasses images from the updated dataset.

Example of new segmentation maps

Processed segmentation maps with erosion

Improved inversion

Improved inversion of both image and segmentation mask

Improved generators with the new dataset

Improved style mixing

Style mixing before

Style mixing after

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
LICENSES		LICENSES
assets		assets
criteria		criteria
data		data
docs		docs
models		models
utils		utils
visualize		visualize
.gitignore		.gitignore
README.md		README.md
calc_fid.py		calc_fid.py
copy_images_from_results.py		copy_images_from_results.py
prepare_image_data.py		prepare_image_data.py
prepare_inception.py		prepare_inception.py
prepare_mask_data.py		prepare_mask_data.py
requirements.txt		requirements.txt
train.py		train.py
train_adaptation.py		train_adaptation.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CelebAMask-HQ++ and SemanticStyleGAN inversion manipulation

Abstract

New annotations

Improved inversion

Improved style mixing

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

CelebAMask-HQ++ and SemanticStyleGAN inversion manipulation

Abstract

New annotations

Improved inversion

Improved style mixing

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages