AvatarGAN

A conditional Generative Adversarial Network (cGAN) that generates cartoon avatars conditioned on discrete facial attributes. Built on the CartoonSet dataset.

Overview

AvatarGAN learns to generate 128×128 RGB cartoon faces that match a given combination of facial attributes. Both the Generator and the Discriminator receive attribute embeddings alongside the image/noise input, which guides the model towards attribute-consistent outputs.

Architecture

Component	Description
AttributeBlock	Per-attribute two-layer MLP that refines each embedding before it is concatenated
Generator	Latent vector `z` + attribute embeddings → fully-connected layers → 128×128 RGB image (Tanh output)
Discriminator	Flattened image + L2-normalised attribute embeddings → fully-connected layers → real/fake score (Sigmoid)

Attributes

Four attributes are used, each encoded as a learned embedding:

Attribute	Variants
`facial_hair`	15
`hair`	111
`face_color`	11
`hair_color`	10

Hyperparameters

Parameter	Value
Image size	128 × 128
Latent dimension	128
Embedding dimension	64
Batch size	64
Epochs	360
Generator LR	0.0002
Discriminator LR	0.0001
Optimizer	Adam (β₁=0.5, β₂=0.999)
Loss	Binary Cross-Entropy

Dataset

CartoonSet10k — 10,000 cartoon avatar images, each paired with a CSV file describing its visual attributes. Place the dataset at ../cartoonset10k relative to the notebook.

cartoonset10k/
├── 00000.png
├── 00000.csv
├── 00001.png
├── 00001.csv
└── ...

Requirements

torch
torchvision
Pillow
matplotlib

Install with:

pip install torch torchvision Pillow matplotlib

Usage

Open and run avatargan.ipynb in Jupyter. The notebook is organised into the following sections:

Imports and Configuration — libraries and global hyperparameters
Dataset — CustomImageDataset with CSV attribute loading
Model Architecture — AttributeBlock, Generator, Discriminator
Dataset and Model Setup — dataloader, model initialisation, optimizers
Helper Functions — checkpoint saving, fixed sample preparation
Training — main training loop with per-epoch visualisation

Checkpoints are saved every 20 epochs to the models4/ directory.

Training Progress

Every epoch a side-by-side comparison of 6 fixed original images and their generated counterparts is displayed, allowing visual tracking of generation quality over time.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
avatargan.ipynb		avatargan.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AvatarGAN

Overview

Architecture

Attributes

Hyperparameters

Dataset

Requirements

Usage

Training Progress

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AvatarGAN

Overview

Architecture

Attributes

Hyperparameters

Dataset

Requirements

Usage

Training Progress

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages