This repository contains our reproduction and extension of the BDQ encoder from the paper Privacy-Preserving Action Recognition via Motion Difference Quantization [1]. We replicate the results on the KTH dataset [2] and additionally evaluate on the IXMAS dataset [3].
We implemented the BDQ encoder, which consists of three sequential modules: Blur, Difference, and Quantization. The encoder is trained adversarially to retain action features while suppressing privacy-sensitive ones.
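Conceptually, the three stages can be sketched as follows. This is an illustrative NumPy toy, not the repository's implementation: the fixed box blur and hard-coded bin edges stand in for the paper's learned blur and learned quantization parameters.

```python
# Toy sketch of the Blur -> Difference -> Quantization pipeline on a clip
# of shape (T, H, W). All parameters here are illustrative placeholders.
import numpy as np

def blur(clip, k=3):
    """Spatial box blur applied frame by frame (stand-in for a learned blur)."""
    pad = k // 2
    padded = np.pad(clip, ((0, 0), (pad, pad), (pad, pad)), mode="edge")
    out = np.zeros_like(clip, dtype=float)
    for dy in range(k):
        for dx in range(k):
            out += padded[:, dy:dy + clip.shape[1], dx:dx + clip.shape[2]]
    return out / (k * k)

def difference(clip):
    """Temporal difference of consecutive frames, suppressing static appearance."""
    return clip[1:] - clip[:-1]

def quantize(diff, bins=np.array([-0.5, 0.5])):
    """Map each difference value to a discrete bin index (bins would be learned)."""
    return np.digitize(diff, bins)

clip = np.random.rand(8, 32, 32)           # T=8 frames of 32x32
encoded = quantize(difference(blur(clip)))
print(encoded.shape)                        # (7, 32, 32): one fewer frame after differencing
```

The differencing step is what removes most static identity cues; quantization then discards the remaining fine-grained intensity information while keeping coarse motion.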
- `bdq_encoder/BDQ.py`: high-level wrapper for the BDQ encoder
- `BDQ_disabled`: a passthrough version of the encoder that disables all BDQ transformations, used for ablation or baseline comparisons
- `blur.py`, `difference.py`, `quantization.py`: module-level definitions
- Raw datasets available at:
`datasets/`
- `datasets/KTH`: video files and `00sequences.txt` (metadata)
- `datasets/IXAMS`: selected subset of IXMAS videos
- `datasets/util_kth.py`: action label map for the KTH dataset
- `datasets/ixmas_clips_6.json`, `datasets/kth_clips.json`: structured clip metadata generated by the parsers
`notebooks/`
- `notebooks/bdq-encoder.ipynb`: trains and evaluates the full BDQ encoder
- `notebooks/bdq-no-encoder.ipynb`: baseline experiment without the BDQ encoder
- `preprocess.py`: preprocessing pipeline
- `raw_dataset_preprocess/`: dataset-specific preprocessing tools
- `raw_dataset_preprocess/KTH_preprocess/`
- `.../KTH_preprocess/kth_parser.py`: parses `00sequences.txt` into structured KTH clip metadata
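For orientation, the core of what `kth_parser.py` does can be sketched as below, assuming the standard `00sequences.txt` line format (`person01_boxing_d1  frames  1-95, 96-185, ...`); the repository's parser and the exact `kth_clips.json` schema may differ in detail.

```python
# Hypothetical sketch of parsing one 00sequences.txt line into clip metadata;
# not the repository's kth_parser.py.
import re

def parse_sequence_line(line):
    """Parse e.g. 'person01_boxing_d1  frames  1-95, 96-185' into a metadata dict."""
    m = re.match(r"(person\d+)_(\w+?)_(d\d+)\s+frames\s+(.*)", line.strip())
    if not m:
        return None
    subject, action, scenario, ranges = m.groups()
    clips = [tuple(map(int, r.split("-"))) for r in ranges.split(",") if r.strip()]
    return {"subject": subject, "action": action,
            "scenario": scenario, "clips": clips}

meta = parse_sequence_line("person01_boxing_d1\t\tframes\t1-95, 96-185, 186-245, 246-360")
print(meta["clips"])  # [(1, 95), (96, 185), (186, 245), (246, 360)]
```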
- `raw_dataset_preprocess/IXMAS_preprocess/`
- `.../IXMAS_preprocess/IXMAS_720`: 720 manually selected representative frames (10 subjects × 12 classes × 2 viewpoints × 3 takes)
- `.../IXMAS_preprocess/IXMAS_utils/`
- `.../ixmas_extract_frame.py`: extracts and saves a representative frame from each IXMAS video
- `.../IXMAS_utils/ixmas_extract_vid.py`: locates and copies the videos matching the selected frame names (e.g., for `IXMAS_720/`)
- `.../IXMAS_utils/ixmas_parser.py`: generates clip metadata from IXMAS video filenames
- `checkpoint_25.tar`: model weights saved after the 25th epoch of training on the KTH dataset. Because of GitHub's file-size limit, the file had to be split and must be reassembled from its pieces with `cat checkpoint_25_part_* > checkpoint_25.tar`
- `loss.py`: loss functions for adversarial training
- `training.py`: training and validation pipeline
- `training_no_encoder.py`: an identical pipeline that bypasses the BDQ transformations, used for ablation or baseline comparisons
- `action_recognition_model.py`: 3D ResNet-based action classifier
- `privacy_attribute_prediction_model.py`: 2D ResNet-based identity classifier
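On systems without `cat` (e.g., Windows), the same reassembly can be done with a short stdlib-only Python snippet; the function name here is illustrative.

```python
# Cross-platform equivalent of `cat checkpoint_25_part_* > checkpoint_25.tar`.
import glob
import shutil

def reassemble(pattern="checkpoint_25_part_*", out="checkpoint_25.tar"):
    """Concatenate the split checkpoint pieces in sorted filename order."""
    parts = sorted(glob.glob(pattern))
    with open(out, "wb") as dst:
        for part in parts:
            with open(part, "rb") as src:
                shutil.copyfileobj(src, dst)
    return parts  # the pieces that were joined, in order

# reassemble()  # run from the repository root after cloning
```

Sorting the matched filenames reproduces the order `cat` uses for the shell glob, so the pieces are concatenated correctly.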
- `visualization/`: logs and scripts for plotting results and evaluating BDQ performance
- `logs/`: TensorBoard-compatible log directories containing training and validation metrics
- `pics/`: generated plots
- `quantization_steps.py`: saves the initialized and learned quantization bin values during training
- `figure3_row1.py`: extracts final accuracy metrics from the TensorBoard logs and plots the utility–privacy trade-off curve (Figure 3, row 1)
- `figure3_row2.py`: visualizes the quantization steps using the saved bin values (Figure 3, row 2)
- `train_val_acc.py`: plots training and validation accuracy curves over epochs for action and privacy prediction
- `val_acc_comp.py`: compares validation accuracy between model variants (e.g., with and without BDQ)
- Generate a GitHub Personal Access Token (PAT).
- Upload the notebook from `/notebooks` to Kaggle.
- Add the token as a Kaggle secret in your notebook: `Add-ons` -> `Secrets` -> `Add Secret`.
- Access the secret in the notebook by replacing the placeholder:

  ```python
  user_secrets = UserSecretsClient()
  token = user_secrets.get_secret("your_secret_name")  # replace with the actual name
  ```

- Specify the dataset to run:

  ```python
  %cd /kaggle/working/frmdl25-6/
  from training import main
  main('kth')
  ```

- Enable GPU: `Settings` -> `Accelerator` -> select `GPU P100`.
Note: Training may be slow without a CUDA-enabled GPU.
- Install requirements:

  ```bash
  pip install -r requirements.txt
  ```

- Adversarial training:

  ```bash
  python training.py --dataset kth
  # or
  python training.py --dataset ixmas
  ```

To replicate the two subplots in Figure 3 of the original BDQ paper from the saved logs, run:

```bash
python visualization/figure3_row1.py
python visualization/figure3_row2.py
```

These scripts read the saved log files and generate the corresponding plots in `visualization/pics/`.
Below are some sample outputs of the BDQ encoder after training for 25 epochs on the KTH dataset (see checkpoint_25.tar), which demonstrate the transformations applied to the input video frames:
[1] S. Kumawat and H. Nagahara, “Privacy-Preserving Action Recognition via Motion Difference Quantization,” Aug. 2022.
[2] C. Schuldt, I. Laptev, and B. Caputo, “Recognizing human actions: A local SVM approach,” in Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004., (Cambridge, UK), pp. 32–36 Vol.3, IEEE, 2004.
[3] D. Weinland, R. Ronfard, and E. Boyer, “Free viewpoint action recognition using motion history volumes,” Computer Vision and Image Understanding, vol. 104, pp. 249–257, Nov. 2006.

