Xiaoyu Ye1, Jingjing Yu2*, Jungang Li1, Yiwen Zhao1
1Peking University 2Capital University of Economics and Business
* Corresponding Author
📑 Paper (Applied Soft Computing 2025) | 📂 Dataset (AdvDocument) | 🏠 Project Page
This repository provides the official implementation of CMAT, a cross-model adversarial texture designed to protect scanned document privacy by degrading OCR text detection performance while maintaining human readability.
If you find this work useful, please consider giving us a ⭐ to support our research.
Our Cross-Model Adversarial Texture (CMAT) generation process consists of three steps: (1) Preparation (dataset processing & attack initialization); (2) Cross-Model Attack (toroidal cropping + weighted loss fusion across detectors); (3) Protection (applying CMAT to documents so that detectors fail to locate text).
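For intuition, the sketch below shows one way step (2) can be realized: a tile is cropped from the texture with wrap-around (so every crop remains a seamless tile of the repeating pattern) and the attack losses of several text detectors are fused with fixed weights. This is a minimal illustration only; the blending helper and the detector-loss interface are assumptions, not the repository's actual code.

import torch

def toroidal_crop(texture, crop_h, crop_w):
    # Treat the texture as a torus: roll by a random offset, then take the
    # top-left crop, so every crop is still a valid tile of the pattern.
    _, h, w = texture.shape
    dy, dx = int(torch.randint(h, (1,))), int(torch.randint(w, (1,)))
    rolled = torch.roll(texture, shifts=(dy, dx), dims=(1, 2))
    return rolled[:, :crop_h, :crop_w]

def tile_onto(doc, crop, alpha=0.3):
    # Tile the crop to the document size and blend it in as a background
    # (a simple stand-in for the paper's compositing, not the exact scheme).
    _, H, W = doc.shape
    reps_h, reps_w = -(-H // crop.shape[1]), -(-W // crop.shape[2])
    tiled = crop.repeat(1, reps_h, reps_w)[:, :H, :W]
    return (1 - alpha) * doc + alpha * tiled

def cross_model_loss(texture, doc, detectors, weights, crop_size=256):
    # Weighted fusion of per-detector attack losses. Each detector is assumed
    # to be a callable returning a scalar loss that drops as text detection fails.
    adv = tile_onto(doc, toroidal_crop(texture, crop_size, crop_size))
    return sum(w * det(adv) for det, w in zip(detectors, weights))

The weights here play the role of the WEIGHT list in configs/parallel.yaml, which the NNI search described below can tune automatically.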
conda create -n text-attack python=3.7
conda activate text-attack
pip install -r requirements.txt
pip install -U openmim
mim install mmcv-full
pip install mmdet
cd detlib/mmocr
pip install -r requirements.txt
pip install -v -e .

Then replace:
lib/python3.7/site-packages/mmdet/models/detectors/base.py
with:
./base.py (provided in the repository)
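If you prefer not to locate the site-packages path by hand, the snippet below finds the installed copy of base.py and overwrites it with the patched file from this repository. It is only a convenience sketch, not part of the official setup instructions.

# Convenience sketch: locate mmdet's installed base.py and replace it with
# the patched ./base.py shipped in this repository.
import os
import shutil

import mmdet

target = os.path.join(os.path.dirname(mmdet.__file__), "models", "detectors", "base.py")
print("Overwriting", target)
shutil.copyfile("./base.py", target)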
pip install Image
pip install jupyter
# If PIL errors occur:
pip uninstall pillow
pip install "pillow<7"Edit configs/parallel.yaml:
DETECTOR:
  NAME: ["PS_IC15"] # ,"PS_CTW","PANET_IC15","PANET_CTW"]
  WEIGHT: [1.0, 1.0, 1.0]

Then run:

CUDA_VISIBLE_DEVICES=0 nohup python train_optim_text.py \
-cfg=parallel.yaml -s=./results/crop \
-np > ./results/crop.log 2>&1 &

Outputs are stored in:
results/crop.log
results/crop/ (perturbation)
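To apply the learned texture to a page outside the training pipeline, you can tile it over the scan. The snippet below is illustrative only: the file name perturbation.png and the plain alpha blend are assumptions, not the exact output format written by the training script.

# Illustrative only: file name and blending are assumptions, not the exact
# output format of train_optim_text.py.
import numpy as np
from PIL import Image

texture = np.asarray(Image.open("results/crop/perturbation.png").convert("RGB"), dtype=np.float32) / 255.0
doc = np.asarray(Image.open("scanned_page.png").convert("RGB"), dtype=np.float32) / 255.0

# Tile the texture to cover the page, then blend it into the background.
th, tw = texture.shape[:2]
H, W = doc.shape[:2]
tiled = np.tile(texture, ((H + th - 1) // th, (W + tw - 1) // tw, 1))[:H, :W]
protected = np.clip(0.7 * doc + 0.3 * tiled, 0.0, 1.0)
Image.fromarray((protected * 255).astype(np.uint8)).save("protected_page.png")

The cross-model parallel variant is launched analogously and saves its outputs under ./results/parallel: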
CUDA_VISIBLE_DEVICES=0 nohup python train_parallel_text.py \
-cfg=parallel.yaml -s=./results/parallel \
-np > ./results/parallel.log 2>&1 &

To search the detector loss weights with NNI, run:

nnictl create --config ./nni_config.yaml

NNI automatically explores detector loss-weight configurations and reports the optimal settings.
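For orientation, an NNI trial script typically pulls the sampled weights with nni.get_next_parameter() and reports a metric back to the tuner. The parameter names, the helper, and the hmean metric below are illustrative assumptions, not the repository's actual search space.

import nni

# Illustrative trial-side NNI usage; parameter names and the helper are assumptions.
params = nni.get_next_parameter()                 # e.g. {"w_psenet": 0.8, "w_panet": 1.2}
weights = [params["w_psenet"], params["w_panet"]]

hmean = run_attack_and_evaluate(weights)          # hypothetical helper: train texture, then evaluate hmean-iou

# A lower detection hmean means stronger protection; the tuner uses the
# reported value to choose the next weight configuration.
nni.report_final_result(hmean)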
To set up the evaluation dataset, modify:
detlib/mmocr/configs/_base_/det_datasets/icdar2015.py
Set dataset paths and update:
test = dict(
    type=dataset_type,
    ann_file=f'{data_root}/[json file name]',
    img_prefix=f'{data_root}/[Image dir name]',
    pipeline=None
)

Run the evaluation:
CUDA_VISIBLE_DEVICES=0 python detlib/mmocr/tools/test_attack.py \
detlib/mmocr/configs/textdet/psenet/psenet_r50_fpnf_600e_icdar2015_adv.py \
https://download.openmmlab.com/mmocr/textdet/psenet/psenet_r50_fpnf_600e_icdar2015_pretrain-eefd8fe6.pth \
--eval hmean-iou \
--perturbation [perturbation file path] \
--show --show-dir [output directory]

We release the AdvDocument dataset used in our experiments (COCO format).
Download:
https://drive.google.com/file/d/1r4scB0HvLdz1NLinWI0KClqyyF8UrWnN/view?usp=drive_link
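Since the annotations are in COCO format, they can be inspected with pycocotools. The annotation file name below is an assumption; substitute the one included in the download.

# Quick sanity check of the COCO-format annotations (file name is an assumption).
from pycocotools.coco import COCO

coco = COCO("AdvDocument/annotations/instances_test.json")
img_ids = coco.getImgIds()
print(f"{len(img_ids)} images, {len(coco.getAnnIds())} text-region annotations")

first = coco.loadImgs(img_ids[0])[0]
anns = coco.loadAnns(coco.getAnnIds(imgIds=first["id"]))
print(first["file_name"], "->", len(anns), "annotated regions")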
If you use CMAT or AdvDocument in your research, please cite:
@article{YE2025114353,
  title    = {CMAT: A cross-model adversarial texture for scanned document privacy protection},
  journal  = {Applied Soft Computing},
  pages    = {114353},
  year     = {2025},
  issn     = {1568-4946},
  doi      = {10.1016/j.asoc.2025.114353},
  url      = {https://www.sciencedirect.com/science/article/pii/S1568494625016667},
  author   = {Xiaoyu Ye and Jingjing Yu and Jungang Li and Yiwen Zhao},
  keywords = {Document active privacy protection, Adversarial texture, Cross-model texture},
}
