BYU ML Lab Deep Integration of LM into HWR

Aanaconda

After logging in, install Anaconda 3:

cd /tmp
curl -O https://repo.anaconda.com/archive/Anaconda3-2019.03-Linux-x86_64.sh
bash Anaconda3-5.2.0-Linux-x86_64.sh

Environment

Create the environment defined in environment.yaml.

conda env create -f environment.yaml --name hwr
conda activate hwr

Configuration

All configurations are stored in the config folder as .yaml files.

Execution

Downloading/Preparing Datasets

Ensure that you have an IAM Handwriting Database access account (register), and IAM On-Line Handwriting Database access account (register), then:

cd data
./generate-all-datasets.sh

For the first IAM prompt, use your username and password for IAM Handwriting DB, then for the second IAM prompt, use your username and password for IAM On-Line Handwriting DB. This script should download/extract/setup the IAM data.

Prepping / Resampling Data

Run cd ./data_processing/online_coordinate_data && python create_dataset.py to re-format data. The training scripts expect data to be in the format that is output by this script. ** TO DO: Steamline / simplify this step, or have it done by the dataloader.

Trajectory Recovery

Modifying/updating the config files

To use existing config options, just modify the config file directly. See config/DEBUG.yaml for an example configuaration with some descriptions (though it's not guaranteed to work). The example_weights/example.conf is working with the model weights in the example_weights folder. To add new options:

Add option to a config file
Modify ./hwr_utils/stroke_dataset.py.py class to accept new option
Modify train_stroke_recovery.py to read the option from the config file and pass to StrokeRecoveryDataset class
Modify hwr_utils.py at defaults to include a default parameter in case a config file does not specify your new option.

Training

Once the data is downloaded and the environment setup, setup a config file. You should then be able to train the model:

python train_stroke_recovery.py --config PATH_TO_CONFIG

Evaluation

An example config with a model and weights can be run for offline data (though you may need to configure where your offline data is within the script).

python stroke_recovery_offline.py

Also see python stroke_recovery_online.py, which is similar but for online data.

Handwriting Recognition

Modifying/updating the config files

To use existing config options, just modify the config file directly. To add new options:

Add option to a config file
Modify hw_dataset.py class to accept new option
Modify train.py to read the option from the config file and pass to HwDataset class
Modify hwr_utils.py at defaults to include a default parameter in case a config file does not specify your new option.

Train

To train, run train.py with one of the configurations found in the configs folder. For example:

python train.py --config ./configs/baseline.yaml

Recognize

python recognize.py sample_config.json prepare_font_data/output/0.png

or

python recognize.py sample_config_iam.json prepare_IAM_Lines/lines/r06/r06-000/r06-000-00.png

Fulton Super Computer Prerequisites

If you are a BYU student, consider requesting access to the supercomputer. Sign up here.

Next, request group access from Taylor Archibald.

Name		Name	Last commit message	Last commit date
Latest commit History 326 Commits
autoencoder		autoencoder
configs		configs
data_processing		data_processing
dependencies		dependencies
example_weights		example_weights
hwr_utils		hwr_utils
image_transforms		image_transforms
ipynb		ipynb
lm		lm
loss_module		loss_module
models		models
notes		notes
online_coordinate_data/8_stroke_vSmall_16/stroke_cached		online_coordinate_data/8_stroke_vSmall_16/stroke_cached
recognition		recognition
recognition_online		recognition_online
reference		reference
renderer		renderer
renderer2		renderer2
scripts		scripts
server		server
slurm_scripts		slurm_scripts
soft-dtw		soft-dtw
synthesis		synthesis
test		test
transformer		transformer
.gitignore		.gitignore
COMMANDS.sh		COMMANDS.sh
README.md		README.md
RESULTS		RESULTS
TheIAM-database.pdf		TheIAM-database.pdf
environment.yaml		environment.yaml
gen_online_test.py		gen_online_test.py
recognize.py		recognize.py
stroke_recovery_offline.py		stroke_recovery_offline.py
stroke_recovery_online.py		stroke_recovery_online.py
train_stroke_recovery.py		train_stroke_recovery.py
trainers.py		trainers.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

BYU ML Lab Deep Integration of LM into HWR

Aanaconda

Environment

Configuration

Execution

Downloading/Preparing Datasets

Prepping / Resampling Data

Trajectory Recovery

Modifying/updating the config files

Training

Evaluation

Handwriting Recognition

Modifying/updating the config files

Train

Recognize

Fulton Super Computer Prerequisites

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Tahlor/simple_hwr

Folders and files

Latest commit

History

Repository files navigation

BYU ML Lab Deep Integration of LM into HWR

Aanaconda

Environment

Configuration

Execution

Downloading/Preparing Datasets

Prepping / Resampling Data

Trajectory Recovery

Modifying/updating the config files

Training

Evaluation

Handwriting Recognition

Modifying/updating the config files

Train

Recognize

Fulton Super Computer Prerequisites

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages