This project applies and compares several optimizations that improve the inference performance of machine learning models on CPUs.
The results of the experiments are stored in the results folder. Make sure you create this folder before running the experiments.
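A minimal sketch of that setup step, assuming the results folder is expected at the repository root (the directory from which the experiment commands are run). The location is an assumption, not something the project states:

```shell
# Assumption: run from the repository root; adjust the path if the scripts
# expect the results folder somewhere else.
mkdir -p results
```

The -p flag makes the command safe to repeat: it does nothing if the folder already exists.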
The Terraform code that creates the infrastructure needed to reproduce the experiments can be found in the infrastructure folder.
Finally, run the experiments with:
python cpu_inference_workspace/bert_experiments.py --ds-size 1000
and
python cpu_inference_workspace/resnet_experiments.py --ds-size 1000