mmankos/onnx_ocr
C++ OCR Pipeline utilizing ONNX

C++ OCR pipeline utilizing ONNX Runtime for optimized inference. Designed to replace the official high-level language implementations with improved textbox merging.

(Demo video: det+cls+rec.mp4)

TODO

  • Character Detection
  • Directional Classifier
  • Character Recognition
  • Dockerize

Setup

Pull PaddlePaddle docker image

Check the PaddlePaddle website for the current version. At the time of writing it was paddlepaddle/paddle:3.2.0.

If you are outside of China, do not use the first recommended images (the ones containing cnc.bj.baidubce.com), as they download more slowly than the official Docker Hub version.

docker pull paddlepaddle/paddle:3.2.0

Build and enter Docker container

Since we won't need the container after exporting the ONNX models, we pass the --rm flag to remove the container automatically on exit.

If you plan to use the container for other tasks, replace --rm with --name YOUR_CONTAINER_NAME.

-v "$PWD"/onnx_models:/workspace/onnx_models ensures that the models we convert to ONNX are persisted in ./onnx_models on your machine.

If you pulled a different image than paddlepaddle/paddle:3.2.0, replace it in this command too.

docker run --rm -it \
  -v "$PWD"/onnx_models:/workspace/onnx_models \
  paddlepaddle/paddle:3.2.0 \
  /bin/bash

Obtain the ONNX models

Search the model list sections for the relevant model download links.

I will be using the mobile inference models, as their accuracy relative to file size and execution time is far better than that of the full-size models.

Download the PaddleOCR models

mkdir -p /workspace/onnx_models && cd /workspace/onnx_models

pip install paddleocr onnxruntime
paddlex --install paddle2onnx

# wxt NAME URL: download a Paddle inference tarball, extract it into NAME/,
# convert it to ONNX, and clean up the intermediate files.
wxt() {
    mkdir -p "$1"
    wget "$2" -O "$1".tar
    # collect the unique top-level entries of the archive
    mapfile -t top <<<"$(tar tf "$1".tar | cut -d/ -f1 | uniq)"

    if [[ ${#top[@]} -eq 1 ]] && tar tvf "$1".tar | grep "^d" | grep -q -m1 "${top[0]}"; then
        # single wrapping directory: strip it so the files land directly in NAME/
        tar xf "$1".tar --strip-components=1 -C "$1"
    else
        tar xf "$1".tar -C "$1"
    fi

    paddlex --paddle2onnx --paddle_model_dir "$1" --onnx_model_dir .
    mv inference.onnx "$1".onnx

    rm -rf "$1" "$1".tar ./*.yml
}
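The archive-layout check above can be exercised offline on a synthetic tarball (model_dir and the file name here are made-up stand-ins, not the real archive contents):

```shell
# Create a throwaway archive with a single top-level directory,
# then run the same check wxt uses to decide on --strip-components.
tmp=$(mktemp -d)
mkdir -p "$tmp/model_dir"
echo "weights" > "$tmp/model_dir/inference.pdmodel"
tar -C "$tmp" -cf "$tmp/model.tar" model_dir

mapfile -t top <<<"$(tar tf "$tmp/model.tar" | cut -d/ -f1 | uniq)"
if [[ ${#top[@]} -eq 1 ]] && tar tvf "$tmp/model.tar" | grep "^d" | grep -q -m1 "${top[0]}"; then
    layout="strip"   # one wrapping directory: extract with --strip-components=1
else
    layout="flat"    # files at the archive root: extract as-is
fi
echo "detected layout: $layout"
rm -rf "$tmp"
```

Both real archive shapes end up as a flat NAME/ directory, which is what paddlex expects as --paddle_model_dir.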

# These links can be found at https://www.paddleocr.ai/main/en/version3.x/module_usage/module_overview.html
wxt det https://paddle-model-ecology.bj.bcebos.com/paddlex/official_inference_model/paddle3.0.0/PP-OCRv5_mobile_det_infer.tar
#wxt rec https://paddle-model-ecology.bj.bcebos.com/paddlex/official_inference_model/paddle3.0.0/PP-OCRv5_mobile_rec_infer.tar
wxt rec https://paddle-model-ecology.bj.bcebos.com/paddlex/official_inference_model/paddle3.0.0/latin_PP-OCRv5_mobile_rec_infer.tar
wxt cls https://paddle-model-ecology.bj.bcebos.com/paddlex/official_inference_model/paddle3.0.0/PP-LCNet_x1_0_textline_ori_infer.tar

Convert the saved models to ONNX

If you skipped the wxt helper above, the extracted model directories can be converted manually (the directory names below assume the archives were unpacked without renaming):

pip install paddleocr
paddlex --install paddle2onnx
cd /workspace/onnx_models
paddlex \
     --paddle2onnx \
     --paddle_model_dir PP-OCRv5_mobile_det_infer \
     --onnx_model_dir .
mv inference.onnx det.onnx
paddlex \
     --paddle2onnx \
     --paddle_model_dir latin_PP-OCRv5_mobile_rec_infer \
     --onnx_model_dir .
mv inference.onnx rec.onnx
paddlex \
     --paddle2onnx \
     --paddle_model_dir PP-LCNet_x1_0_textline_ori_infer \
     --onnx_model_dir .
mv inference.onnx cls.onnx
exit
ls onnx_models

should return

cls.onnx  det.onnx  rec.onnx

Download the dictionary for Character Recognition

wget -O dict.txt https://raw.githubusercontent.com/PaddlePaddle/PaddleOCR/refs/heads/main/ppocr/utils/dict/ppocrv5_en_dict.txt
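The dictionary is a plain text file with one recognition token per line; the rec model's output indices are mapped back to characters through it. A quick offline sanity check of that shape (dict_sample.txt is a mock stand-in so this runs without the download):

```shell
# Verify the one-token-per-line shape the recognition decoder expects.
printf 'a\nb\nc\n' > dict_sample.txt   # stand-in for dict.txt
tokens=$(wc -l < dict_sample.txt)
empty=$(grep -c '^[[:space:]]*$' dict_sample.txt || true)
echo "tokens: $tokens, blank lines: $empty"
rm dict_sample.txt
```

Run the same two checks against the real dict.txt after downloading; blank lines would shift every index after them.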

Usage

  1. Update the config.yaml to match your settings and filepaths
  2. Build and run:

$ make
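The actual keys of config.yaml are defined by this repo; purely as an illustration, a pipeline config of this kind typically collects the model and dictionary paths produced above (every key name below is a guess, not the repo's schema):

```yaml
# Hypothetical layout -- check the repo's config.yaml for the real keys.
det_model: onnx_models/det.onnx   # text detection
cls_model: onnx_models/cls.onnx   # textline orientation classifier
rec_model: onnx_models/rec.onnx   # text recognition
rec_dict:  dict.txt               # character dictionary for decoding
```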
