HuaweiOCR

One-line summary

An automated device-label recognition tool: crop fields -> barcode decode -> OCR -> structured export (JSONL), with Windows one-click run.

Overview

This project is a local batch-processing tool for device label / barcode images. It combines object detection (Roboflow inference), barcode decoding, and OCR to extract Model and SN. The core idea is "locate first, then recognize":

Detect label areas in the full image using a Roboflow model.
Crop label areas and then crop model/sn sub-fields.
Run barcode decoding and OCR on the cropped fields.
Output structured results (JSONL) and debug logs for tracing and tuning.

Pipeline

Input -> crop -> barcode -> ocr -> postprocess -> output

Highlights

End-to-end batch processing: from raw images to results in one run.
Multi-stage cropping and filtering to improve accuracy.
Dual-channel decoding: barcode first, OCR as fallback.
Debug artifacts saved for easier troubleshooting.
Local/offline dependencies bundled to reduce external setup.

Core structure

crop.py: stage 1/2 cropping (Roboflow detect + crop)
scan2.py: barcode/OCR + structured output
barcode.py: barcode enhancement pipeline
run_all.py: one-click pipeline entry
start.bat: Windows one-click script

Environment

Windows
Python 3.10+ (recommended)
Valid Roboflow API key

Quick start (Windows)

Create .env in the project root (do not commit):

API_KEY=your_api_key_here

Double-click start.bat.

CLI

python run_all.py --input ./images --out ./out --format jsonl --log-level info --device cpu

Full options:

python run_all.py --help

Pipeline overview

Read images from new_images/.
Stage 1: detect and crop label areas -> stage1_labels/.
Stage 2: crop model/sn fields -> stage2_fields/.
scan2.py runs barcode + OCR.
Outputs model_sn_ocr.jsonl and debug_ocr_barcode.log.

Outputs

stage1_labels/: label crops
stage2_fields/model/: model crops
stage2_fields/sn/: SN crops
model_sn_ocr.jsonl: final results (one JSON per line)
debug_ocr_barcode.log: debug log Default output is repo root; use --out to change it.

Quantitative metrics

CLI prints:

total images, total time, average time per image
SN extraction success rate
regex pass rate
error distribution (barcode_fail / ocr_fail / regex_fail)

Output format example

Single JSONL line:

{"label_id":"img_001__label_1","model":"S380-S8P2T","sn":"4E25XXXXXXXX","model_src":"barcode","sn_src":"ocr","model_raw":"...","sn_raw":"..."}

Robustness strategies

Multi-scale upscaling for small barcodes
ROI cropping to reduce noise
Rotation attempts (0/90/180/270)
Regex validation for SN/Model
Failure samples saved for review

Roadmap

CSV export and configurable field mapping
Finer error taxonomy and visual reports
Incremental CLI processing with resume support
Lighter releases (optional LFS or model download scripts)

FAQ

No results: check image clarity, angle, lighting; or tune crop/thresholds.
API_KEY missing: ensure .env exists and is correct.
Unstable results: tune parameters in crop.py / scan2.py.

Security

.env is excluded from the repo, so API keys are not exposed.
Share .env privately when needed.
Do not hard-code API keys.
Key rotation: update API_KEY in .env without code changes.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
1		1
bundle		bundle
clearimage_cli/ClearImageCli		clearimage_cli/ClearImageCli
.env.example		.env.example
.gitignore		.gitignore
1.py		1.py
HuaweiOCR.spec		HuaweiOCR.spec
LICENSE		LICENSE
README.md		README.md
README_EN.md		README_EN.md
README_ZH.md		README_ZH.md
app_paths.py		app_paths.py
barcode.py		barcode.py
crop.py		crop.py
debug.py		debug.py
gui_app.py		gui_app.py
gui_app_en.py		gui_app_en.py
ocr.py		ocr.py
run_all.py		run_all.py
scan.py		scan.py
scan2.py		scan2.py
start.bat		start.bat

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

HuaweiOCR

One-line summary

Overview

Pipeline

Highlights

Core structure

Environment

Quick start (Windows)

CLI

Pipeline overview

Outputs

Quantitative metrics

Output format example

Robustness strategies

Roadmap

FAQ

Security

About

Uh oh!

Releases 1

Packages

Languages

License

xyjk0511/huaweiocr

Folders and files

Latest commit

History

Repository files navigation

HuaweiOCR

One-line summary

Overview

Pipeline

Highlights

Core structure

Environment

Quick start (Windows)

CLI

Pipeline overview

Outputs

Quantitative metrics

Output format example

Robustness strategies

Roadmap

FAQ

Security

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages