Optical Character Recognition: Converting wine catalogue images to text

OCR using Pytesseract

Dependencies:

1.) Pytesseract
2.) Matplotlib
3.) NumPy
4.) OpenCV

Key features of the ocr-python.py script:

Executable, highly reusable Python script: just add a for-loop to run Tesseract on as many images as you want.
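
The for-loop mentioned above could look roughly like the sketch below. The helper names `ocr_image` and `ocr_folder`, the `images` folder, and the `*.jpg` pattern are illustrative assumptions, not names taken from ocr-python.py.

```python
from pathlib import Path


def ocr_image(path):
    """Run Tesseract on one image file and return the recognized text."""
    # Imported lazily so ocr_folder can be reused with any other OCR callable.
    from PIL import Image
    import pytesseract

    return pytesseract.image_to_string(Image.open(path))


def ocr_folder(folder, pattern="*.jpg", ocr=ocr_image):
    """Return {file name: recognized text} for every matching image in folder."""
    results = {}
    for path in sorted(Path(folder).glob(pattern)):  # sorted for a stable order
        results[path.name] = ocr(path)
    return results
```

Calling `ocr_folder("images")` would then return one text entry per JPEG in that folder. Passing a different `ocr` callable is one way to swap in a filtered variant of the same step.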

Multi-layered filtering: just execute the ocr-python.py script to run multiple filters over the images and improve the accuracy of Tesseract's recognition.
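
As a rough illustration of layered filtering, the sketch below chains three common OpenCV operations and keeps the image from every stage. The specific filters (grayscale, median blur, Otsu threshold) and all function names are assumptions chosen for the example; the filters in ocr-python.py may differ.

```python
try:
    import cv2  # OpenCV; optional so run_filters can be imported without it
except ImportError:
    cv2 = None


def grayscale(image):
    """Convert a BGR image (as loaded by cv2.imread) to a single gray channel."""
    return cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)


def denoise(image):
    """Median blur with a 3x3 kernel to suppress salt-and-pepper noise."""
    return cv2.medianBlur(image, 3)


def binarize(image):
    """Otsu threshold on a gray image: bright pixels become 255, the rest 0."""
    _, result = cv2.threshold(image, 0, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU)
    return result


def run_filters(image, filters):
    """Apply filters in order and return [(stage name, image)] for every stage."""
    stages = [("original", image)]
    for f in filters:
        image = f(image)
        stages.append((f.__name__, image))
    return stages
```

With OpenCV installed, `run_filters(cv2.imread(path), [grayscale, denoise, binarize])` returns the original plus three filtered versions, so each stage can be passed to Tesseract and compared.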

Basic Function Filter:

Image Adjusting Filter:

Uses matplotlib to let you visualize how the filtering functions actually manipulate the images. The display runs on a timer that you can control.
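
A timed preview of this kind can be built on `plt.pause`, as in the minimal sketch below. The function name `show_stage` and the 2-second default are assumptions; the script's own timer mechanism may differ.

```python
import matplotlib.pyplot as plt


def show_stage(image, title, seconds=2.0):
    """Show one image for `seconds`, then clear the figure for the next one."""
    plt.imshow(image, cmap="gray")  # cmap only affects single-channel images
    plt.title(title)
    plt.axis("off")
    plt.pause(seconds)  # runs the GUI event loop for this long
    plt.clf()           # wipe the figure so the next stage starts clean
```

Calling `show_stage` once per filtered image steps through the stages in one window. Raising or lowering `seconds` controls how long each stage stays visible.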

Display the hOCR data by uncommenting this command for each filter

The hOCR data output will look like this:
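
hOCR is HTML with layout metadata: each recognized word sits in a span whose `title` attribute carries its bounding box and confidence. The sketch below shows one way to produce and read it, assuming `pytesseract.image_to_pdf_or_hocr` is the call involved (the script's own command may differ). The helper names `image_to_hocr` and `hocr_words` are hypothetical.

```python
import html
import re


def image_to_hocr(path):
    """Return Tesseract's hOCR markup for one image file as a string."""
    # Imported lazily so the parser below works without Tesseract installed.
    from PIL import Image
    import pytesseract

    raw = pytesseract.image_to_pdf_or_hocr(Image.open(path), extension="hocr")
    return raw.decode("utf-8")  # the call returns bytes


# Word spans carry a title such as: bbox x0 y0 x1 y1; x_wconf NN
WORD = re.compile(
    r"<span class=['\"]ocrx_word['\"][^>]*"
    r"title=['\"]bbox (\d+) (\d+) (\d+) (\d+); x_wconf (\d+)[^'\"]*['\"][^>]*>"
    r"(.*?)</span>",
    re.S,
)


def hocr_words(hocr):
    """Extract (text, (x0, y0, x1, y1), confidence) for each word span."""
    words = []
    for x0, y0, x1, y1, conf, inner in WORD.findall(hocr):
        text = re.sub(r"<[^>]+>", "", inner)  # drop nested <strong>/<em> tags
        text = html.unescape(text).strip()
        words.append((text, (int(x0), int(y0), int(x1), int(y1)), int(conf)))
    return words
```

`hocr_words(image_to_hocr(path))` then gives a flat list of words with their positions, which is easier to inspect than the raw markup.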

Next Steps:

Enhance image quality
Automate rotation of image
Deskewing / Border Removal
Cancel noise
Logical operation to output the text with the highest confidence
Incorporate data training and testing
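
The "highest confidence" step could be sketched as follows. It assumes each filter's image has been passed to `pytesseract.image_to_data` with dictionary output, and it uses the mean word confidence as the selection rule. That rule and the helper names are assumptions, one option among several.

```python
def ocr_data(image):
    """Per-word OCR results as a dict with 'text' and 'conf' lists."""
    import pytesseract  # lazy import: the scoring helpers below need no OCR

    return pytesseract.image_to_data(image, output_type=pytesseract.Output.DICT)


def mean_confidence(data):
    """Average confidence over real words; entries with conf -1 are layout rows."""
    scores = [
        float(c)
        for c, t in zip(data["conf"], data["text"])
        if str(t).strip() and float(c) >= 0
    ]
    return sum(scores) / len(scores) if scores else 0.0


def best_result(candidates):
    """candidates: {filter name: image_to_data dict}. Return (name, text, score)."""
    name = max(candidates, key=lambda n: mean_confidence(candidates[n]))
    data = candidates[name]
    # Words are joined with spaces, so the original line breaks are not kept.
    text = " ".join(str(t) for t in data["text"] if str(t).strip())
    return name, text, mean_confidence(data)
```

Given something like `{"basic": ocr_data(img), "filtered": ocr_data(filtered_img)}`, `best_result` returns the variant whose words Tesseract was most confident about.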

Next Step Flow Chart:

