Skip to content

jacksonmatheson/python-japanese-ocr

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Japanese OCR in Python

Dependencies

  • Python 3
  • OpenCV >= 4
  • Tesseract (see below)

Tesseract

  • Install tesseract 4.0
  • Download jpn_vert.traineddata here
  • Copy jpn_vert.traineddata in /usr/share/tessdata
  • Check with tesseract --list-langs that jpn_vert correctly appears

Archlinux

pacman -S opencv tesseract-ocr-git tesseract-data-jpn

Ubuntu / Mint

sudo apt-get install -y tesseract-ocr tesseract-ocr-jpn-vert
sudo apt-get install -y python3-opencv

How to use

./main.py examples/sample_page.jpg

Followed by:

./main.py examples/sample_page.jpg --ocr

About

Japanese OCR in Python

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%