Skip to content

Optical Character Recognition for Sundanese Script.

Notifications You must be signed in to change notification settings

brokamal/SundaScriptOCR

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Sundanese Script Optical Character Recognition

Table of content

Introduction

Sundanese script OCR, made using YOLO for object detection, and LSTM for transliteration.

Feature

  • Individual character detection using YOLO.
  • Transliteration to Latin using Seq2Seq LSTM/BiLSTM, based on YOLO detection result. resultalldemo

Flowchart

diagram

  1. Image is uploaded into the main webapp for cropping
  2. Webapp send image to pre-trained YOLO model for detection.
  3. Webapp recieve respond from YOLO containing detection result.
  4. Webapp process the detection result before sending it to transliteration model for transliteration.
  5. Transliteration model repsonds with the transliteration result.

Installation-and-Setup

  1. Clone this repository
  2. Install the required dependencies:
pip install -r requirements.txt
  1. Train model using this Dataset: Sundanese Script Dataset

  2. Deploy your trained model on roboflow

  3. Insert your own Roboflow API key in the code On App.py:

roboflow_client = InferenceHTTPClient(
    api_url="https://detect.roboflow.com",
    api_key="API PLACEHOLDER" # add your own API key
)
  1. Run the application:
python app.py
  1. Open your browser and go to http://127.0.0.1:5000/
  2. On another terminal, go to Transliteration_model-bcnr directory
  3. Run the transliteration model:
python serve.py

About

Optical Character Recognition for Sundanese Script.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published