This project presents an automatic business card scanning system that simplifies the process of adding new contacts from physical cards.
The system extracts and classifies text information from business card images using a hybrid OCR + NER pipeline.
-
Image Input
The user provides an image of a business card. -
Text Extraction (OCR Stage)
The image is passed to a pre-trained PaddleOCR model, which extracts all visible text from the card with high accuracy. -
Entity Recognition (NER Stage)
The extracted text is then processed by a Named Entity Recognition (NER) model to identify and classify key information such as:- π€ Name / Surname
- π’ Company / Job Title
- π§ Email
- π Phone Number
- π Address
-
Hybrid Classification Approach
The NER stage uses a hybrid method that combines:- Regular expressions (RegEx) for pattern-based entity detection (emails, phone numbers, etc.).
- Gliner model β a transformer-based NER architecture that allows custom label definition without retraining, providing flexibility and adaptability across languages and formats.
| Component | Technology |
|---|---|
| OCR Engine | PaddleOCR |
| NER Model | Gliner |
| Text Preprocessing | Python RegEx |
| Frameworks | Python, PyTorch |
| Output Format | JSON (structured contact information) |