img_2_pdf

This repository now contains a practical document-processing toolkit focused on one main goal:

Accept camera captures or imported files/folders.
Run document preprocessing and cleanup.
Export a clean merged PDF.
Add optional OCR later as a controlled extension.

Current Main App

Run:

python camscan_hybrid_tool.py

camscan_hybrid_tool.py is the current unified variant for your workflow.

New Unified App (In Progress)

A new package-based unified application is being built under src/uniscan.

Run (from repository root):

set PYTHONPATH=src
python -m uniscan.cli

One-script launcher (recommended on Windows):

.\run_uniscan.cmd

Or after installation:

uniscan

Quick workflow (Office Lens style):

Open tab 1. Import (main mode) and load files/folder, or use 2. Scan for camera capture.
Import and Scan are acquisition-only: they load/capture raw pages into session.
App switches to 3. Review: reorder, rotate, deskew, auto crop, manual corners, and side-by-side Before/After preview.
All processing controls are in Review: quick dropdowns (Lens, Post, Preset), Advanced... popup sliders, and an Apply all changes to all files scope checkbox.
Review uses lightweight previews by default (Full HD); uncheck it to work directly with full-resolution previews.
Auto Crop... opens a page browser with auto-detect and manual corner editing for one page or all pages.
Open 4. Export, choose OCR engine if needed, then save merged PDF or image files.

Current implemented modules in this new app:

Capture: live preview, single capture, burst capture, camera configuration
Import: folder/files (multi-select)/PDF import into one session
Pages: page list management (preview, reorder, select/delete)
Export: merged PDF and separate image export

Implementation notes:

Session pages are disk-backed (uniscan cache) with lazy reads to reduce RAM usage on large batches.
Pages review now shows Before/After preview for preprocessing visibility.
Capture/import keep originals first; processing is only applied from Review.
Export tab supports OCR engine selection with dependency status checks.
Import supports multi-file selection and background loading.
Import order is preserved end-to-end: folder order, document page order, and mixed import order are kept as selected.
Searchable PDF is currently wired for pytesseract, OCRmyPDF, and PyMuPDF OCR.
PaddleOCR, Surya, and MinerU are available as selectable OCR backends with readiness checks (searchable-PDF wiring pending).

What The App Does

camscan_hybrid_tool.py supports three source modes:

Import folder
Import files
Camera capture

Processing features:

Document detection and perspective extraction using third-party logic from camscan_suhren: camscan.scanner.main
Postprocessing effects from camscan_suhren: None, Sharpen, Grayscale, Black and White
Optional two-page split (left/right) for book-like captures.
Merged PDF export from all source modes.
Quality profiles (Fast, Balanced, Best quality) for practical output control.

Note:

OCR is intentionally left as the next stage and is not active in camscan_hybrid_tool.py yet.

Processing Pipeline

For folder/files mode:

Load input images (and PDF pages if PDF files are provided and pymupdf is installed).
Optionally detect and extract document contour.
Apply selected postprocessing function.
Optionally split each page into left/right halves.
Convert processed pages into one merged PDF.

For camera mode:

Capture N shots from selected camera index.
Wait configured delay between shots.
Apply the same processing pipeline as above.
Export merged PDF.

Setup

Recommended Python:

Python 3.11+

Install dependencies:

pip install opencv-python numpy pillow img2pdf pymupdf

Optional OCR dependencies in the new app:

pip install pytesseract pypdf ocrmypdf paddleocr pymupdf

Also install CLI/system tools where needed:

Tesseract OCR engine in PATH for pytesseract and PyMuPDF OCR mode.
ocrmypdf command in PATH for OCRmyPDF mode.

Experimental engine packages:

Surya (or marker package path that bundles Surya OCR).
MinerU (mineru or magic_pdf package).

If you plan to use legacy scripts with OCR, install additionally:

pip install ocrmypdf pypdf

External OCR tools for legacy OCR scripts:

Tesseract OCR
Ghostscript
qpdf
Poppler (pdftoppm, pdfunite) for only_tesseract.py in PDF mode

How To Run

Main app:

python camscan_hybrid_tool.py

Alternative app (images/file + optional OCR already integrated):

python unified_pdf_tool.py

Legacy apps (kept for reference/fallback):

python fast.py
python img_2_pdf.py
python only_tesseract.py
python "prepare pdf to tesseract.py"

Script Map

File	Role
`camscan_hybrid_tool.py`	Main hybrid app (`camera + files/folder`) using third-party processing logic from `camscan_suhren`
`unified_pdf_tool.py`	Unified app for folder/file workflows with optional OCR path
`fast.py`	OCR-focused GUI with batch PDF support
`img_2_pdf.py`	Photo-to-PDF app with OpenCV preprocessing and optional OCR
`only_tesseract.py`	OCR pipeline using direct `tesseract.exe` calls
`imgs_and_pdfs_ocr_fast_STABLE.py`	Stable previous OCR GUI version
`prepare pdf to tesseract.py`	PDF conditioning helper before OCR
`camscan_suhren/`	Third-party camera scanner project used as source of preprocessing logic

Known Limitations

OCR in camscan_hybrid_tool.py is not enabled yet (planned next).
Camera mode is shot-based capture (not a full continuous preview UI).
PDF import in hybrid mode requires pymupdf.

Troubleshooting

Error about missing camscan modules: Ensure folder camscan_suhren exists directly inside repo root.
Cannot open camera: Check camera index and close other apps using webcam.
PDF import error: Install pymupdf (pip install pymupdf).

Short Roadmap

Add optional OCR to camscan_hybrid_tool.py with toggle and language setting.
Add stronger camera UX (preview/retake/selection before export).
Add job queue for large folder batches.
Add tests for hybrid pipeline stages.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

img_2_pdf

Current Main App

New Unified App (In Progress)

What The App Does

Processing Pipeline

Setup

How To Run

Script Map

Known Limitations

Troubleshooting

Short Roadmap

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 65 Commits
docs		docs
src/uniscan		src/uniscan
tests		tests
.gitignore		.gitignore
README.md		README.md
camscan_hybrid_tool.py		camscan_hybrid_tool.py
fast.py		fast.py
img_2_pdf.py		img_2_pdf.py
imgs_and_pdfs_ocr_fast_STABLE.py		imgs_and_pdfs_ocr_fast_STABLE.py
naps2-7.5.3-win.exe		naps2-7.5.3-win.exe
only_tesseract.py		only_tesseract.py
prepare pdf to tesseract.py		prepare pdf to tesseract.py
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
run_uniscan.cmd		run_uniscan.cmd
unified_pdf_tool.py		unified_pdf_tool.py

Folders and files

Latest commit

History

Repository files navigation

img_2_pdf

Current Main App

New Unified App (In Progress)

What The App Does

Processing Pipeline

Setup

How To Run

Script Map

Known Limitations

Troubleshooting

Short Roadmap

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages