feat: add gpt-4o OCR for documents and RAG Q&A by jeonghoonkang · Pull Request #137 · jeonghoonkang/BerePi

jeonghoonkang · 2025-07-31T17:34:15Z

Summary

use gpt-4o to transcribe uploaded receipt images or PDF documents
embed extracted text and answer questions through a simple RAG pipeline
Base64-encode receipt images before sending them to the OCR model
show upload progress in the Streamlit interface
show images one at a time with arrow navigation while OCR text remains hidden
provide a chat-style Q&A box for asking about the recognized text
cache processed receipts and display the time taken for each Q&A answer
allow jumping to a specific receipt by filename and store OCR results in a merged JSON file

python -m py_compile apps/receipt_ocr/receipt_ocr_app.py && echo "py_compile success"

feat: add filename navigation and save OCR results

abaa374

jeonghoonkang added the codex label Jul 31, 2025 — with ChatGPT Codex Connector

Merge branch 'master' into glhab1-codex/modify-ocr-code-to-use-openai

2375a69