FinLLM is an open-source assistant for financial document understanding and compliance automation, powered by OCR + LLMs + Vector Search (RAG). It enables users to extract content from scanned or digital documents and chat with them contextually.
- β Upload scanned PDFs, DOCX, or image files
- β Perform OCR using Tesseract
- β Chunk and embed content using sentence-transformers or OpenAI
- β Store in FAISS vector database
- β Run RAG-style querying using OpenAI or DeepSeek
- β CLI and Streamlit-based chat interface
- β Easily extensible for other domains (legal, healthcare, HR, etc.)
pip install git+https://github.com/akash049/fin-llm-assistant.git