PayGo - AI-Powered Invoice Processing System

An intelligent invoice processing system that automates data extraction, validation, and standardization for Accounts Payable teams.

Team Meta Cognition | National Institute of Technology, Rourkela

📋 Table of Contents

Overview
Problem Statement
Solution
Features
Architecture
Technology Stack
Installation
Usage
API Documentation
Performance
Roadmap
Contributing
Team
License

🎯 Overview

PayGo is an AI-powered invoice processing system for Accounts Payable (AP) teams that have to handle many different invoice formats. It combines OCR with a Large Language Model to extract, validate, and standardize invoice data automatically, which reduces manual workload and cuts processing errors.

Key Highlights

95%+ Accuracy in data extraction across diverse invoice formats
10x Faster processing compared to manual data entry
Multi-format Export (JSON, CSV, TXT, XLSX)
Human-in-the-Loop validation for quality assurance
Cloud-Ready containerized deployment
Audit Trail for compliance and tracking

🚨 Problem Statement

The Challenges

Invoice Diversity
- Varied layouts make fixed rules ineffective
- AP teams struggle to maintain speed and accuracy
- Scalability issues with increasing invoice volumes
Manual Processing
- Slow and error-prone human-driven workflows
- Incorrect payments and lost discounts
- Rising operational expenses
Lack of Standardization
- Inconsistent date formats, vendor names, and currencies
- Limited audit trails create compliance risks
- Difficult system integration

✨ Solution

PayGo addresses these challenges through three core components:

1. AI-Powered Invoice Understanding

OCR Technology: Azure OCR Intelligence for text extraction
NLP/LLM Processing: OpenAI GPT-4o-mini for intelligent parsing
Format Agnostic: Designed to handle varied invoice layouts and languages without per-template rules
Field Extraction: Automatically identifies Invoice ID, Date, Vendor, Amount, Currency, and more
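
As a rough illustration of the parsing step, the sketch below builds a prompt from OCR text and parses a JSON reply into a fixed set of fields. It does not call Azure or OpenAI; the function names, the field list, and the prompt wording are assumptions made for this example, not PayGo's actual code.

```python
import json

# Hypothetical field list, based on the fields named in this README.
FIELDS = ["invoice_id", "invoice_date", "vendor_name", "total_amount", "currency"]


def build_extraction_prompt(ocr_text: str) -> str:
    """Build an instruction asking the model for one JSON object with fixed keys."""
    keys = ", ".join(FIELDS)
    return (
        "Extract the following fields from the invoice text and reply with "
        f"a single JSON object using exactly these keys: {keys}. "
        "Use null for any field that is not present.\n\n"
        f"Invoice text:\n{ocr_text}"
    )


def parse_model_reply(reply: str) -> dict:
    """Parse the model's reply, keeping only the expected keys."""
    try:
        data = json.loads(reply)
    except json.JSONDecodeError:
        data = {}
    if not isinstance(data, dict):
        data = {}
    # Missing keys become None so downstream code always sees the same shape.
    return {field: data.get(field) for field in FIELDS}
```

Returning `None` for anything the model did not supply gives the validation step an explicit signal that a field needs review, rather than a missing key.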

2. Human-in-the-Loop Validation

Auto-validation: High-confidence fields processed automatically
Smart Flagging: Low-confidence extractions flagged for review
Quality Assurance: Keeps accuracy high while reducing manual workload
Confidence Scoring: Transparency in extraction reliability
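
The split between auto-validation and flagging can be sketched as a simple threshold check on per-field confidence scores. The 0.85 default mirrors the 85% auto-validation threshold quoted under Benchmarks; the function name and the choice of `>=` at the boundary are assumptions for illustration.

```python
from typing import Dict, List, Tuple

# Assumed default, matching the 85% auto-validation figure in this README.
AUTO_VALIDATE_THRESHOLD = 0.85


def route_fields(
    confidence_scores: Dict[str, float],
    threshold: float = AUTO_VALIDATE_THRESHOLD,
) -> Tuple[List[str], List[str]]:
    """Split field names into (auto_validated, needs_review)."""
    auto, review = [], []
    for field, score in confidence_scores.items():
        # Fields at or above the threshold pass; the rest go to a human.
        (auto if score >= threshold else review).append(field)
    return auto, review


scores = {"invoice_id": 0.98, "total_amount": 0.97, "vendor_name": 0.62}
auto, review = route_fields(scores)
print(auto)    # ['invoice_id', 'total_amount']
print(review)  # ['vendor_name']
```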

3. Automated Data Standardization

Format Canonicalization: Standardizes dates, vendor names, and currencies
Structured Output: Converts data to JSON, CSV, TXT, or XLSX
ERP Integration: Seamless data push to accounting systems
Audit Trail: Complete processing history for compliance
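
A minimal sketch of format canonicalization, assuming ISO 8601 dates and ISO 4217 currency codes as the target formats. The list of accepted date formats, the symbol table, and the helper names are illustrative assumptions; a real deployment would need locale-aware rules, since a bare `$` or a date like 03/04/2024 is ambiguous.

```python
from datetime import datetime
from typing import Optional

# Assumed input formats. Day-first is tried before month-first, which is a
# policy choice: an ambiguous date such as 03/04/2024 is read as 3 April.
DATE_FORMATS = ["%Y-%m-%d", "%d/%m/%Y", "%m/%d/%Y", "%d %b %Y", "%B %d, %Y"]

# Assumed symbol mapping; "$" is treated as USD here for simplicity.
CURRENCY_SYMBOLS = {"$": "USD", "€": "EUR", "£": "GBP", "₹": "INR"}


def normalize_date(raw: str) -> Optional[str]:
    """Return the date as YYYY-MM-DD, or None if no known format matches."""
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(raw.strip(), fmt).date().isoformat()
        except ValueError:
            continue
    return None


def normalize_currency(raw: str) -> str:
    """Map a symbol to its code, otherwise upper-case the given code."""
    token = raw.strip()
    return CURRENCY_SYMBOLS.get(token, token.upper())


def normalize_vendor(raw: str) -> str:
    """Collapse repeated whitespace and trim trailing punctuation."""
    return " ".join(raw.split()).strip(" .,")
```

Returning `None` for an unparseable date, instead of guessing, lets the review step flag the field.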

🚀 Features

Core Functionality

✅ Multi-format Invoice Support - PDF, JPG, PNG, and more
✅ Intelligent Field Extraction - Invoice ID, Date, Vendor, Amount, Tax, Currency
✅ Confidence Scoring - AI-powered reliability indicators
✅ Validation Workflow - Human review for uncertain extractions
✅ Data Standardization - Consistent format across all outputs
✅ Multiple Export Formats - JSON, CSV, TXT, XLSX
✅ Batch Processing - Handle multiple invoices simultaneously
✅ Audit Logging - Complete processing history

Technical Features

🔧 Parallelized Processing - Multi-threaded invoice handling
🐳 Docker Support - Containerized for easy deployment
☁️ Cloud-Ready - Scalable architecture
🔒 Secure - Data encryption and secure storage
📊 Database Integration - Persistent storage of processed data
🔌 API-First Design - RESTful API for integration

🏗️ Architecture

Processing Workflow

┌─────────────────┐
│  User Uploads   │
│  Invoice (PDF/  │
│  JPG/PNG)       │
└────────┬────────┘
         │
         ▼
┌─────────────────┐      ┌──────────────┐      ┌─────────────┐
│  Image          │─────▶│  OCR Model   │─────▶│  Parser     │
│  Processing     │      │  (Azure)     │      │  Model      │
└─────────────────┘      └──────────────┘      │  (GPT-4o    │
                                                │  mini)      │
                                                └──────┬──────┘
                                                       │
         ┌─────────────────────────────────────────────┘
         │
         ▼
┌─────────────────┐      ┌──────────────┐      ┌─────────────┐
│  Flagging       │◀─────│  Format      │◀─────│  Database   │
│  (Low           │      │  Conversion  │      │  Storage    │
│  Confidence)    │      └──────────────┘      └─────────────┘
└────────┬────────┘              │
         │                       │
         ▼                       ▼
┌─────────────────┐      ┌──────────────────────┐
│  Human Review   │      │  Available to        │
│  & Validation   │      │  Download (JSON/CSV/ │
└─────────────────┘      │  XLSX/TXT)           │
                         └──────────────────────┘

Parallelization Architecture

Invoice Capture → Image Processing → Multithreading Manager
                                              │
                        ┌─────────────────────┼─────────────────────┐
                        ▼                     ▼                     ▼
                   [Thread 1]            [Thread 2]            [Thread 3]
                        │                     │                     │
                        └─────────────────────┼─────────────────────┘
                                              ▼
                                    Aggregation & Structured
                                    Output Generation
                                              │
                                              ▼
                                    Validation and Review
                                              │
                                              ▼
                                    Integration with ERP
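
The fan-out and aggregation shown above can be sketched with Python's standard `ThreadPoolExecutor`. This assumes the per-invoice work is dominated by remote OCR and LLM calls, where threads help; `process_batch` and the `process_one` callable are hypothetical names, not PayGo's actual interfaces.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, List


def process_batch(
    paths: List[str],
    process_one: Callable[[str], dict],
    max_workers: int = 3,
) -> List[Dict]:
    """Run process_one over every path in a thread pool and aggregate results.

    Results come back in the same order as `paths`. A failure on one invoice
    is recorded in its result instead of aborting the whole batch.
    """

    def safe(path: str) -> dict:
        try:
            return {"path": path, "ok": True, "data": process_one(path)}
        except Exception as exc:  # keep the rest of the batch going
            return {"path": path, "ok": False, "error": str(exc)}

    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map preserves input order, which keeps aggregation simple.
        return list(pool.map(safe, paths))
```

Failed items stay in the aggregated output with `ok` set to `False`, so they can be routed to the validation and review stage along with low-confidence extractions.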

🛠️ Technology Stack

OCR Models Evaluated

| OCR Engine | Parsing Model | Performance | Status |
|---|---|---|---|
| Tesseract | Meta Llama 3 | Low accuracy, free | ❌ Not Used |
| PaddleOCR | OpenAI GPT-4o | Fast but unreliable on complex layouts | ❌ Not Used |
| Google Document AI | Google Gemini 1.5 | Good integration, high cost | ❌ Not Used |
| Azure OCR Intelligence | OpenAI GPT-4o-mini | High accuracy, low cost, fast | ✅ Selected |

Core Technologies

OCR: Azure OCR Intelligence
AI/NLP: OpenAI GPT-4o-mini
Backend: Python 3.8+
Database: PostgreSQL/SQLite
Containerization: Docker
Cloud Platform: AWS/Azure/GCP (configurable)

Dependencies

- azure-ai-formrecognizer
- openai
- pandas
- pillow
- opencv-python
- openpyxl
- sqlalchemy
- fastapi
- uvicorn

📦 Installation

Prerequisites

Python 3.8 or higher
Docker (optional, for containerized deployment)
Azure Account with OCR Intelligence enabled
OpenAI API Key

Local Setup

Clone the repository

git clone https://github.com/HrushikeshAnandSarangi/paygo.git
cd paygo

Create a virtual environment

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install dependencies
```
pip install -r requirements.txt
```

Configure environment variables

cp .env.example .env
# Edit .env with your API keys

Required environment variables:

AZURE_OCR_KEY=your_azure_ocr_key
AZURE_OCR_ENDPOINT=your_azure_endpoint
OPENAI_API_KEY=your_openai_api_key
DATABASE_URL=your_database_url
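
One way an application could check these variables at startup is sketched below. The variable names come from the list above; the `load_settings` helper is hypothetical and is not claimed to exist in the PayGo codebase.

```python
import os
from typing import Dict, Mapping

REQUIRED_VARS = (
    "AZURE_OCR_KEY",
    "AZURE_OCR_ENDPOINT",
    "OPENAI_API_KEY",
    "DATABASE_URL",
)


def load_settings(env: Mapping[str, str] = os.environ) -> Dict[str, str]:
    """Return the required settings, or raise naming every missing variable."""
    # Empty strings count as missing, which catches an unedited .env file.
    missing = [name for name in REQUIRED_VARS if not env.get(name)]
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    return {name: env[name] for name in REQUIRED_VARS}
```

Failing at startup with the full list of missing names is usually easier to debug than an authentication error on the first OCR request.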

Run database migrations
```
python manage.py migrate
```

Start the application

python app.py
# Or using uvicorn for FastAPI
uvicorn main:app --reload

Docker Deployment

Build the Docker image
```
docker build -t paygo:latest .
```

Run the container

docker run -d \
  -p 8000:8000 \
  -e AZURE_OCR_KEY=your_key \
  -e AZURE_OCR_ENDPOINT=your_endpoint \
  -e OPENAI_API_KEY=your_key \
  -e DATABASE_URL=your_database_url \
  --name paygo-app \
  paygo:latest

Using Docker Compose
```
docker-compose up -d
```

💻 Usage

Web Interface

Access the application at http://localhost:8000
Upload invoice files (PDF, JPG, PNG)
Wait for AI processing
Review flagged fields if any
Download processed data in your preferred format

API Usage

Upload Invoice

curl -X POST http://localhost:8000/api/upload \
  -F "file=@invoice.pdf"

Get Processing Status

curl "http://localhost:8000/api/status/{invoice_id}"

Download Processed Data

# Replace {invoice_id} with the ID returned by the upload call.
# The URLs are quoted so the shell does not interpret "?" or the braces.

# JSON format
curl "http://localhost:8000/api/download/{invoice_id}?format=json"

# CSV format
curl "http://localhost:8000/api/download/{invoice_id}?format=csv"

# Excel format
curl "http://localhost:8000/api/download/{invoice_id}?format=xlsx"

# Text format
curl "http://localhost:8000/api/download/{invoice_id}?format=txt"

Python SDK

from paygo import InvoiceProcessor

# Initialize processor
processor = InvoiceProcessor(
    azure_key="your_azure_key",
    openai_key="your_openai_key"
)

# Process invoice
result = processor.process_invoice("path/to/invoice.pdf")

# Access extracted data
print(result.invoice_id)
print(result.vendor_name)
print(result.total_amount)

# Export to different formats
result.to_json("output.json")
result.to_csv("output.csv")
result.to_excel("output.xlsx")

📚 API Documentation

Endpoints

`POST /api/upload`

Upload and process an invoice.

Request:

Method: POST
Content-Type: multipart/form-data
Body: file (PDF, JPG, PNG)

Response:

{
  "invoice_id": "uuid",
  "status": "processing",
  "message": "Invoice uploaded successfully"
}

`GET /api/status/{invoice_id}`

Get processing status of an invoice.

Response:

{
  "invoice_id": "uuid",
  "status": "completed",
  "confidence": 0.95,
  "requires_review": false
}
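
Because upload returns while the invoice is still `processing`, a client typically polls this endpoint until the status changes. The sketch below shows one way to do that. The HTTP call is passed in as a `fetch_status` callable so the loop stays independent of any particular HTTP library. The function name, interval, and timeout are assumptions; only the `processing` and `completed` status values appear in this README.

```python
import time
from typing import Callable


def wait_for_completion(
    invoice_id: str,
    fetch_status: Callable[[str], dict],
    interval: float = 1.0,
    timeout: float = 60.0,
) -> dict:
    """Poll until the status is no longer "processing" or the timeout passes."""
    deadline = time.monotonic() + timeout
    while True:
        status = fetch_status(invoice_id)
        # Any status other than "processing" is treated as final.
        if status.get("status") != "processing":
            return status
        if time.monotonic() >= deadline:
            raise TimeoutError(
                f"Invoice {invoice_id} still processing after {timeout}s"
            )
        time.sleep(interval)
```

In practice `fetch_status` would issue `GET /api/status/{invoice_id}` and return the decoded JSON body. The caller can then check `requires_review` before downloading the result.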

`GET /api/invoice/{invoice_id}`

Retrieve extracted invoice data.

Response:

{
  "invoice_id": "INV-2024-001",
  "vendor_name": "Acme Corp",
  "invoice_date": "2024-01-15",
  "due_date": "2024-02-15",
  "total_amount": 1250.00,
  "currency": "USD",
  "tax_amount": 125.00,
  "line_items": [
    {
      "description": "Product A",
      "quantity": 10,
      "unit_price": 100.00,
      "total": 1000.00
    }
  ],
  "confidence_scores": {
    "invoice_id": 0.98,
    "total_amount": 0.97,
    "vendor_name": 0.95
  }
}

`GET /api/download/{invoice_id}`

Download processed invoice data.

Query Parameters:

format: json, csv, xlsx, txt
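
The exact layout of the CSV export is not specified here. As one plausible shape, the sketch below flattens the JSON structure returned by `GET /api/invoice/{invoice_id}` into one row per line item, repeating the invoice-level fields on each row. The column choice and the `invoice_to_csv` name are assumptions for illustration.

```python
import csv
import io

# Assumed column layout: invoice-level fields first, then line-item fields.
HEADER_FIELDS = ["invoice_id", "vendor_name", "invoice_date", "currency"]
ITEM_FIELDS = ["description", "quantity", "unit_price", "total"]


def invoice_to_csv(invoice: dict) -> str:
    """Return CSV text with one row per line item."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=HEADER_FIELDS + ITEM_FIELDS, lineterminator="\n"
    )
    writer.writeheader()
    header = {key: invoice.get(key, "") for key in HEADER_FIELDS}
    for item in invoice.get("line_items", []):
        row = dict(header)
        row.update({key: item.get(key, "") for key in ITEM_FIELDS})
        writer.writerow(row)
    return buffer.getvalue()
```

Repeating the invoice fields on every row keeps the file flat, which spreadsheet tools and most ERP import formats handle more easily than nested data.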

📊 Performance

Benchmarks

Processing Time: 2-5 seconds per invoice (average)
Accuracy: 95%+ on diverse invoice formats
Throughput: 100+ invoices per minute (with parallelization)
Confidence Threshold: 85% for auto-validation
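
The latency and throughput figures can be related with Little's law (jobs in flight = arrival rate × time per job). The short calculation below estimates how many invoices must be processed concurrently to reach 100 per minute at the quoted 2 to 5 seconds each. It is a back-of-the-envelope sketch that ignores queuing, upload time, and API rate limits.

```python
import math


def workers_needed(invoices_per_minute: float, seconds_per_invoice: float) -> int:
    """Concurrent jobs required to sustain the given rate (Little's law)."""
    return math.ceil(invoices_per_minute / 60 * seconds_per_invoice)


print(workers_needed(100, 2))  # 4
print(workers_needed(100, 5))  # 9
```

Under these assumptions, sustaining 100 invoices per minute takes roughly 4 to 9 concurrent workers, depending on where in the 2 to 5 second range the average falls.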

Improvements from Feedback

Export Formats
- ❌ Before: JSON and CSV only
- ✅ After: JSON, CSV, TXT, and XLSX
Deployment
- ❌ Before: Not deployment-ready, scaling issues
- ✅ After: Dockerized and cloud-deployed
Feature Scope
- ❌ Before: Off-topic features (due date notifications, balance sheets)
- ✅ After: Focused core functionality

🗺️ Roadmap

Phase 1 (Completed) ✅

Phase 2 (In Progress) 🚧

Phase 3 (Planned) 📋

🤝 Contributing

We welcome contributions! Please follow these steps:

Fork the repository
Create a feature branch (git checkout -b feature/AmazingFeature)
Commit your changes (git commit -m 'Add some AmazingFeature')
Push to the branch (git push origin feature/AmazingFeature)
Open a Pull Request

Development Guidelines

Follow PEP 8 style guide for Python code
Write unit tests for new features
Update documentation as needed
Ensure all tests pass before submitting PR

👥 Team

Team Meta Cognition
National Institute of Technology, Rourkela

Sujal Kumar Agarwal - Team Leader
Kunal Kushwaha - Member
Istaprasad Patra - Member
Hrushikesh Anand Sarangi - Member

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

Azure OCR Intelligence for powerful text extraction
OpenAI for advanced language understanding
National Institute of Technology, Rourkela
All contributors and testers

📞 Support

For questions, issues, or feature requests, please:

Check the Issues page
Create a new issue if your question isn't already addressed
Contact the team at [email]

🔗 Links

#PoweredByMeta_Cognition

Made with ❤️ by Team Meta Cognition
