🚗 RC Text Extraction Tool

A TypeScript React application that extracts vehicle Registration Certificate (RC) details from front and back images using OCR (Tesseract.js) and an LLM (Google Gemini).

✨ Features

Dual Image Upload: Upload both front and back RC images with validation
Independent OCR Processing: Extract raw text using Tesseract.js with confidence scoring
Independent AI-Powered Extraction: Use Google Gemini to parse structured data from images
Parallel Processing: OCR and LLM work independently for flexible data extraction
Comprehensive Metrics: Track accuracy, processing time, and success rates for each method
Responsive Design: Clean, minimal interface that works on all devices
Real-time Progress: Visual progress indicators for processing status

🛠 Technologies Used

Frontend: TypeScript + React 19 + Vite
OCR: Tesseract.js
AI: Google Generative AI (Gemini)
Styling: Vanilla CSS with CSS Grid & Flexbox
Build Tool: Vite
Type Safety: TypeScript with strict configuration

📋 RC Data Fields Extracted

Vehicle Registration Number
Owner Name
Vehicle Class
Fuel Type
Engine Number
Chassis Number
Registration Date
Validity Date
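
As an illustration, the eight fields above could be modelled as a single record type. This is only a sketch: the interface name, property names, and the use of null for "not extracted" are assumptions, and the actual definitions in src/types/index.ts may differ.

```typescript
// Hypothetical shape for the eight extracted fields (names are illustrative).
interface RCData {
  registrationNumber: string | null;
  ownerName: string | null;
  vehicleClass: string | null;
  fuelType: string | null;
  engineNumber: string | null;
  chassisNumber: string | null;
  registrationDate: string | null;
  validityDate: string | null;
}

// Returns a record with every field unset, ready to be filled by OCR or LLM output.
function emptyRCData(): RCData {
  return {
    registrationNumber: null,
    ownerName: null,
    vehicleClass: null,
    fuelType: null,
    engineNumber: null,
    chassisNumber: null,
    registrationDate: null,
    validityDate: null,
  };
}
```

Using null rather than an empty string makes it easy to tell a field that was never found apart from one that was read as blank.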

🚀 Getting Started

Prerequisites

Node.js 18+
npm or yarn
Google AI API key

Installation

Clone the repository

```
git clone <repository-url>
cd rc-text-extract
```

Install dependencies
```
npm install
```
Configure environment variables
```
cp .env.example .env
```
Edit .env and add your Google AI API key:
```
VITE_GOOGLE_API_KEY=your_actual_api_key_here
```
Start development server
```
npm run dev
```
Open in browser: navigate to http://localhost:5173

Getting Google AI API Key

Visit Google AI Studio
Sign in with your Google account
Create a new API key
Copy the key and add it to your .env file

📁 Project Structure

src/
├── components/
│   ├── ImageUpload.tsx       # Image upload with validation
│   ├── ImageUpload.css
│   ├── OCRSection.tsx        # Tesseract OCR processing
│   ├── OCRSection.css
│   ├── LLMSection.tsx        # Google Gemini integration
│   ├── LLMSection.css
│   ├── MetricsPanel.tsx      # Performance metrics
│   └── MetricsPanel.css
├── types/
│   └── index.ts              # TypeScript interfaces
├── App.tsx                   # Main application component
├── App.css                   # Main layout styles
├── index.css                 # Global styles
└── main.tsx                  # Application entry point

🎯 Usage Guide

Step 1: Upload Images

Click to upload RC front image
Click to upload RC back image
Supported formats: JPG, PNG (max 10MB each)
Both images required for processing
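
The format and size rules above could be checked with a small helper like the following. It is a minimal sketch, not the code in ImageUpload.tsx: the function name, the FileLike type, and the error messages are assumptions. A browser File object has the same type and size properties, so it can be passed in directly.

```typescript
// Limits taken from the upload rules: JPG or PNG, at most 10MB each.
const MAX_BYTES = 10 * 1024 * 1024;
const ALLOWED_TYPES = ["image/jpeg", "image/png"];

// Minimal subset of the browser File interface needed for validation.
interface FileLike {
  type: string; // MIME type, e.g. "image/png"
  size: number; // size in bytes
}

// Returns an error message, or null when the file is acceptable.
function validateImage(file: FileLike): string | null {
  if (!ALLOWED_TYPES.includes(file.type)) {
    return "Unsupported format: use JPG or PNG";
  }
  if (file.size > MAX_BYTES) {
    return "File exceeds the 10MB limit";
  }
  return null;
}
```

Processing would then be enabled only when both the front and back images return null.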

Step 2: Processing Options

Choose your preferred extraction method(s):

Option A: OCR Extraction

Click "Extract Text with OCR"
Wait for Tesseract to process both images
View extracted raw text and confidence score
Get parsed structured data from OCR
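
One way to turn raw OCR text into structured fields is to look for a label on each line and capture what follows it. The sketch below assumes labels such as "Regn. No" and "Owner Name"; these patterns and the function name are hypothetical, real RC layouts vary, and the parser in OCRSection.tsx may work differently.

```typescript
// Hypothetical label patterns; each captures the text that follows the label.
const FIELD_LABELS: Record<string, RegExp> = {
  registrationNumber: /Reg(?:istration|n)?\.?\s*No\.?\s*[:\-]?\s*(.+)/i,
  ownerName: /Owner(?:'s)?\s*Name\s*[:\-]?\s*(.+)/i,
  fuelType: /Fuel(?:\s*Type)?\s*[:\-]?\s*(.+)/i,
  engineNumber: /Engine\s*No\.?\s*[:\-]?\s*(.+)/i,
  chassisNumber: /Chassis\s*No\.?\s*[:\-]?\s*(.+)/i,
};

// Scans the text line by line and keeps the first match for each field.
// Fields with no matching line are reported as null.
function parseOcrText(rawText: string): Record<string, string | null> {
  const lines = rawText.split(/\r?\n/);
  const result: Record<string, string | null> = {};
  for (const [field, pattern] of Object.entries(FIELD_LABELS)) {
    result[field] = null;
    for (const line of lines) {
      const match = pattern.exec(line);
      if (match) {
        result[field] = match[1].trim();
        break;
      }
    }
  }
  return result;
}
```

Because OCR output is noisy, a label-based approach like this tends to miss fields whose labels were misread, which is one reason to compare it against the LLM result.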

Option B: LLM Extraction

Click "Extract Data with LLM"
Google Gemini processes the uploaded images directly
Get structured data extraction with confidence scores
Independent of OCR processing
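
LLM replies are not guaranteed to be bare JSON; the model may surround the object with prose or a code fence. A defensive parser, sketched below under that assumption, takes the text between the first "{" and the last "}" and returns null instead of throwing. The function name is hypothetical and this is not necessarily how LLMSection.tsx handles replies.

```typescript
// Extracts and parses the outermost JSON object from a model reply.
// Returns null when no object is found or the JSON is malformed.
function parseLlmJson(reply: string): Record<string, unknown> | null {
  const start = reply.indexOf("{");
  const end = reply.lastIndexOf("}");
  if (start === -1 || end <= start) {
    return null;
  }
  try {
    const parsed: unknown = JSON.parse(reply.slice(start, end + 1));
    // Arrays and primitives are rejected: only a plain object is useful here.
    if (typeof parsed === "object" && parsed !== null && !Array.isArray(parsed)) {
      return parsed as Record<string, unknown>;
    }
    return null;
  } catch {
    return null;
  }
}
```

A null result can then be surfaced to the user as a JSON parsing error rather than crashing the component.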

Option C: Both Methods

Run both OCR and LLM independently
Compare results and accuracy between methods
Choose the best extraction for your needs
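
Running both methods independently means a failure in one should not block the other. The following sketch wraps each method so it always resolves with an outcome and a processing time; runOcr and runLlm are placeholder callbacks standing in for the real Tesseract.js and Gemini calls, and all names here are assumptions.

```typescript
// Result of one extraction method, whether it succeeded or failed.
interface MethodOutcome<T> {
  ok: boolean;
  value: T | null;
  error: string | null;
  elapsedMs: number;
}

// Runs a task, measures its duration, and converts a rejection into an outcome.
async function timed<T>(task: () => Promise<T>): Promise<MethodOutcome<T>> {
  const started = Date.now();
  try {
    const value = await task();
    return { ok: true, value, error: null, elapsedMs: Date.now() - started };
  } catch (err) {
    const error = err instanceof Error ? err.message : String(err);
    return { ok: false, value: null, error, elapsedMs: Date.now() - started };
  }
}

// Starts both methods at once; timed() never rejects, so Promise.all is safe.
async function runBoth<A, B>(
  runOcr: () => Promise<A>,
  runLlm: () => Promise<B>
): Promise<{ ocr: MethodOutcome<A>; llm: MethodOutcome<B> }> {
  const [ocr, llm] = await Promise.all([timed(runOcr), timed(runLlm)]);
  return { ocr, llm };
}
```

With this shape, an invalid API key would show up as a failed llm outcome while the ocr outcome still carries its text and timing.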

Step 3: Analyze Results

Review processing metrics for each method
Compare OCR vs LLM performance (if both used)
View extracted data fields and confidence scores
Check processing times and success rates

🔧 Configuration

Environment Variables

VITE_GOOGLE_API_KEY=your_google_api_key
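
A missing key, or one still set to the placeholder, can be caught before any API call is made. This is a sketch only: the helper name and error text are assumptions, and the placeholder strings are the ones shown in this README.

```typescript
// Placeholder values from the README that should never be used as a real key.
const PLACEHOLDERS = ["your_google_api_key", "your_actual_api_key_here"];

// Returns the trimmed key, or throws when it is absent or a placeholder.
function requireApiKey(env: Record<string, string | undefined>): string {
  const key = (env.VITE_GOOGLE_API_KEY ?? "").trim();
  if (key === "" || PLACEHOLDERS.includes(key)) {
    throw new Error("VITE_GOOGLE_API_KEY is missing or still set to the placeholder value");
  }
  return key;
}
```

In a Vite app the helper could be called as requireApiKey(import.meta.env), since Vite exposes VITE_-prefixed variables on that object.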

Build Configuration

TypeScript: Strict mode enabled
Vite: Modern build tool with HMR
ESLint: Code quality and consistency
CSS: Modern CSS features with vendor prefixes

📱 Responsive Design

Desktop: Side-by-side processing sections
Tablet: Stacked layout with full-width components
Mobile: Single-column layout with optimized spacing

🔒 Security & Privacy

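OCR runs entirely client-side in the browser; images leave the device only when LLM extraction sends them to the Google AI API
Google AI API calls use HTTPS encryption
No image data is stored permanently
The API key is supplied via an environment variable rather than hard-coded; because Vite exposes VITE_-prefixed variables to client code, the key is visible in the browser bundle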

🚦 Error Handling

Image format validation
File size limits
OCR processing failures
API connectivity issues
Invalid API key detection
JSON parsing errors

📊 Performance Metrics

The application tracks metrics for each processing method independently:

OCR Metrics

OCR confidence percentage
OCR processing time
Text extraction accuracy
Parsed field completion rate

LLM Metrics

LLM extraction confidence
LLM processing time
Direct extraction accuracy
Structured data completeness

Comparison Features

Side-by-side performance comparison
Method-specific success indicators
Individual processing status tracking
Field extraction completion rates
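
A field completion rate can be computed as the share of fields that hold a non-empty value. The sketch below shows one possible definition; the function name and the rounding to a whole percentage are assumptions, not necessarily what MetricsPanel.tsx does.

```typescript
// Percentage (0 to 100) of fields that contain a non-blank string.
function completionRate(fields: Record<string, string | null | undefined>): number {
  const values = Object.values(fields);
  if (values.length === 0) {
    return 0;
  }
  const filled = values.filter(
    (v) => typeof v === "string" && v.trim() !== ""
  ).length;
  return Math.round((filled / values.length) * 100);
}
```

Applying the same function to the OCR and LLM results gives directly comparable numbers for the side-by-side view.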

🛠 Development Scripts

```
# Development server
npm run dev

# Build for production
npm run build

# Preview production build
npm run preview

# Lint code
npm run lint

# Type checking
npx tsc --noEmit
```

🤝 Contributing

Fork the repository
Create a feature branch
Make your changes with proper TypeScript types
Add CSS for any new components
Test thoroughly
Submit a pull request

📝 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

Tesseract.js for OCR capabilities
Google AI for Generative AI
Vite for build tooling
React for the UI framework

📞 Support

For issues and questions:

Check the existing issues
Create a new issue with detailed description
Include screenshots for UI problems
Provide console logs for technical issues

Built with ❤️ using TypeScript, React, and modern web technologies.
