
AI Image Analysis Service

A production-ready asynchronous image analysis service built with FastAPI, Apache Kafka, Celery, and HuggingFace Transformers. This service provides scalable and resilient AI-driven image analysis capabilities including classification and object detection.

Features

  • Asynchronous Processing: Non-blocking API with distributed task queue
  • Event-Driven Architecture: Apache Kafka for reliable message distribution
  • Scalable Workers: Celery workers with Redis backend
  • AI-Powered Analysis: Pre-trained HuggingFace models for image classification and object detection
  • Robust Error Handling: Automatic retries with exponential backoff
  • Containerized Deployment: Complete Docker Compose orchestration
  • RESTful API: FastAPI with comprehensive validation
  • Job Status Tracking: SQLite database for job metadata and results

Architecture

Figure 1: System architecture showing data flow, component interactions, and technology stack

Tech Stack

  • API Framework: FastAPI
  • Message Queue: Apache Kafka + Zookeeper
  • Task Queue: Celery
  • Cache/Broker: Redis
  • Database: SQLite
  • AI Models: HuggingFace Transformers
  • Image Processing: Pillow
  • ML Framework: PyTorch
  • Containerization: Docker + Docker Compose

Installation

Prerequisites

  • Docker and Docker Compose
  • Python 3.12+ (for local development)
  • At least 4GB RAM available for containers

Quick Start with Docker Compose

  1. Clone the repository:
git clone https://github.com/AkhileshMalthi/AI-image-analysis-service.git
cd AI-image-analysis-service
  2. Create environment file (optional):
cp .env.example .env
# Edit .env if needed
  3. Start all services:
docker-compose up --build

This will start:

  • Zookeeper (port 2181)
  • Kafka (port 9092)
  • Redis (port 6379)
  • FastAPI API (port 8000)
  • Celery Worker

  4. Access the API documentation:
http://localhost:8000/docs

Local Development Setup

  1. Create virtual environment:
python -m venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate
  2. Install dependencies:
pip install -r requirements.txt
# or with uv
uv pip install -r requirements.txt
  3. Set up environment variables:
cp .env.example .env
  4. Initialize database:
python -c "from src.db import create_db_and_tables; create_db_and_tables()"
  5. Run services separately:

Terminal 1 - API Server:

uvicorn src.main:app --reload --host 0.0.0.0 --port 8000

Terminal 2 - Kafka Consumer:

python -m src.tasks

Terminal 3 - Celery Worker:

celery -A src.worker.celery_app worker -l info --concurrency=2

API Documentation

Endpoints

1. Upload Image for Analysis

POST /upload-image

Upload an image for asynchronous analysis.

Request:

  • file: Image file (JPEG/PNG, max 10MB)
  • analysis_type: "classification" or "object_detection"

Response (202 Accepted):

{
  "job_id": "550e8400-e29b-41d4-a716-446655440000",
  "status": "PENDING",
  "message": "Image uploaded successfully and queued for analysis"
}

Example with curl:

curl -X POST "http://localhost:8000/upload-image" \
  -F "file=@image.jpg" \
  -F "analysis_type=classification"

Example with Python:

import requests

url = "http://localhost:8000/upload-image"
with open("image.jpg", "rb") as f:
    response = requests.post(
        url,
        files={"file": f},
        data={"analysis_type": "classification"},
    )
print(response.json())

2. Get Job Status

GET /job-status/{job_id}

Retrieve the status and results of a job.

Response (200 OK):

{
  "job_id": "550e8400-e29b-41d4-a716-446655440000",
  "status": "COMPLETED",
  "message": "Job is completed",
  "results": {
    "analysis_type": "classification",
    "predictions": [
      {"label": "golden retriever", "score": 0.9234},
      {"label": "Labrador retriever", "score": 0.0456}
    ]
  },
  "error_message": null,
  "created_at": "2026-01-22T10:30:00",
  "updated_at": "2026-01-22T10:30:15"
}

Example:

curl http://localhost:8000/job-status/550e8400-e29b-41d4-a716-446655440000

Job Status States

  • PENDING: Job created and waiting for processing
  • PROCESSING: Worker is currently analyzing the image
  • COMPLETED: Analysis finished successfully
  • FAILED: Analysis failed (see error_message)
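Clients are expected to poll the status endpoint until the job reaches a terminal state. A minimal polling sketch using only the standard library (host, port, and timing values here are assumptions for illustration, not service defaults):

```python
import json
import time
import urllib.request

TERMINAL_STATES = {"COMPLETED", "FAILED"}

def is_terminal(status: str) -> bool:
    """True once a job will no longer change state."""
    return status in TERMINAL_STATES

def poll_job(job_id: str, base_url: str = "http://localhost:8000",
             interval: float = 2.0, timeout: float = 60.0) -> dict:
    """Poll /job-status/{job_id} until COMPLETED or FAILED, or raise on timeout."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        with urllib.request.urlopen(f"{base_url}/job-status/{job_id}") as resp:
            job = json.load(resp)
        if is_terminal(job["status"]):
            return job
        time.sleep(interval)
    raise TimeoutError(f"job {job_id} not finished after {timeout}s")
```

For production clients, a webhook or server-sent events would avoid polling entirely, but the polling loop matches the API as documented.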

Testing

Run the test suite:

# Run all tests
pytest

# Run with coverage
pytest --cov=src --cov-report=html

# Run specific test file
pytest tests/test_api.py -v

# Run specific test
pytest tests/test_api.py::test_upload_image_success -v

Test Coverage

  • API Endpoints: Input validation, status codes, Kafka integration
  • Celery Tasks: Job processing, error handling, retries
  • AI Inference: Model loading, classification, object detection
  • Utilities: Image validation, file handling

AI Models

Classification

Model: google/vit-base-patch16-224 (Vision Transformer)

  • Returns top-5 predictions with confidence scores
  • Supports 1000 ImageNet classes

Object Detection

Model: facebook/detr-resnet-50 (DETR)

  • Detects multiple objects in images
  • Returns bounding boxes and labels
  • Confidence threshold: 0.5
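Both models can be exercised directly through the HuggingFace pipeline API. The sketch below is illustrative, not the service's actual inference code; the pipeline call assumes transformers, torch, and Pillow are installed, and the filter applies the same 0.5 confidence threshold:

```python
def filter_detections(detections: list[dict], threshold: float = 0.5) -> list[dict]:
    """Keep only predictions at or above the confidence threshold (default 0.5)."""
    return [d for d in detections if d["score"] >= threshold]

if __name__ == "__main__":
    # Requires: pip install transformers torch pillow
    from transformers import pipeline

    detector = pipeline("object-detection", model="facebook/detr-resnet-50")
    detections = detector("image.jpg")  # each item has "label", "score", and "box"
    print(filter_detections(detections))
```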

Custom Models

Override default models using environment variables:

CLASSIFICATION_MODEL=microsoft/resnet-50
OBJECT_DETECTION_MODEL=facebook/detr-resnet-101
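A minimal sketch of how such overrides might be resolved in code (the helper name is hypothetical; the service's actual config module may differ):

```python
import os

def model_for(env_var: str, default: str) -> str:
    """Resolve a model name from an environment variable, falling back to the default."""
    return os.getenv(env_var, default)

classification_model = model_for("CLASSIFICATION_MODEL", "google/vit-base-patch16-224")
object_detection_model = model_for("OBJECT_DETECTION_MODEL", "facebook/detr-resnet-50")
```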

Configuration

All configuration is managed through environment variables. See .env.example for available options.

Key Configuration Options

| Variable | Default | Description |
|----------|---------|-------------|
| KAFKA_BOOTSTRAP_SERVERS | localhost:9092 | Kafka broker address |
| KAFKA_TOPIC | image-analysis-jobs | Topic for job messages |
| REDIS_BROKER_URL | redis://localhost:6379/0 | Celery broker URL |
| DATABASE_URL | sqlite:///./data/jobs.db | Database connection string |
| MAX_FILE_SIZE | 10485760 | Max upload size (10MB) |
| CELERY_TASK_MAX_RETRIES | 3 | Max retry attempts |
| HUGGINGFACE_AUTH_TOKEN | None | HF token for private models |

Error Handling

Retry Strategy

The system distinguishes transient from permanent errors and retries only the former:

  • Transient Errors (network issues, temporary unavailability):

    • Automatic retry with exponential backoff
    • Maximum 3 retry attempts
    • Backoff: 60s, 120s, 240s
  • Permanent Errors (invalid image, model errors):

    • No retry
    • Job marked as FAILED
    • Detailed error message stored
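The schedule above is plain exponential doubling from a 60-second base. A sketch of the arithmetic, with the Celery wiring shown in comments since the service's actual task and exception names are not reproduced here:

```python
def backoff_delay(retry_count: int, base: int = 60) -> int:
    """Exponential backoff: 60s, 120s, 240s for retries 0, 1, 2."""
    return base * (2 ** retry_count)

# In a Celery task, this would be wired up roughly like (names illustrative):
#
# @celery_app.task(bind=True, max_retries=3)
# def analyze_image(self, job_id):
#     try:
#         ...  # fetch image, run inference, store results
#     except TransientError as exc:
#         raise self.retry(exc=exc, countdown=backoff_delay(self.request.retries))
#     # Permanent errors propagate, and the job is marked FAILED.
```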

Common Error Scenarios

  1. Image Too Large: HTTP 400 - Reduce image size
  2. Invalid Format: HTTP 400 - Use JPEG/PNG
  3. Job Not Found: HTTP 404 - Check job_id
  4. Kafka Unavailable: HTTP 500 - Check Kafka service
  5. Model Loading Failed: Job FAILED - Check HuggingFace connectivity

Monitoring

Health Check

curl http://localhost:8000/health

Celery Monitoring

View worker status:

celery -A src.worker.celery_app inspect active
celery -A src.worker.celery_app inspect stats

Logs

View service logs:

# All services
docker-compose logs -f

# Specific service
docker-compose logs -f api
docker-compose logs -f worker
docker-compose logs -f kafka

Production Deployment

Scaling

Scale Celery workers:

docker-compose up --scale worker=5

Database Migration

For production, consider migrating to PostgreSQL:

DATABASE_URL=postgresql://user:pass@localhost/dbname

Security Considerations

  1. Use environment-specific .env files
  2. Secure Kafka with SSL/SASL
  3. Enable Redis authentication
  4. Implement API rate limiting
  5. Use reverse proxy (Nginx) for API
  6. Set up monitoring (Prometheus + Grafana)

Project Structure

├── src/
│   ├── main.py                 # FastAPI application
│   ├── worker.py              # Celery worker setup
│   ├── tasks.py               # Celery tasks
│   ├── db.py                  # Database operations
│   ├── kafka_producer.py      # Kafka producer
│   ├── config.py              # Configuration
│   ├── utils.py               # Utility functions
│   └── models/
│       ├── __init__.py        # SQLModel definitions
│       └── schemas.py         # Pydantic schemas
├── tests/
│   ├── test_api.py           # API endpoint tests
│   ├── test_tasks.py         # Celery task tests
│   └── test_inference.py     # AI inference tests
├── Dockerfile.api            # API service Dockerfile
├── Dockerfile.worker         # Worker service Dockerfile
├── docker-compose.yml        # Service orchestration
├── requirements.txt          # Python dependencies
├── pyproject.toml           # Project metadata
├── .env.example             # Environment template
└── README.md                # This file

Contributing

Contributions are welcome! Please follow these guidelines:

  1. Fork the repository
  2. Create a feature branch
  3. Write tests for new features
  4. Ensure all tests pass
  5. Submit a pull request

License

This project is licensed under the MIT License - see the LICENSE file for details.

Design Decisions

Why Kafka + Celery?

  • Kafka: Provides reliable message queuing with durability and replay capabilities
  • Celery: Handles distributed task execution with advanced retry logic
  • Separation of Concerns: Kafka for message routing, Celery for task execution

Why SQLite?

  • Simple deployment for demonstration
  • Easy to migrate to PostgreSQL/MySQL for production
  • Sufficient for moderate workloads

Model Caching

Models are loaded once per worker process and cached in memory, avoiding:

  • Repeated model downloads
  • Per-task initialization overhead
  • Memory bloat from duplicate model instances

Support

For issues and questions:

  • Open an issue on GitHub
  • Check existing documentation
  • Review test cases for examples

Acknowledgments

  • HuggingFace for pre-trained models
  • FastAPI framework
  • Apache Kafka community
  • Celery project
