🖼️ PyTorch Image Classifier

A comprehensive, step-by-step deep learning project for image classification using PyTorch. Learn to build, train, and deploy CNN models from scratch, and apply transfer learning for state-of-the-art results.

✨ Features

📚 8 Progressive Learning Steps - From data loading to deploying on your own images
🧠 Custom CNN Architecture - Build a convolutional neural network from scratch
🔄 Transfer Learning - Use pre-trained ResNet18 for ~90% accuracy
📈 Data Augmentation - Boost performance with image transformations
⚡ GPU Support - Automatic CUDA/MPS detection for fast training
🖼️ Classify Your Own Images - Use the trained model on any image

📋 Learning Path

Step	File	What You'll Learn	Difficulty
1	`steps/step1_data_loading.py`	Datasets, transforms, DataLoaders	⭐
2	`steps/step2_build_model.py`	CNN architecture (Conv, Pool, FC layers)	⭐⭐
3	`steps/step3_train_model.py`	Training loop, loss functions, optimizers	⭐⭐
4	`steps/step4_evaluate_and_predict.py`	Evaluation, predictions, confusion matrix	⭐⭐
5	`steps/step5_data_augmentation.py`	Image augmentation techniques	⭐⭐
6	`steps/step6_transfer_learning.py`	Pre-trained models, fine-tuning	⭐⭐⭐
7	`steps/step7_learning_rate_scheduler.py`	Learning rate scheduling strategies	⭐⭐⭐
8	`steps/step8_your_own_images.py`	Classify your own images!	⭐

🚀 Quick Start

1. Clone the Repository

git clone https://github.com/Anishyou/Imageclassifier.git
cd Imageclassifier

2. Install Dependencies

pip install -r requirements.txt

3. Run the Steps

cd steps

# Learn the fundamentals
python step1_data_loading.py      # Understand data loading
python step2_build_model.py       # Explore CNN architecture

# Train and evaluate
python step3_train_model.py       # Train the model (~10 min CPU, ~2 min GPU)
python step4_evaluate_and_predict.py  # See results

# Advanced techniques
python step5_data_augmentation.py     # Data augmentation
python step6_transfer_learning.py     # Transfer learning with ResNet
python step7_learning_rate_scheduler.py  # LR scheduling

# Use on your own images
python step8_your_own_images.py   # Classify any image!

📊 Dataset: CIFAR-10

Property	Value
Total Images	60,000 (50k train, 10k test)
Image Size	32×32 RGB
Classes	10

Classes: ✈️ airplane, 🚗 automobile, 🐦 bird, 🐱 cat, 🦌 deer, 🐕 dog, 🐸 frog, 🐴 horse, 🚢 ship, 🚚 truck

🏗️ Model Architecture

Custom CNN (from scratch)

Input (3×32×32)
    ↓
Conv1 (32 filters) → BatchNorm → ReLU → MaxPool → (32×16×16)
    ↓
Conv2 (64 filters) → BatchNorm → ReLU → MaxPool → (64×8×8)
    ↓
Conv3 (128 filters) → BatchNorm → ReLU → MaxPool → (128×4×4)
    ↓
Flatten (2048)
    ↓
FC1 (256) → ReLU → Dropout(0.5)
    ↓
FC2 (10) → Output (class scores)

Parameters: ~596K trainable parameters

📈 Results

Model	Accuracy	Training Time
Custom CNN (10 epochs)	~70-75%	~10 min (GPU)
With Data Augmentation	~75-80%	~15 min (GPU)
Transfer Learning (ResNet18)	~85-92%	~20 min (GPU)

💾 Trained Models

Pre-trained model weights are included in the models/ folder:

File	Description	How to Use
`models/best_model.pth`	Custom CNN trained on CIFAR-10	Load with `step2_build_model.ImageClassifier`
`models/feature_extractor_best.pth`	ResNet18 transfer learning	Load with `torchvision.models.resnet18`

Loading a Trained Model

import torch
import sys
sys.path.append('steps')
from step2_build_model import ImageClassifier

# Load custom CNN
model = ImageClassifier(num_classes=10)
checkpoint = torch.load('models/best_model.pth')
model.load_state_dict(checkpoint['model_state_dict'])
model.eval()

# Check accuracy achieved
print(f"Best accuracy: {checkpoint['best_acc']:.2f}%")

🖼️ Classify Your Own Images

cd steps
python step8_your_own_images.py --image path/to/your/image.jpg

Or use in code (from project root):

from PIL import Image
import torch
import torchvision.transforms as transforms
import sys
sys.path.append('steps')
from step2_build_model import ImageClassifier

# Classes
classes = ('airplane', 'automobile', 'bird', 'cat', 'deer',
           'dog', 'frog', 'horse', 'ship', 'truck')

# Load model
model = ImageClassifier(num_classes=10)
checkpoint = torch.load('models/best_model.pth')
model.load_state_dict(checkpoint['model_state_dict'])
model.eval()

# Prepare image
transform = transforms.Compose([
    transforms.Resize((32, 32)),
    transforms.ToTensor(),
    transforms.Normalize((0.4914, 0.4822, 0.4465), (0.2470, 0.2435, 0.2616))
])

image = Image.open('your_image.jpg').convert('RGB')
input_tensor = transform(image).unsqueeze(0)

# Predict
with torch.no_grad():
    output = model(input_tensor)
    _, predicted = output.max(1)
    
print(f"Prediction: {classes[predicted.item()]}")

📁 Project Structure

Imageclassifier/
├── 📂 steps/                         # All learning step files
│   ├── step1_data_loading.py         # Data loading tutorial
│   ├── step2_build_model.py          # CNN architecture
│   ├── step3_train_model.py          # Training loop
│   ├── step4_evaluate_and_predict.py # Evaluation
│   ├── step5_data_augmentation.py    # Augmentation
│   ├── step6_transfer_learning.py    # Transfer learning
│   ├── step7_learning_rate_scheduler.py  # LR scheduling
│   └── step8_your_own_images.py      # Use your own images
├── 📂 models/                        # Trained model weights
│   ├── best_model.pth                # Custom CNN weights
│   └── feature_extractor_best.pth    # Transfer learning weights
├── 📂 outputs/                       # Generated images & plots
│   └── (training curves, predictions, etc.)
├── 📂 data/                          # CIFAR-10 dataset (auto-downloaded)
├── 📄 requirements.txt               # Dependencies
├── 📄 LICENSE                        # MIT License
└── 📄 README.md                      # This file

💡 Key Concepts

🔹 Data Transforms

transform = transforms.Compose([
    transforms.ToTensor(),
    transforms.Normalize(mean, std)
])

🔹 Training Loop

for epoch in range(epochs):
    for images, labels in train_loader:
        optimizer.zero_grad()           # Reset gradients
        outputs = model(images)         # Forward pass
        loss = criterion(outputs, labels)  # Compute loss
        loss.backward()                 # Backward pass
        optimizer.step()                # Update weights

🔹 Evaluation Mode

model.eval()
with torch.no_grad():
    outputs = model(images)
    _, predicted = outputs.max(1)

🔹 Transfer Learning

from torchvision import models

# Load pre-trained ResNet18
model = models.resnet18(weights=models.ResNet18_Weights.IMAGENET1K_V1)

# Replace final layer for 10 classes
model.fc = nn.Linear(512, 10)

🛠️ Requirements

Python 3.8+
PyTorch 2.0+
torchvision
matplotlib
numpy
tqdm
Pillow

🤝 Contributing

Contributions are welcome! Feel free to:

Fork the repository
Create a feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'Add amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

📝 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

CIFAR-10 Dataset by Alex Krizhevsky
PyTorch for the amazing deep learning framework
torchvision for pre-trained models

Made with ❤️ for learning deep learning

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🖼️ PyTorch Image Classifier

✨ Features

📋 Learning Path

🚀 Quick Start

1. Clone the Repository

2. Install Dependencies

3. Run the Steps

📊 Dataset: CIFAR-10

🏗️ Model Architecture

Custom CNN (from scratch)

📈 Results

💾 Trained Models

Loading a Trained Model

🖼️ Classify Your Own Images

📁 Project Structure

💡 Key Concepts

🛠️ Requirements

🤝 Contributing

📝 License

🙏 Acknowledgments

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
models		models
steps		steps
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

License

Anishyou/Imageclassifier

Folders and files

Latest commit

History

Repository files navigation

🖼️ PyTorch Image Classifier

✨ Features

📋 Learning Path

🚀 Quick Start

1. Clone the Repository

2. Install Dependencies

3. Run the Steps

📊 Dataset: CIFAR-10

🏗️ Model Architecture

Custom CNN (from scratch)

📈 Results

💾 Trained Models

Loading a Trained Model

🖼️ Classify Your Own Images

📁 Project Structure

💡 Key Concepts

🛠️ Requirements

🤝 Contributing

📝 License

🙏 Acknowledgments

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages