Collaborative research project exploring optimal strategies for adapting foundation models to downstream vision tasks through efficient layer-wise feature extraction and fine-tuning.
Authors: Abdelrahman Werby, Jil Panter, Sejal Jadhav
Institution: University of Freiburg
Framework: PyTorch | AutoML
DeepTune is an AutoML-driven framework for efficiently adapting pre-trained foundation models (DINOv2) to diverse computer vision tasks. The project systematically explores the trade-offs between different fine-tuning strategies, layer selection, and computational efficiency across multiple datasets.
- GPU Hours: ~1,000 on an RTX 3090
- Datasets Evaluated: 4 diverse vision tasks
- Hyperparameter Configurations: 10+ strategies tested
- Fine-tuning Approaches: Intermediate layers, head-tuning, full fine-tuning
| Model | Fashion | Emotions | Flowers | Skin Cancer |
|---|---|---|---|---|
| QuickTune | 0.78 | 0.32 | 0.14 | 0.75 |
| DINOv2 + LP (linear probe) | 0.83 | 0.37 | 1.00 | 0.60 |
| DeepTune | 0.88 | 0.44 | 0.99 | 0.80 |
DeepTune leverages DINOv2 as a foundation model and implements:
- Search Space Exploration:
  - Multiple fine-tuning strategies (LoRA, full fine-tuning, adapter layers)
  - Layer-wise feature extraction optimization
  - Hyperparameter tuning (learning rates, optimizers, schedulers)
- Efficient Adaptors (see the sketch after this list):
  - Selective layer freezing/unfreezing
  - Task-specific head adaptation
  - Computational budget constraints
- AutoML Pipeline:
  - Automated search for optimal layer configurations
  - Performance-efficiency trade-off analysis
  - Time-budget aware model selection
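The freezing-plus-head pattern behind the Efficient Adaptors component, as a minimal sketch. It assumes the torch.hub DINOv2 release (`dinov2_vits14`); the number of unfrozen blocks and the head size are illustrative choices, not DeepTune's actual configuration:

```python
import torch
import torch.nn as nn

class DinoClassifier(nn.Module):
    """DINOv2 backbone with a task-specific linear head (sketch)."""

    def __init__(self, backbone: nn.Module, num_classes: int, unfreeze_last: int = 2):
        super().__init__()
        self.backbone = backbone
        # Selective layer freezing: freeze the whole backbone first...
        for p in self.backbone.parameters():
            p.requires_grad = False
        # ...then unfreeze only the last few transformer blocks.
        if unfreeze_last > 0:
            for block in self.backbone.blocks[-unfreeze_last:]:
                for p in block.parameters():
                    p.requires_grad = True
        # Task-specific head on the CLS embedding that forward() returns.
        self.head = nn.Linear(self.backbone.embed_dim, num_classes)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.head(self.backbone(x))

# Usage: ViT-S/14 backbone with a 10-way head (e.g. Fashion-MNIST).
backbone = torch.hub.load("facebookresearch/dinov2", "dinov2_vits14")
model = DinoClassifier(backbone, num_classes=10)
```

Keeping most parameters frozen means only the head and the unfrozen blocks consume optimizer state and gradient memory, which is where the efficiency gains over full fine-tuning come from.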
To create a conda environment with the necessary dependencies:
```bash
git clone https://github.com/sejal-prog/DeepTune.git
cd DeepTune
conda env create -f env.yaml
conda activate deeptune
```

Project structure:

```
DeepTune/
├── config/        # Configuration files for experiments
├── deeptune/      # Core implementation
│   ├── models/    # Model architectures
│   ├── search/    # AutoML search logic
│   └── utils/     # Helper functions
├── data/          # Dataset directory (auto-downloaded)
├── env.yaml       # Conda environment
├── tune.py        # Main training script
└── test.py        # Evaluation script
```
Configure your experiment in config/deep_tune_config.yaml, then run:
```bash
python tune.py
```

This will:
- Download datasets automatically (if not present)
- Execute AutoML search across the defined search space
- Save configurations and model checkpoints
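Conceptually, the search stage iterates candidate configurations under a time budget and keeps the best performer. A minimal sketch with hypothetical names (`search_space` and `train_and_eval` stand in for DeepTune's actual config and training routine):

```python
import itertools
import time

# Hypothetical search space; the real one is defined in
# config/deep_tune_config.yaml.
search_space = {
    "strategy": ["linear_probe", "lora", "full_finetune"],
    "feature_layer": [4, 8, 12],
    "lr": [1e-4, 3e-4, 1e-3],
}

def train_and_eval(config: dict) -> float:
    """Placeholder for DeepTune's train/validate routine (hypothetical)."""
    return 0.0  # would return validation accuracy

budget_seconds = 4 * 3600  # time-budget aware model selection
start = time.time()
best_config, best_score = None, float("-inf")

for values in itertools.product(*search_space.values()):
    if time.time() - start > budget_seconds:
        break  # stop once the computational budget is spent
    config = dict(zip(search_space, values))
    score = train_and_eval(config)
    if score > best_score:
        best_config, best_score = config, score  # checkpoint saved here

print("best config:", best_config, "score:", best_score)
```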
After search completes, update config/test_config.yaml with the best configuration number and run:
```bash
python test.py
```

This generates predictions and saves them to `data/exam_dataset/predictions.npy`.
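To sanity-check the output, the predictions file can be inspected with NumPy; the assumption here (not confirmed by the source) is that it stores one class index per test sample:

```python
import numpy as np

preds = np.load("data/exam_dataset/predictions.npy")
print(preds.shape, preds.dtype)        # expected: one entry per test sample
print(np.bincount(preds.astype(int)))  # how often each class was predicted
```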
We evaluated DeepTune on 4 diverse datasets:
| Dataset | Classes | Train Samples | Test Samples | Channels | Resolution |
|---|---|---|---|---|---|
| Fashion-MNIST | 10 | 60,000 | 10,000 | 1 | 28×28 |
| Flowers | 102 | 5,732 | 2,457 | 3 | 512×512 |
| Emotions | 7 | 28,709 | 7,178 | 1 | 48×48 |
| Skin Cancer | 7 | 7,010 | 3,005 | 3 | 450×450 |
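Two of these datasets are single-channel, while DINOv2's ViT backbones expect 3-channel inputs whose side lengths are multiples of the 14-pixel patch size. A plausible preprocessing pipeline for the grayscale datasets (an assumption, not DeepTune's exact transforms):

```python
from torchvision import transforms

# Illustrative preprocessing for Fashion-MNIST and Emotions: replicate
# the single channel to 3 and normalize with ImageNet statistics.
grayscale_to_dinov2 = transforms.Compose([
    transforms.Grayscale(num_output_channels=3),  # 1 channel -> 3 copies
    transforms.Resize(256),
    transforms.CenterCrop(224),                   # 224 = 16 x 14 patches
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406],
                         std=[0.229, 0.224, 0.225]),
])
```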
Key findings:

- Layer-wise feature extraction from DINOv2 significantly improves efficiency over full fine-tuning while maintaining competitive accuracy.
- Optimal layer selection varies with task complexity: shallow tasks benefit from early layers, while complex tasks require deeper features (see the sketch after this list).
- Fine-tuning strategy trade-offs: full fine-tuning achieves the highest accuracy but requires 5-10× more compute than adapter-based approaches.
- Search space design matters: constraining the search to promising regions (based on task characteristics) reduces AutoML time by ~40%.
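The layer-selection findings rely on reading features out of intermediate transformer blocks rather than only the final one. The torch.hub DINOv2 models expose this via `get_intermediate_layers`; the block indices below are illustrative, not the ones DeepTune selects:

```python
import torch

backbone = torch.hub.load("facebookresearch/dinov2", "dinov2_vits14")
backbone.eval()

x = torch.randn(1, 3, 224, 224)  # dummy image batch
with torch.no_grad():
    # Patch-token features from four specific blocks (early to late);
    # each tensor has shape (batch, num_patches, embed_dim).
    feats = backbone.get_intermediate_layers(x, n=[2, 5, 8, 11])

for idx, f in zip([2, 5, 8, 11], feats):
    print(f"block {idx}: {tuple(f.shape)}")
```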
- Framework: PyTorch
- Foundation Model: DINOv2
- AutoML: Custom search implementation
- Optimization: AdamW and SGD optimizers, LoRA adapters, various learning-rate schedules
- Experiment Tracking: Configuration logging, model checkpointing
This project was developed as part of the AutoML course at the University of Freiburg.
Contributors:
- Abdelrahman Werby
- Jil Panter
- Sejal Jadhav