Optimal Training Subset

The goal of this project is to identify, for each class in an image classification problem (e.g., FashionMNIST, CIFAR-10, or CIFAR-100), a subset of the most representative examples. The objective is to determine which images are sufficient to train a classifier that performs well and separates the classes as cleanly as possible.
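To make the idea concrete, here is a minimal sketch of subset selection by hill climbing, one of the search strategies listed in the project layout. It is not the project's implementation: the toy 2D data, the nearest-centroid "classifier", the fitness function with its `size_penalty`, and every function name are assumptions chosen so the example runs with the standard library only.

```python
import random


def make_toy_data(rng, n_per_class=30):
    """Two Gaussian blobs in 2D standing in for image features (hypothetical data)."""
    centers = {0: (0.0, 0.0), 1: (3.0, 3.0)}
    data = []
    for label, (cx, cy) in centers.items():
        for _ in range(n_per_class):
            data.append(((rng.gauss(cx, 1.0), rng.gauss(cy, 1.0)), label))
    rng.shuffle(data)
    return data


def fit_centroids(train, mask):
    """'Train' a nearest-centroid classifier on the selected examples only."""
    sums = {}
    for (point, label), keep in zip(train, mask):
        if keep:
            sx, sy, n = sums.get(label, (0.0, 0.0, 0))
            sums[label] = (sx + point[0], sy + point[1], n + 1)
    return {label: (sx / n, sy / n) for label, (sx, sy, n) in sums.items()}


def accuracy(centroids, val):
    """Fraction of validation points assigned to the correct class."""
    if not centroids:
        return 0.0  # an empty subset cannot classify anything
    correct = 0
    for (x, y), label in val:
        pred = min(
            centroids,
            key=lambda c: (x - centroids[c][0]) ** 2 + (y - centroids[c][1]) ** 2,
        )
        correct += pred == label
    return correct / len(val)


def fitness(train, val, mask, size_penalty=0.1):
    """Validation accuracy minus a penalty proportional to the subset size."""
    acc = accuracy(fit_centroids(train, mask), val)
    return acc - size_penalty * sum(mask) / len(mask)


def hill_climb(train, val, rng, steps=500):
    """Flip one example in or out per step; keep the flip if fitness does not drop."""
    mask = [True] * len(train)  # start from the full training set
    best = fitness(train, val, mask)
    for _ in range(steps):
        i = rng.randrange(len(mask))
        mask[i] = not mask[i]
        score = fitness(train, val, mask)
        if score >= best:
            best = score
        else:
            mask[i] = not mask[i]  # revert a flip that made things worse
    return mask, best


rng = random.Random(0)
data = make_toy_data(rng)
train, val = data[:40], data[40:]
mask, score = hill_climb(train, val, rng)
print(f"kept {sum(mask)} of {len(mask)} examples, fitness {score:.3f}")
```

The boolean mask over the training set is the search space; with a real dataset the fitness evaluation would involve training an actual model on the selected images, which is far more expensive than this toy classifier.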

Installation

Clone repository
git clone https://gitlab-stud.elka.pw.edu.pl/mostasze/optimal_training_subset.git
Prepare environment
make_venv
Install requirements
make requirements

Running experiments

To replicate the experiments, run
make run_experiments
To inspect the results, run
mlflow ui

├── Makefile           <- Makefile with convenience commands
├── README.md          <- The top-level README for developers using this project.
├── data
│   └── raw            <- The original, immutable data dump.
│
├── docs               <- Documents
│
├── notebooks          <- Jupyter notebooks. Initial experiments.
│
├── pyproject.toml   
│
├── references         <- Data dictionaries, manuals, and all other explanatory materials.
│
├── requirements.txt   <- The requirements file for reproducing the analysis environment
│
├── setup.cfg          <- Configuration file for flake8
│
└── optimal_training_subset   <- Source code for use in this project.
    │
    ├── data           <- Data management and loading
    │
    ├── evolutionary   <- Evolutionary strategies implementations
    │
    ├── experiments    <- Experiment scripts
    │
    ├── models         <- Model definitions and architectures
    │
    ├── optimizers     <- Hill climbing algorithms
    │
    ├── utils          <- Utility functions and helpers
    │   
    └── config.py      <- Configuration settings

Authors

Mateusz Ostaszewski
Michał Sadowski
