An end-to-end MLOps pipeline demonstrating automated retraining, evaluation-based promotion, versioned deployment, and data drift monitoring, designed for a bachelor-level engineering project.
This project requires Python 3.12.x. Other versions are not officially supported.
**macOS / Linux**

```bash
python3.12 -m venv .venv
source .venv/bin/activate
```

**Windows**

```powershell
py -3.12 -m venv .venv
.\.venv\Scripts\Activate
```

**Install**

```bash
python -m pip install --upgrade pip
pip install -e .
```

**Run tests**

```bash
python -m pytest tests/ -v --tb=short
```

**Run pipeline**

```bash
run-pipeline --config src/config/pipeline.yaml
```

**First run:** If a dataset is missing its `dataset.yaml`, the pipeline prompts you interactively for the target column and task type. This happens only once; subsequent runs skip the prompt automatically.
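For orientation, the generated file might look roughly like the sketch below. The exact keys are an assumption here (they come from your interactive answers and the project's own schema), so treat this as illustrative:

```yaml
# Hypothetical dataset.yaml -- actual keys are defined by the pipeline's schema.
target: label            # column to predict
task: classification     # or "regression"
features:
  - age
  - income
```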
The pipeline executes the following stages in order:
| Stage | Status | Description |
|---|---|---|
| preprocessing | Implemented | Selects feature and target columns from each split, writes to `preprocessed/` |
| training | Placeholder | Model training (not yet implemented) |
| evaluation | Placeholder | Model evaluation (not yet implemented) |
| deployment | Placeholder | Model deployment (not yet implemented) |
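As a rough sketch of how a sequential runner like this can be wired up (the function and registry names below are illustrative, not the project's actual API):

```python
from typing import Callable

# Illustrative stage functions; in the real project each stage lives in its own module.
def preprocessing(ctx: dict) -> None:
    ctx["preprocessed"] = True  # stand-in for column selection + writing preprocessed/

def training(ctx: dict) -> None:
    raise NotImplementedError  # placeholder, mirroring the table above

STAGES: list[tuple[str, Callable[[dict], None]]] = [
    ("preprocessing", preprocessing),
    ("training", training),
]

def run_pipeline() -> None:
    ctx: dict = {}
    for name, stage in STAGES:
        print(f"[pipeline] stage: {name}")
        try:
            stage(ctx)
        except NotImplementedError:
            print(f"[pipeline] '{name}' is a placeholder, skipping")

if __name__ == "__main__":
    run_pipeline()
```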
The on-disk layout evolves as follows:

```
data/raw/<dataset>/data.csv
        ↓ ingestion + versioning
data/processed/<dataset>/<version_id>/data.csv + train/ val/ test/
        ↓ preprocessing
data/processed/<dataset>/<version_id>/preprocessed/  train.csv  val.csv  test.csv
```
Preprocessing reads column definitions (target, features) from the versioned `dataset.yaml`; no separate config file is needed.
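A minimal sketch of that column-selection step, assuming pandas, the hypothetical keys from the `dataset.yaml` sketch above, and an assumed per-split file layout:

```python
from pathlib import Path

import pandas as pd
import yaml  # PyYAML

def preprocess_split(version_dir: Path, split: str) -> None:
    """Keep only the configured feature and target columns of one split."""
    spec = yaml.safe_load((version_dir / "dataset.yaml").read_text())
    columns = spec["features"] + [spec["target"]]  # hypothetical keys, see above

    df = pd.read_csv(version_dir / split / "data.csv")  # per-split file name is assumed
    out_dir = version_dir / "preprocessed"
    out_dir.mkdir(exist_ok=True)
    df[columns].to_csv(out_dir / f"{split}.csv", index=False)

# Hypothetical dataset and version id, matching the layout sketched above.
for split in ("train", "val", "test"):
    preprocess_split(Path("data/processed/my_dataset/v1"), split)
```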
See `data/raw/README.md` for instructions on adding new datasets.