ML API — Kubernetes Deployment & Load Testing

This repo contains the code and infrastructure configuration for a machine-learning prediction API deployed on AWS EKS (Kubernetes) with Redis caching, Istio, and Grafana-based load-testing analysis (k6 + Prometheus).

I wrote a full project walkthrough here:

👉 Project Page: https://napronald.github.io/pages/mlapi.html

About this Repository

The goal of the project is to demonstrate how to run a real ML service in production-style infrastructure — including caching, autoscaling, and latency analysis under load.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
kubernetes		kubernetes
src		src
tests		tests
trainer		trainer
.gitignore		.gitignore
dockerfile		dockerfile
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
readme.md		readme.md
simulate.js		simulate.js

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ML API — Kubernetes Deployment & Load Testing

About this Repository

About

Uh oh!

Releases

Packages

Languages

napronald/Full-End-to-End-Machine-Learning-API

Folders and files

Latest commit

History

Repository files navigation

ML API — Kubernetes Deployment & Load Testing

About this Repository

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages