
k8s-gitops-platform — GitOps Delivery Platform on K8s

A complete GitOps pipeline where merging to a branch is the deployment. ArgoCD watches the repo; any change to a manifest is synced to the cluster automatically. Built to demonstrate declarative infrastructure, multi-environment promotion, and real observability — all at zero cloud cost.



Stack

| Layer | Technology |
|---|---|
| Cluster | k3s (local, single-node, Linux Mint VM on VirtualBox) |
| GitOps controller | ArgoCD |
| App packaging | Helm |
| CI/CD | GitHub Actions |
| Container registry | GitHub Container Registry (ghcr.io) |
| Ingress | NGINX Ingress Controller |
| Metrics | Prometheus (kube-prometheus-stack) |
| Dashboards | Grafana |
| Application | Python FastAPI |
| Language | Python 3.11 |

Phases

  • Phase 1 — Local cluster bootstrap (k3s + ArgoCD)
  • Phase 2 — Application + Helm chart
  • Phase 3 — GitOps wiring (ArgoCD Applications, dev + prod)
  • Phase 4 — CI/CD pipeline (GitHub Actions)
  • Phase 5 — Observability (Prometheus + Grafana)
  • Phase 6 — Security hardening
  • Phase 7 — Documentation + EKS path

Architecture

```
GitHub push (main)
        │
        ▼
GitHub Actions Pipeline
  ├── test        — pytest
  ├── build       — docker build
  ├── push        — ghcr.io/skanderba8/k8s-gitops-platform/demo-api:<git-sha>
  └── update-tag  — commits new image.tag → manifests/envs/dev/values.yaml
        │
        ▼
ArgoCD (running in cluster, polling this repo)
  ├── Detects drift in manifests/envs/dev/
  └── Syncs to k3s cluster automatically
        │
        ▼
k3s Cluster (Linux Mint VM, VirtualBox, SSH via port 2222)
  ├── Namespace: dev
  │     ├── Deployment  (demo-api, 1 replica)
  │     ├── Service     (ClusterIP)
  │     └── Ingress     (NGINX)
  ├── Namespace: prod
  │     ├── Deployment  (demo-api, 2 replicas)
  │     ├── Service     (ClusterIP)
  │     └── Ingress     (NGINX)
  └── Namespace: monitoring
        ├── Prometheus
        └── Grafana
```

Traffic: Client → NGINX Ingress → Service → Pod. There is no direct pod access — all traffic is routed through the ingress controller.


Folder Structure

```
k8s-gitops-platform/
│
├── app/                              # Application source
│   ├── main.py                       # FastAPI — /health, /info, /items
│   ├── requirements.txt
│   └── Dockerfile
│
├── charts/                           # Helm chart
│   └── demo-api/
│       ├── Chart.yaml
│       ├── values.yaml               # Base defaults
│       └── templates/
│           ├── deployment.yaml
│           ├── service.yaml
│           └── ingress.yaml
│
├── manifests/                        # ArgoCD watches this
│   ├── argocd/
│   │   ├── project.yaml              # AppProject (RBAC boundary)
│   │   ├── app-dev.yaml              # ArgoCD Application → dev
│   │   └── app-prod.yaml             # ArgoCD Application → prod
│   └── envs/
│       ├── dev/
│       │   └── values.yaml           # image.tag auto-updated by CI
│       └── prod/
│           └── values.yaml           # image.tag promoted manually
│
├── monitoring/
│   ├── prometheus-values.yaml        # kube-prometheus-stack Helm overrides
│   └── grafana-dashboard.json        # App dashboard (loaded via ConfigMap)
│
├── .github/
│   └── workflows/
│       ├── ci.yaml                   # test → build → push → update-tag
│       └── promote.yaml              # manual promote dev → prod
│
├── docs/
│   ├── local-setup.md
│   └── eks-path.md                   # EKS production path (documented, not deployed)
│
└── README.md
```

The Application — demo-api

A lightweight FastAPI service with real Prometheus metrics. Enough to generate meaningful dashboards without distracting from the platform.

Endpoints

| Method | Path | Description |
|---|---|---|
| GET | /health | Liveness probe — {"status": "ok"} |
| GET | /info | App version, environment name, pod hostname |
| GET | /items | Returns a mock item list (simulates read load) |
| POST | /items | Accepts a JSON body, returns it with a generated ID |

prometheus-fastapi-instrumentator auto-exposes /metrics — request counts, latency histograms, and error rates per endpoint. Prometheus scrapes the endpoint; Grafana graphs it.


Environments

| Environment | Namespace | Replicas | Deploy trigger |
|---|---|---|---|
| dev | dev | 1 | Automatic — every push to main |
| prod | prod | 2 | Manual — promote workflow dispatch in GitHub Actions |

Same Helm chart, different values.yaml per environment. The only differences are the image tag and the replica count.
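As an illustration, the two environment files might differ only like this (the exact layout of the values files is an assumption, not copied from the repo):

```yaml
# manifests/envs/dev/values.yaml — hypothetical layout
replicaCount: 1
image:
  tag: 3f9c2ab   # git SHA, auto-updated by CI

# manifests/envs/prod/values.yaml — hypothetical layout
replicaCount: 2
image:
  tag: 3f9c2ab   # promoted manually via the promote workflow
```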


CI/CD Pipeline

test → build → push → update-tag → (manual) promote

| Stage | What it does |
|---|---|
| test | pytest against the FastAPI app |
| build | docker build |
| push | Push to ghcr.io, tagged with the git commit SHA |
| update-tag | Commit the new image.tag to manifests/envs/dev/values.yaml |
| promote | Manual workflow dispatch — copies the dev tag to the prod values |

Secrets (GITHUB_TOKEN) are injected automatically by GitHub Actions — never in code, never in Git.
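The update-tag stage can be sketched in a few lines of shell. The values.yaml layout and local file name below are assumptions for illustration — in the real pipeline the edit targets manifests/envs/dev/values.yaml with the actual commit SHA:

```bash
# Stand-in for the file CI edits (layout assumed for illustration)
printf 'image:\n  repository: ghcr.io/skanderba8/k8s-gitops-platform/demo-api\n  tag: old0000\n' > values.yaml

NEW_TAG="3f9c2ab"   # in CI: the git commit SHA being deployed
# Rewrite only the tag line, preserving indentation
sed -i "s/^\(  tag: \).*/\1${NEW_TAG}/" values.yaml
cat values.yaml
```

ArgoCD then sees the committed change as drift and rolls out the new image — the commit is the deployment.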


FinOps Notes

Default setup costs zero:

  • k3s runs on a local Linux Mint VM — no cloud compute
  • GitHub free tier covers Actions minutes, container registry, and repo hosting
  • ArgoCD, Prometheus, Grafana are open source

EKS production path (documented in docs/eks-path.md, not deployed)

| Resource | Type | Est. cost |
|---|---|---|
| EKS control plane | Managed | ~$0.10/hr |
| EC2 node (×1) | t3.small spot | ~$0.007/hr |
| Total (idle) | | ~$15–20/month |

FinOps decisions for EKS path:

  • Spot instances — 60–70% cheaper than on-demand for non-critical workloads
  • Single node group — minimum viable cluster, scale up only when needed
  • terraform destroy runbook — tear down after demos, no idle cost
  • No NAT Gateway — public subnets with restricted SGs to avoid $0.045/hr NAT charge
  • GitHub registry (ghcr.io) — free, avoids ECR costs ($0.10/GB/month) entirely

Security Highlights

| Area | Implementation |
|---|---|
| ArgoCD scope | AppProject limits ArgoCD to specific namespaces and repos only |
| Container | Non-root UID (1000), read-only root filesystem, no privilege escalation |
| Resource limits | CPU + memory limits on all pods — no runaway resource consumption |
| Secrets | GitHub Actions GITHUB_TOKEN — auto-injected, never stored manually |
| Registry | ghcr.io — private by default, scoped to repo |
| RBAC | ArgoCD ServiceAccount has minimum required cluster permissions |
| Image tagging | Git SHA tags — no latest in production, full traceability |
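The container hardening above maps to a standard Kubernetes container securityContext. A sketch of how it might look in the chart's deployment template — field names are standard Kubernetes, but the exact values in this repo's chart may differ:

```yaml
# Container-level securityContext (values assumed for illustration)
securityContext:
  runAsNonRoot: true
  runAsUser: 1000
  readOnlyRootFilesystem: true
  allowPrivilegeEscalation: false
  capabilities:
    drop: ["ALL"]
resources:
  limits:
    cpu: 250m
    memory: 256Mi
```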

Workflow

Step 1 — Start the VM and SSH in

```bash
# On Windows Git Bash
VBoxManage startvm "Linux Mint" --type headless
ssh -p 2222 skander@127.0.0.1
```

Step 2 — Access ArgoCD UI

```bash
# On Windows Git Bash (new terminal)
ssh -p 2222 -L 8080:localhost:8080 skander@127.0.0.1

# On VM
kubectl port-forward svc/argocd-server -n argocd 8080:443
```

Open https://localhost:8080 in a Windows browser. Login: admin, with the password from:

```bash
kubectl get secret argocd-initial-admin-secret -n argocd -o jsonpath="{.data.password}" | base64 -d && echo
```

Step 3 — Apply ArgoCD Applications

```bash
kubectl apply -f manifests/argocd/
```

ArgoCD takes ownership of dev and prod namespaces from this point.

Step 4 — Trigger a deployment

Push any change to main. Watch in the ArgoCD UI:

  1. GitHub Actions runs — image built, pushed, tag committed
  2. ArgoCD detects drift in manifests/envs/dev/values.yaml
  3. ArgoCD syncs — new pod rolling out in dev namespace

Step 5 — Promote to prod

GitHub → Actions → promote workflow → Run workflow manually. ArgoCD detects the tag change in manifests/envs/prod/values.yaml and syncs prod.
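The promotion itself is just a tag copy between the two values files. A minimal sketch, assuming both files keep the tag on an indented `tag:` line (the layout and local file names are assumptions for illustration):

```bash
# Stand-ins for manifests/envs/{dev,prod}/values.yaml
printf 'image:\n  tag: 3f9c2ab\n' > dev-values.yaml
printf 'image:\n  tag: old0000\n' > prod-values.yaml

# Read the tag currently deployed in dev...
TAG=$(sed -n 's/^  tag: //p' dev-values.yaml)
# ...and write it into the prod values file
sed -i "s/^\(  tag: \).*/\1${TAG}/" prod-values.yaml
cat prod-values.yaml
```

Because the copy happens as a Git commit, prod promotion stays auditable and revertible like any other change.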

Step 6 — Check observability

```bash
# On Windows Git Bash (new terminal)
ssh -p 2222 -L 3000:localhost:3000 skander@127.0.0.1

# On VM
kubectl port-forward svc/grafana -n monitoring 3000:3000
```

Open http://localhost:3000 — dashboard shows request rate, error rate, p99 latency.


Local Development

```bash
cd app
python3 -m venv venv
source venv/bin/activate       # Linux/VM
pip install -r requirements.txt
uvicorn main:app --reload
# http://127.0.0.1:8000

docker build -t demo-api .
docker run -p 8000:8000 demo-api
```

API Reference

| Method | Endpoint | Body | Response |
|---|---|---|---|
| GET | /health | — | 200 + {"status": "ok"} |
| GET | /info | — | 200 + version, env, hostname |
| GET | /items | — | 200 + JSON array |
| POST | /items | {"name": "..."} | 201 + item with ID |
| GET | /metrics | — | 200 + Prometheus text format |

Problems & Fixes

| Problem | Fix |
|---|---|
| k3s doesn't run on Windows or in Git Bash | Ran k3s on a Linux Mint VM via VirtualBox instead |
| VM on NAT — no direct SSH access | Added a VirtualBox port-forward rule: host 2222 → guest 22 |
| sudo kubectl required every time | Copied the k3s kubeconfig to ~/.kube/config and set the KUBECONFIG env var |
| Helm not installed on VM | Installed via curl https://raw.githubusercontent.com/helm/helm/main/scripts/get-helm-3 \| bash |
| python3-venv missing on VM | Installed via sudo apt install python3.12-venv -y |
| Docker not installed on VM | Installed via sudo apt install docker.io -y and added the user to the docker group |
| GitHub push rejected password auth | GitHub dropped password auth — switched to SSH key authentication |
| GitHub PAT gave 403 on push | Abandoned HTTPS + token, switched to an SSH key added to the GitHub account |
| ArgoCD sync Unknown — out-of-bounds symlinks | app/venv/ had been committed — removed with git rm -r --cached app/venv/ |
| ArgoCD not picking up new config after kubectl apply | Deleted and recreated the Application resources to force fresh config |
| ghcr.io image pull 403 | Package was private by default — made it public in GitHub Package settings |
| CI updating wrong file — pods stuck on latest | Switched from an inline values: block to parameters: in the ArgoCD Application manifest |
| pytest not found in CI | Added pytest to app/requirements.txt |
| Image tag lowercase error in CI | github.repository_owner returns mixed case — piped through tr '[:upper:]' '[:lower:]' |
| ArgoCD not syncing after tag update | Deleted and recreated the Application to force ArgoCD to re-read the manifest |
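The lowercase fix is worth spelling out: ghcr.io rejects mixed-case image names, while GitHub's `github.repository_owner` context preserves the account's original casing.

```bash
# What github.repository_owner returns for this account (mixed case)
OWNER="Skanderba8"
# Normalize before using it in an image name
LOWER_OWNER=$(echo "$OWNER" | tr '[:upper:]' '[:lower:]')
IMAGE="ghcr.io/${LOWER_OWNER}/k8s-gitops-platform/demo-api"
echo "$IMAGE"   # ghcr.io/skanderba8/k8s-gitops-platform/demo-api
```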

Things to Improve

  • HTTPS on the ingress via cert-manager + Let's Encrypt
  • HPA (Horizontal Pod Autoscaler) based on Prometheus custom metrics
  • Loki log aggregation (currently Prometheus + Grafana only)
  • Sealed Secrets or External Secrets Operator for in-cluster secret management
  • Trivy image scanning stage in GitHub Actions
  • Renovate bot for automated Helm chart and image dependency updates
  • Webhook-based ArgoCD sync instead of polling
  • Multi-node k3s setup to simulate real cluster scheduling

Dev Environment

  • OS: Windows (host), Linux Mint (VM via VirtualBox)
  • SSH: ssh -p 2222 skander@127.0.0.1
  • Editor: VS Code
  • Tools: Git, Docker, kubectl, Helm, k3s
  • Accounts: GitHub (free tier)
