🏦 Loan Payback Competition

This repository contains the code, experiments, and infrastructure for the Kaggle Playground Series – Season 5, Episode 11 competition. The task is to predict whether a borrower will successfully repay a loan using a realistic, synthetic binary-classification dataset.

🔍 Overview

The dataset is synthetic: it was generated by a deep-learning model trained on real loan-prediction data.
Goal: predict loan repayment based on borrower and loan attributes (e.g., income, debt ratio, credit history, loan purpose).
This repo implements an end-to-end ML workflow: data ingestion → EDA → feature engineering → modeling → evaluation → submission.

⚙️ Tech Stack

Category	Tools / Packages
Language	Python
Experiments	Marimo notebooks
Models	CatBoost, XGBoost, LightGBM, Random Forest
Hyperparameter Tuning	Optuna
Tracking	MLflow
Reproducibility	Docker, shell scripts

🎯 Results

The target metric was the area under the ROC curve (ROC AUC). The histograms below show the leaderboard scores: the left plot shows all scores, and the right plot zooms in on scores above 0.9.

[Figure: leaderboard score histograms. Left panel: All Leaderboard Scores. Right panel: Scores above 0.9.]
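
For readers unfamiliar with the metric, here is a minimal, dependency-free sketch of how ROC AUC can be computed from labels and predicted scores. It uses the rank-based (Mann-Whitney U) formulation and is not taken from this repository's code. The function name `roc_auc` and the label encoding (1 = loan repaid, 0 = not repaid) are assumptions made for illustration; in practice a library routine such as scikit-learn's `roc_auc_score` would normally be used instead.

```python
from typing import Sequence


def roc_auc(y_true: Sequence[int], y_score: Sequence[float]) -> float:
    """Rank-based ROC AUC (Mann-Whitney U), using average ranks for tied scores.

    Assumes binary labels where 1 marks the positive class (hypothetical encoding).
    """
    if len(y_true) != len(y_score):
        raise ValueError("y_true and y_score must have the same length")

    n_pos = sum(1 for y in y_true if y == 1)
    n_neg = len(y_true) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("AUC is undefined when only one class is present")

    # Sort sample indices by score, then assign 1-based ranks.
    # Samples with equal scores share the average of their positions.
    order = sorted(range(len(y_score)), key=lambda i: y_score[i])
    ranks = [0.0] * len(y_score)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and y_score[order[j + 1]] == y_score[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1  # mean of 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1

    # U statistic for the positive class, normalised by the number of
    # positive/negative pairs, gives the probability that a random positive
    # is scored above a random negative.
    pos_rank_sum = sum(r for r, y in zip(ranks, y_true) if y == 1)
    u_stat = pos_rank_sum - n_pos * (n_pos + 1) / 2
    return u_stat / (n_pos * n_neg)


if __name__ == "__main__":
    labels = [0, 0, 1, 1]
    scores = [0.1, 0.4, 0.35, 0.8]
    print(roc_auc(labels, scores))  # 0.75
```

A value of 0.5 corresponds to random ranking and 1.0 to a perfect ranking, which is why the zoomed-in histogram of scores above 0.9 is the more informative of the two plots.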

Repository: AndreWal/Loan_Payback_Competition