olijacklu/README.md

Welcome! My name is Oliver Jack and I am glad that you have stumbled across my page!🤗

I am a Data Scientist at Amazon with experience in agentic systems, causal ML, and retrieval optimization. I hold a Master's in Mathematics, Vision, and Learning (MVA) from ENS Paris-Saclay.

🔬 My Projects

Thesis evaluating retrieval strategies for enterprise knowledge bases connected to agentic systems. Studied agentic chunking, GraphRAG, query-aware hybrid fusion, and conditional reranking, achieving a 52% relative improvement in Top-1 Exact Match.

Implementation for the final project of the LLMs class at MVA, focusing on post-training pruning of MoE models.

Fine-tuning & evaluating pre-trained audio models (HuBERT & XLSR) on the ML-SUPERB dataset for monolingual and multilingual speech recognition tasks.

Implementation of a reinforcement learning solution for HIV treatment optimization using Fitted Q-Iteration (FQI) with XGBoost regression models.

Evaluation of various anomaly detection techniques on time series data & implementation of tools for benchmarking, ensemble creation, & performance analysis.

Implementation & analysis of the PCA-based K-Means Clustering method proposed in the paper "K-means Clustering via Principal Component Analysis" by Ding and He (2004).

Fine-tuning RoBERTa using Low-Rank Adaptation (LoRA) on different datasets.

💼 Experience

  • Data Scientist at Amazon — Building agentic solutions, knowledge base infrastructure, and causal ML models for supply chain optimization
  • Research Intern at Cambridge Centre for Alternative Finance (CCAF) — Anomaly detection and NLP for financial research

🎓 Education

  • M2 MVA (Mathematics, Vision, Learning) at ENS Paris-Saclay
  • Engineering Diploma (Exchange semester) at CentraleSupélec
  • Bachelor in Mathematics from University of Luxembourg

🛠️ Skills

  • Programming: Python, Java, SQL, R
  • AI & ML Frameworks: PyTorch, TensorFlow, scikit-learn
  • AWS: Bedrock, AgentCore, Strands
  • Data Engineering: PostgreSQL, MongoDB, Spark, Hadoop
  • Languages: English (C2), French (C1), German (C1), Luxembourgish (C2)

📫 How to reach me

Pinned

  1. ML-SUPERB-Project (Public)

    This repository contains code for fine-tuning & evaluating pre-trained audio models (like HuBERT & XLSR) on the ML-SUPERB dataset for monolingual and multilingual speech recognition tasks.

    Jupyter Notebook

  2. PruningOfExperts-Project (Public)

    Forked from jlopetegui98/PruningOfExperts

    Implementation for the final project of the LLMs class at MVA, focusing on post-training pruning of MoE models.

    Jupyter Notebook

  3. AnomalyDetection-Project (Public)

    Evaluation of various anomaly detection techniques on time series data & implementation of tools for benchmarking, ensemble creation, & performance analysis.

    Jupyter Notebook

  4. RL-HIV-Project (Public)

    Implementation of a reinforcement learning solution for HIV treatment optimization using Fitted Q-Iteration (FQI) with XGBoost regression models.

    Python 1

  5. K-MeansPCA-Project (Public)

    Implementation & analysis of the PCA-based K-Means Clustering method proposed in the paper "K-means Clustering via Principal Component Analysis" by Ding and He (2004).

    Jupyter Notebook

  6. MVA (Public)

    Assignments & projects completed throughout my year of MVA.

    Jupyter Notebook 1