Skip to content

Hamza-Bouali/Hamza-Bouali

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

52 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

Hamza Bouali

Data Engineer ETL Specialist Cloud Data Solutions

πŸ‘¨β€πŸ’» About Me

Data Engineer with hands-on expertise in designing and implementing scalable ETL/ELT pipelines, data warehousing, and cloud infrastructure. I specialize in transforming raw data into reliable, well-architected systems that power data-driven decision making.

  • πŸ”­ Currently working on building robust data pipelines, data warehouse optimization, and cloud-native data solutions
  • 🌱 Experienced with Apache Airflow, Apache Kafka, Spark, and modern data orchestration tools
  • πŸ’‘ Passionate about data quality, performance optimization, and scalable infrastructure design
  • πŸš€ Proven track record building ETL solutions that handle complex multi-source data ingestion and transformation
  • 🎯 Seeking Data Engineering internship (July 2025) to expand expertise in large-scale data systems

πŸŽ“ Education

Data Science & Knowledge Engineering β€” ESI (Γ‰cole des Sciences de l'Information), Kenitra, Morocco (2023 - 2026)

  • Curriculum: Big Data, Cloud Computing, Data Engineering, DevOps, Software Engineering, Machine Learning
  • Applied research in data systems and national hackathons

Preparatory Classes (CPGE) — OMAR IBN AL KHATAB, Meknès, Morocco (2021 - 2023)

  • Mathematics, Physics, and logical reasoning with focus on engineering preparation

Baccalaureate in Mathematical Sciences — Fès, Morocco (2021)

πŸ’Ό Professional Experience

Data Engineering Intern at Veolia, Rabat, Morocco (Feb 2026- present )

  • manage cloud data infrastructure in Data Fabric
  • build agnostic data infrastructure for data products
  • optimize the data pipelines in the cloud

Data Analyst Intern at Decathlon, Casablanca, Morocco (June - August 2025)

  • Built ETL pipelines for multi-source internal data extraction and transformation
  • Designed and deployed dynamic dashboards for store performance analysis and strategic insights
  • Contributed to decision-making for new store opening in SalΓ© (2025)

Database Administration & Backend Developer at OBG Incub, Rabat, Morocco (January - May 2025)

  • Developed scalable backend services and data pipelines using Django and FastAPI
  • Designed and optimized database schemas for improved query performance
  • Implemented RAG pipeline backend logic with vector database integration (Qdrant)
  • Deployed containerized services on AWS ECS with CI/CD automation using GitHub Actions
  • Collaborated with DevOps team on infrastructure and monitoring solutions

Data Engineer Intern at AILAND, Rabat, Morocco (July 2024)

  • Cleaned, transformed, and analyzed social media data at scale
  • Implemented NLP model fine-tuning for localized data processing
  • Optimized data pipeline models improving processing efficiency by 40%

πŸ› οΈ Technical Skills

Data Engineering & Pipeline Development

Python Apache Airflow Apache Kafka PySpark ETL/ELT Kestra SQL Advanced Pandas

Data Warehousing & Analytics

Data Warehouse Design Power BI Tableau SSIS Streamlit

Databases & Data Storage

PostgreSQL MySQL MongoDB SQL Server Redshift Azure Data Warehouse Neo4j

Cloud & Infrastructure

AWS Docker Docker Compose AWS ECS CI/CD Linux Bash

Data Quality & Monitoring

Data Quality Assurance Monitoring & Logging MLflow Git

Backend Development

Django FastAPI Flask REST APIs JavaScript TypeScript

πŸš€ Featured Projects

Banking BI System & Data Warehouse

Enterprise data warehouse and analytics platform for banking operations

  • Tech Stack: SQL Server, SSIS, Power BI, Python
  • Key Achievements: Designed and implemented star schema data warehouse, built ETL processes for complex banking data transformation, created interactive KPI dashboards, optimized DirectQuery connections for real-time analytics
  • Impact: Enabled business intelligence and predictive insights for strategic decision-making

MemorAI

Intelligent data management system with multi-modal support

  • Tech Stack: Django REST, AWS (S3, DynamoDB, Lambda, EC2), Qdrant Vector Database, GitHub Actions
  • Data Engineering Focus: Designed scalable data pipelines for ingestion and storage, implemented semantic search with vector indexing, built robust data persistence layer on cloud infrastructure
  • Features: Multi-turn context persistence, semantic search, RAG-based retrieval

Call center operations platform with data processing capabilities

  • Tech Stack: Python, Django, data pipeline architecture
  • Features: Real-time data processing, analytics infrastructure

Healthcare platform with data management systems

  • Tech Stack: Django, PostgreSQL, REST APIs
  • Data Features: Patient records management, secure data handling

ML & Deep Learning Models From Scratch

Mathematical implementation of machine learning algorithms

  • Built 6 ML models: Linear Regression, Logistic Regression, KNN, Decision Trees, Random Forest, SVM
  • Implemented 4 Deep Learning architectures: MLP, CNN, RNN, Autoencoder
  • Complete backpropagation and optimization algorithms

πŸ“œ Professional Certifications

Data Engineer Associate Python Data Associate SQL Associate Supervised Machine Learning

πŸ† Awards & Recognition

  • 2nd Place β€” Hackathon MDFDS (Code ESI Club)
  • 3rd Place β€” EAIC Data Competition
  • Top 25 out of 400 β€” Think AI (2nd Edition)
  • 20th Place β€” MCPC - Moroccan Competitive Programming Championship

πŸ‘₯ Leadership & Volunteering

Co-Head of Competitive Programming Cell at CODE-ESI, Rabat (September 2024 - June 2025)

  • Mentoring engineers in algorithmic problem-solving

Treasurer at JCMP-ESI, Rabat (September - December 2024)

  • Managed finances and budgeting

Sponsorship & Event Committee Member for Moroccan Days of Future Data Scientists (May 2024 - Present)

  • Coordinated partnerships for data science events

🌐 Languages

English French Arabic

πŸ’‘ Core Competencies

Pipeline Architecture: ETL/ELT design and optimization, data orchestration, workflow automation, scheduling and dependency management

Data Warehousing: Schema design (star/snowflake), dimensional modeling, fact/dimension tables, slowly changing dimensions, query optimization

Data Processing: Distributed processing with Spark, stream processing with Kafka, batch processing optimization, data transformation logic

Database Management: Schema design, query optimization, indexing strategies, performance tuning, backup/recovery

Cloud Infrastructure: AWS ecosystem (EC2, S3, RDS, Lambda), containerization with Docker, infrastructure as code, cost optimization

Data Quality: Validation frameworks, anomaly detection, data profiling, pipeline monitoring, error handling

πŸ“ˆ GitHub Stats

GitHub Stats GitHub Streak

🀝 Let's Connect

Portfolio LinkedIn Email Phone


Open to collaboration on Data Engineering, ETL, and Cloud Data Solutions

Last Updated: January 2026

### πŸ’Ό Open to Collaboration and New Opportunities!

I'm always eager to work on innovative projects, contribute to open-source, and expand my expertise. If you're looking for a dedicated professional who combines data engineering with full-stack development skills, let's connect!

πŸ” Actively seeking a 2-month Data Engineering/MLOps internship starting July 2025

About

Config files for my GitHub profile.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors