Skip to content
View mahdiqb's full-sized avatar

Block or report mahdiqb

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
mahdiqb/README.md

Hi there 👋

I'm Mahdi, a data geek with 9 years of experience in the data space. Throughout my career, I wore various hats on the engineering side (data engineer, tech lead, data architect, and ML Ops engineer) before switching to product management - I'm currently a Senior PM at Neo4j, focusing on graph analytics. Before transitioning to product, I mainly worked on designing and building petabyte-scale data platforms. I'm very passionate about open-source projects and enjoy working with data and designing scalable solutions. You can also read my content on Medium and via the Data Espresso newsletter.

The technologies I'm most familiar with:

  • Apache Spark (and larger Databricks ecosystem): I used it on a daily basis for nearly five years (and so we know each other pretty well).
  • dbt: It's the tool I currently work with the most. At Zendesk, I added dbt to our data stack and worked on defining and implementing standards, frameworks, and automation to better leverage it at scale. (Article from the Zendesk Engineering blog)
  • Snowflake: Was part of the core team that worked on transitioning from BigQuery to Snowflake at Zendesk.
  • AWS Ecosystem: Worked on it for 3 years, for various data and ML projects (mostly worked with Glue, EMR, Athena, ECS, SageMaker, Redshift, and the AWS CI/CD stack).
  • GCP Ecosystem: Worked on it for 3 years, mostly everything BigQuery and GKE.
  • Hadoop: Worked with Hadoop data lakes for two and a half years (it was the ecosystem that first introduced me to distributed systems and the paradigms/concepts behind them).
  • Other notable projects/tools: Apache Superset, Apache Airflow, Apache Zeppelin, Apache Hive, Dremio, Jupyter, and D3.js.
  • Languages I'm fluent in: Python, Java, and -obviously- SQL.
  • Other languages I used in the past: C++, C#, JavaScript (Angular, Node.js), and HTML+CSS.
  • IaC: Terraform and CloudFormation.

Notable published work:

Notable presentations and podcasts:

Pinned Loading

  1. modern_data_platform modern_data_platform Public

    Sample configuration to deploy a modern data platform.

    Shell 89 21

  2. dynamic_dashboards_generator dynamic_dashboards_generator Public

    A POC for an application that leverages notebooks to generate dynamic dashboards

    Jupyter Notebook 5 1

  3. dataforgoodfr/batch8_worldbank dataforgoodfr/batch8_worldbank Public

    Jupyter Notebook 9 17

  4. deprecated_mahdiqb.github.io deprecated_mahdiqb.github.io Public

    Forked from jarrekk/Jalpc

    Mahdi Karabiben's website

    CSS 1