Skip to content
View t0mpere's full-sized avatar
🦸‍♂️
Focusing
🦸‍♂️
Focusing
  • Bumble
  • London

Highlights

  • Pro

Block or report t0mpere

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
t0mpere/README.md

Hello there 🙂

Data Platform Engineer @ Bumble Inc.

OSS Contributions

Release 1.1.0 Notable Features

Issues

  • How to disable fwd and inverted index on a BYTES metric column. #11659
  • Upgrade inconsistency: Confusing behavior with noDictionaryConfig and fieldConfig for enabling compression on raw forward index. #11745
  • Enable Compression on BYTES star tree index #11485
  • Minion Batch ingestion scheduling bottleneck #11282
  • Minion HLL Merge Rollup aggregation function #9971

PRs

  • Add compression configuration for aggregationConfigs to StartreeIndexConfigs #11744
  • Adding Dynamic Env configuration to Apache Pinot #12307
  • MergeRollupTask config behaviour enhancement #17015

PRs

  • Adding aggregationConfigs example to starTreeIndexConfigs #258
  • Note on noDictionary compression for metrics columns in forward-index.md #244
  • Hint to avoid having too many files in inputDirURI #178
  • Dynamic Environment Configuration documentation #289

Work

  • Data Platform Engineer @ Bumble Inc. Apr 2022 - Present
    • Data Platform tools software development. (Python, Java, TypeScript, PHP)
    • Maintaining PB scale data infrastructure. (Hadoop, Spark, Snowflake)
    • Optimisation of Big Data pipelines. (Query profiling, Spark, Clickhouse, Pinot)
    • Apache Pinot OS contributor.
    • Upgraded backend engine for in-house data discovery tool using Apache Pinot with full ownership of Research and system design, GKE K8s provisioning, deployment and admin, Pinot deployment, Spark data generation, Schema generation, Validation, Performance/Cost tuning, UI redesign. Achieved a >10x speed improvement on page response times.
    • High availability production Airflow deployment on K8s, maintenance, and admin
    • Experience working with On-Prem and Cloud infrastructure.
    • Technologies used Apache Pinot, Clickhouse, Snowflake, k8s, Python, Java, Spark, Hadoop, Kafka, Airflow, Flink, Prometheus, TeamCity
  • Data Engineer @ Housing Anywhere B.v. August 2020 - Mar 2022

Education

  • Politecnico di Milano - Computer Science Engineering MSc and BSc

Pinned Loading

  1. apache/pinot apache/pinot Public

    Apache Pinot - A realtime distributed OLAP datastore

    Java 6k 1.4k

  2. APPiccio/ing-sw-2018-piccinini-peresson-peressini APPiccio/ing-sw-2018-piccinini-peresson-peressini Public

    Repo della prova finale di ingegneria del software

    Java 2