Data Platform Engineer @ Bumble Inc.
Release 1.1.0 Notable Features
- How to disable fwd and inverted index on a BYTES metric column. #11659
- Upgrade inconsistency: Confusing behavior with noDictionaryConfig and fieldConfig for enabling compression on raw forward index. #11745
- Enable Compression on BYTES star tree index #11485
- Minion Batch ingestion scheduling bottleneck #11282
- Minion HLL Merge Rollup aggregation function #9971
- Add compression configuration for aggregationConfigs to StartreeIndexConfigs #11744
- Adding Dynamic Env configuration to Apache Pinot #12307
- MergeRollupTask config behaviour enhancement #17015
- Adding aggregationConfigs example to starTreeIndexConfigs #258
- Note on
noDictionarycompression for metrics columns in forward-index.md #244 - Hint to avoid having too many files in inputDirURI #178
- Dynamic Environment Configuration documentation #289
CV (here)
- Data Platform Engineer @ Bumble Inc. Apr 2022 - Present
- Data Platform tools software development. (Python, Java, TypeScript, PHP)
- Maintaining PB scale data infrastructure. (Hadoop, Spark, Snowflake)
- Optimisation of Big Data pipelines. (Query profiling, Spark, Clickhouse, Pinot)
- Apache Pinot OS contributor.
- Upgraded backend engine for in-house data discovery tool using Apache Pinot with full ownership of Research and system design, GKE K8s provisioning, deployment and admin, Pinot deployment, Spark data generation, Schema generation, Validation, Performance/Cost tuning, UI redesign. Achieved a >10x speed improvement on page response times.
- High availability production Airflow deployment on K8s, maintenance, and admin
- Experience working with On-Prem and Cloud infrastructure.
- Technologies used Apache Pinot, Clickhouse, Snowflake, k8s, Python, Java, Spark, Hadoop, Kafka, Airflow, Flink, Prometheus, TeamCity
- Data Engineer @ Housing Anywhere B.v. August 2020 - Mar 2022
- Politecnico di Milano - Computer Science Engineering MSc and BSc

