Overview
Several pipeline phases between Normalise (Phase 2) and Harmonise (Phase 9) have been migrated to Polars, but we do not yet have clear measurements showing the performance impact. A performance report is required to compare these phases against their legacy equivalents. This will help confirm improvements and highlight any areas that still require optimisation.
Tech Approach
- Benchmark only Phases 2 to 9: NormalisePhase, ParsePhase, ConcatFieldPhase, and all subsequent phases up to HarmonisePhase.
- Use representative datasets to test both legacy and Polars implementations under the same conditions.
- Measure runtime for each phase.
- Ensure tests are performed on consistent hardware to avoid environmental differences.
- Produce a concise report with tables or charts summarising the results and any observed regressions or anomalies.
Acceptance Criteria / Tests
Resourcing & Dependencies
- Depends on Polars versions of Phases 2 to 9 being stable.
- Can be completed by any developer able to run and compare both pipelines.
- No external team involvement expected.
Overview
Several pipeline phases between Normalise (Phase 2) and Harmonise (Phase 9) have been migrated to Polars, but we do not yet have clear measurements showing the performance impact. A performance report is required to compare these phases against their legacy equivalents. This will help confirm improvements and highlight any areas that still require optimisation.
Tech Approach
Acceptance Criteria / Tests
Resourcing & Dependencies