devel0pez-com/pandora

pandora

          ╔══════════════════════════════════╗
          ║      ✦  P A N D O R A  ✦        ║
          ╠══════════════════════════════════╣
          ║                                  ║
          ║           ███    ███             ║
          ║          ██  █  █  ██            ║
          ║         ██    ██    ██           ║
          ║        ██   ██  ██   ██          ║
          ║       ██  ██  ██  ██  ██         ║
          ║        ██   ██  ██   ██          ║
          ║         ██    ██    ██           ║
          ║          ██  █  █  ██            ║
          ║           ███    ███             ║
          ║                                  ║
          ║      { nix flake toolbox }       ║
          ║                                  ║
          ╠══════════════════════════════════╣
          ║  ░░░░░░░░░░░░░░░░░░░░░░░░░░░░  ║
          ║  ░░░░░░░░░░░░░░░░░░░░░░░░░░░░  ║
          ╚══════════════════════════════════╝

Once opened, there's no going back.

A Nix flake toolbox for Apache data projects. Each subfolder is an independent dev shell.

Projects

| Folder | Project | Description |
|---|---|---|
| datafusion-comet/ | Apache DataFusion Comet | Spark plugin: accelerates queries with native DataFusion execution |
| auron/ | Apache Auron | Spark & Flink plugin: native vectorized execution via DataFusion |
| datafusion/ | Apache DataFusion | Standalone Rust SQL query engine |

Comet vs Auron

Both are Spark plugins that use Apache DataFusion (Rust) to accelerate query execution. They replace Spark JVM operators with native implementations.

| | Comet | Auron |
|---|---|---|
| Apache status | Incubating | Incubating (formerly Blaze) |
| Approach | Replaces individual operators (scan, filter, aggregate) | Converts entire stages to native execution |
| Shuffle | JVM-based (Comet shuffle manager) | Native Arrow-based shuffle |
| Flink support | No | Yes |
| Pre-built JAR | Yes (Maven Central) | No (must compile from source) |
| Spark versions | 3.4, 3.5 | 3.0 - 4.1 |
| Spark UI operators | CometScan, CometHashAggregate, ... | NativeParquetScan, NativeHashAggregate, ... |
| offHeap required | Yes (spark.memory.offHeap.enabled=true) | No |
| Key config | spark.plugins=...CometPlugin | spark.sql.extensions=...AuronSparkSessionExtension |

When to use Comet: Quick setup, pre-built JAR available, mature operator coverage.

When to use Auron: Broader Spark version support, native shuffle, Flink workloads, more aggressive native conversion.
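
As a rough illustration of the "Key config" and "offHeap required" rows, the sketch below prints the `--conf` flags for one accelerator. The function name `accel_conf` and the `OFFHEAP_SIZE` variable are hypothetical, not part of this repo. The class name is passed as an argument because the table elides its package prefix, and the `4g` off-heap size is only a placeholder (Spark needs a positive `spark.memory.offHeap.size` once off-heap is enabled). A real setup likely needs more than this, such as the plugin JAR on the classpath.

```shell
# Print the spark-submit --conf flags for one accelerator, one flag per line.
# Usage: accel_conf <comet|auron> <fully-qualified-class-name>
# The class name is an argument because the comparison table elides its prefix.
accel_conf() {
  engine="$1"
  class="$2"
  if [ -z "$class" ]; then
    echo "accel_conf: missing class name" >&2
    return 2
  fi
  case "$engine" in
    comet)
      echo "--conf spark.plugins=$class"
      # Comet requires off-heap memory (see the comparison table).
      echo "--conf spark.memory.offHeap.enabled=true"
      # Placeholder size; override with OFFHEAP_SIZE (hypothetical variable).
      echo "--conf spark.memory.offHeap.size=${OFFHEAP_SIZE:-4g}"
      ;;
    auron)
      echo "--conf spark.sql.extensions=$class"
      ;;
    *)
      echo "accel_conf: unknown engine '$engine' (expected comet or auron)" >&2
      return 1
      ;;
  esac
}
```

It could be used as `spark-submit $(accel_conf comet "$COMET_PLUGIN_CLASS") app.jar`, where `COMET_PLUGIN_CLASS` and `app.jar` are stand-ins you would supply yourself.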

Usage

cd <project-folder>
nix develop
# Each shell provides helper commands (run 'menu' to see them)
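
To see which folders can stand in for `<project-folder>`, a small helper along these lines may be handy. It is a sketch that assumes each dev shell keeps its `flake.nix` at the top of its subfolder, and `list_shells` is a hypothetical name, not one of the helper commands the shells provide.

```shell
# List every immediate subfolder of a checkout that contains a flake.nix.
# Usage: list_shells [repo-root]   (defaults to the current directory)
list_shells() {
  root="${1:-.}"
  for flake in "$root"/*/flake.nix; do
    # With no matches the glob stays literal, so skip anything that is not a file.
    [ -f "$flake" ] || continue
    basename "$(dirname "$flake")"
  done
}
```

Running `list_shells` from the repository root prints one folder name per line; `cd` into any of them and run `nix develop` as shown above.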
