This repository will walk you through several katas for learning how to do joins with Spark+Scala.

chandnipatelTW/join-transformations

Joining Data & Performance Tuning

Goals:

  • Understand how to do basic joins with Spark in the Shopping class
    • Practice with left, right, and inner joins
  • Analyze DAGs in the Employee class
    • Broadcast joins
    • Shuffle joins vs. broadcast joins
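
As a rough illustration of the goals above, the following sketch runs inner, left, and right joins on two tiny DataFrames, then compares the physical plan of a join with a broadcast hint against one where automatic broadcasting is disabled. The object name `JoinSketch`, the sample data, and the column names are hypothetical and are not taken from the Shopping or Employee classes; the sketch assumes Spark SQL is on the classpath.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}
import org.apache.spark.sql.functions.broadcast

// Hypothetical example object; not part of the repository's katas.
object JoinSketch {

  // Builds two small DataFrames sharing a `customer_id` column (made-up data).
  def sampleData(spark: SparkSession): (DataFrame, DataFrame) = {
    import spark.implicits._
    val customers = Seq((1, "Ana"), (2, "Ben"), (3, "Cy"))
      .toDF("customer_id", "name")
    val orders = Seq((101, 1, 25.0), (102, 1, 10.0), (103, 4, 7.5))
      .toDF("order_id", "customer_id", "amount")
    (customers, orders)
  }

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("join-sketch")
      .master("local[*]") // assumption: local run for experimentation
      .getOrCreate()

    val (customers, orders) = sampleData(spark)

    // Inner join: only customers that have at least one matching order.
    customers.join(orders, Seq("customer_id"), "inner").show()

    // Left join: every customer, with nulls where no order matches.
    customers.join(orders, Seq("customer_id"), "left").show()

    // Right join: every order, with nulls where no customer matches.
    customers.join(orders, Seq("customer_id"), "right").show()

    // Broadcast hint: asks Spark to ship the small side to every executor
    // instead of shuffling both sides. Inspect the plan for the join strategy.
    orders.join(broadcast(customers), Seq("customer_id")).explain()

    // Disable automatic broadcasting so the un-hinted join falls back to a
    // shuffle-based strategy, then compare its plan with the one above.
    spark.conf.set("spark.sql.autoBroadcastJoinThreshold", "-1")
    orders.join(customers, Seq("customer_id")).explain()

    spark.stop()
  }
}
```

Comparing the two `explain()` outputs (or the corresponding DAGs in the Spark UI while the application is running) is one way to see where the shuffle-based plan adds exchange stages that the broadcast plan avoids.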

How to run a Spark program through IntelliJ?

  • Build the project using sbt package
  • Set the main class to org.apache.spark.deploy.SparkSubmit
  • Set the program arguments to --master local --class <main_class> target/scala-2.12/<jar_name>
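
For the build step above, a minimal `build.sbt` might look like the sketch below. The project name, version, and the Scala and Spark version numbers are assumptions chosen only to match the `target/scala-2.12` path; check the repository's own build file for the real values. The Spark dependency is left at the default compile scope on the assumption that IntelliJ needs `org.apache.spark.deploy.SparkSubmit` on the run classpath.

```scala
// build.sbt: a minimal sketch; name and versions are assumptions.
name := "join-transformations"
version := "0.1.0"

// Must be a 2.12.x release so the jar lands under target/scala-2.12/.
scalaVersion := "2.12.17"

// Spark SQL provides SparkSession and DataFrame joins, and pulls in
// spark-core, which contains org.apache.spark.deploy.SparkSubmit.
libraryDependencies += "org.apache.spark" %% "spark-sql" % "3.3.2"
```

With these hypothetical settings, `sbt package` would write a jar named after the project, Scala binary version, and project version under `target/scala-2.12/`, which is the file to pass as `<jar_name>`.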
