Skip to content

Latest commit

 

History

History
 
 

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

README.md


Chapters

This directory contains all of the chapter codes for "Data Algorithms with Spark".


Bonus Chapters

The following directories are bonus chapters:

Bonus Chapter Description
Data Design Patterns Practical Data Design Patterns
Word Count Provided multiple solutions for word count problem using reduceByKey() and groupByKey() reducers.
Anagrams Find words, which are anagrams: provided multiple solutions for anagrams problem using reduceByKey(), groupByKey(), and combineByKey() reducers.
Lambda Expressions How to use Lambda Expressions in PySpark programs
TF-IDF Term Frequency - Inverse Document Frequency
K-mers K-mers for DNA Sequences
Correlation All vs. All Correlation

Data Algorithms with Spark