Skip to content

gavr-dev/hadoop-load-testing

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Hadoop Load Testing Utils

The repository contains a set of utilities for preparing data (regular and parquet files) and generating load on Apache Hadoop.

HDFS file generator

python files-generator.py \
  --nameservice '' \
  --parquet \
  --hdfs_upload_folder '' \
  --folder_name '' \
  --sub_folder_count 10 \
  --files_count 500 \
  --file_size 5 \
  --hive_server "" \
  --database default \
  --local_tmp_folder /tmp

HDFS testing

python load-hdfs-testing.py \
  --path '' \
  --parallel_threads 4 \
  --request_count 10 \
  --download_folder /tmp

Hive testing

python load-hive-testing.py \
  --hive_server '' \
  --parallel_threads 4 \
  --request_count 10 \
  --database '' \
  --table '' \
  --select

Spark testing

python load-spark-testing.py \
  --nameservice '' \
  --parallel_threads 4 \
  --request_count 10 \
  --path '' \
  --select

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors

Languages