sparkioref

Benchmark the I/O performance of Apache Spark (Scala/Python). Currently supported formats: CSV/JSON, Parquet, FITS.

Run the benchmark

Edit the run_benchmarks.sh script with your data and cluster configuration, then launch it with:

./run_benchmarks.sh
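
This README does not show what the script measures internally. As a rough illustration only, the sketch below times repeated reads of a dataset with PySpark. The helper names `time_action` and `benchmark_read` are hypothetical and not part of this repository, and the sketch assumes an existing `SparkSession` is passed in.

```python
import time


def time_action(action, repeats=3):
    """Run `action` `repeats` times; return wall-clock durations in seconds."""
    durations = []
    for _ in range(repeats):
        start = time.perf_counter()
        action()
        durations.append(time.perf_counter() - start)
    return durations


def benchmark_read(spark, path, fmt="parquet", repeats=3):
    """Time loading `path` with the given format (hypothetical helper).

    Spark reads are lazy, so count() is called to trigger a job.
    Note: for columnar formats such as Parquet, count() may not need
    to read every column, so this is not a full-scan measurement.
    """
    def action():
        spark.read.format(fmt).load(path).count()

    return time_action(action, repeats)
```

With a live session, one might call `benchmark_read(spark, "/path/to/data", fmt="csv")` and compare the returned durations across formats. The first run is often slower than later ones because of caching, so looking at all runs rather than a single number is usually safer.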

Configuration:
