Hadoop - Benchmark

> Database > (Apache) Hadoop

1 - Tool

1.1 - Gridmix

MapReduce workload.

GridMix is a benchmark for Hadoop clusters. It works from a MapReduce job trace describing the workload. Such traces are typically generated by Rumen. Rumen mines JobHistory logs to extract meaningful data and stores it in an easily-parsed, condensed format or digest.

Advertising

1.2 - Yarn Scheduler Load Simulator (SLS)

1.3 - Distributed System Testing

1.4 - Hive

1.5 - Spark

2 - Documentation / Reference

db/hadoop/benchmark.txt · Last modified: 2018/06/19 14:48 by gerardnico