Scalability Tests were performed on a 9 node cluster of bare metal nodes.
Note: TPC-DS tests were done on Intel Ice Lake processors(Graph 2).
Few yarn and spark configuration details used on VM's listed below:
dfs.block.size | 512M |
yarn.scheduler.maximum-allocation-mb | 40g |
yarn.scheduler.maximum-allocation-vcores | 15 |
yarn.nodemanager.resource.cpu-vcores | 16 |
yarn.nodemanager.resource.memory-mb | 64g |
mapreduce.map.memory.mb | 1024 |
mapreduce.reduce.memory.mb | 3072 |
mapred.reduce.parallel.copies | 16 |
mapreduce.reduce.shuffle.parallelcopies | 14 |
mapreduce.map.java.opts | Xmx2048m |
spark.master | yarn |
spark.executor.cores | 5 |
spark.executor.memory | 18g |
spark.executor.instances | 3 |