Scalability tests were performed on a 9-node cluster of bare-metal nodes.
Note: TPC-DS tests were run on Intel Ice Lake processors (Graph 2).
A few of the YARN and Spark configuration settings used on the VMs are listed below:

| Parameter | Value |
| --- | --- |
| dfs.block.size | 512M |
| yarn.scheduler.maximum-allocation-mb | 40g |
| yarn.scheduler.maximum-allocation-vcores | 15 |
| yarn.nodemanager.resource.cpu-vcores | 16 |
| yarn.nodemanager.resource.memory-mb | 64g |
| mapreduce.map.memory.mb | 1024 |
| mapreduce.reduce.memory.mb | 3072 |
| mapred.reduce.parallel.copies | 16 |
| mapreduce.reduce.shuffle.parallelcopies | 14 |
| mapreduce.map.java.opts | -Xmx2048m |
| spark.master | yarn |
| spark.executor.cores | 5 |
| spark.executor.memory | 18g |
| spark.executor.instances | 3 |
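
As a rough sanity check on the Spark values above, the sketch below estimates the YARN container size each executor would request and compares it with the scheduler limits in the table. It relies on assumptions that are not stated in this document: Spark's default executor memory overhead (10% of executor memory, with a 384 MiB minimum) and YARN's default `yarn.scheduler.minimum-allocation-mb` of 1024 MiB as the rounding step. If either was overridden in the test cluster, the numbers will differ.

```python
# Sizing sketch for the Spark-on-YARN values in the table above.
# ASSUMPTIONS (not from the source): default Spark executor memory overhead
# (10% of executor memory, 384 MiB minimum) and a YARN allocation step of
# 1024 MiB (the default yarn.scheduler.minimum-allocation-mb).

MIB_PER_GIB = 1024

# Values taken from the table.
executor_memory_mib = 18 * MIB_PER_GIB      # spark.executor.memory = 18g
executor_cores = 5                          # spark.executor.cores
executor_instances = 3                      # spark.executor.instances
max_allocation_mib = 40 * MIB_PER_GIB       # yarn.scheduler.maximum-allocation-mb
max_allocation_vcores = 15                  # yarn.scheduler.maximum-allocation-vcores

# Assumed defaults (see note above).
OVERHEAD_FACTOR = 0.10
MIN_OVERHEAD_MIB = 384
ALLOCATION_STEP_MIB = 1024


def container_request_mib(heap_mib: int) -> int:
    """Executor heap plus the assumed default memory overhead."""
    overhead = max(int(heap_mib * OVERHEAD_FACTOR), MIN_OVERHEAD_MIB)
    return heap_mib + overhead


def round_up(value: int, step: int) -> int:
    """Round value up to the next multiple of step."""
    return -(-value // step) * step


request_mib = container_request_mib(executor_memory_mib)
granted_mib = round_up(request_mib, ALLOCATION_STEP_MIB)

fits_memory = granted_mib <= max_allocation_mib
fits_vcores = executor_cores <= max_allocation_vcores

# Footprint of all executors combined (containers only, excluding the
# application master).
total_mib = granted_mib * executor_instances
total_vcores = executor_cores * executor_instances

print(f"per-executor request: {request_mib} MiB, granted: {granted_mib} MiB")
print(f"within max allocation: memory={fits_memory}, vcores={fits_vcores}")
print(f"all executors: {total_mib / MIB_PER_GIB:.0f} GiB, {total_vcores} vcores")
```

Under these assumptions each executor container comes to about 20 GiB, well below the 40g maximum allocation, and the three executors together need roughly 60 GiB and 15 vcores. That would fit within a single NodeManager's 64g and 16 vcores if all of them happened to be placed on one node, although the document does not say how executors were distributed.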