Evaluating Apache Spark and Alluxio for Data Analytics
This whitepaper details how to evaluate Alluxio’s data orchestration platform as a distributed cache for Apache Spark in a public cloud or on-premises. We discuss best practices and benchmarking results with a combination of standard industry benchmarking suites, such as TPC-DS and HiBench, on cloud storage.
Tags: analytics, benchmarks, cloud, data analytics, spark, whitepaper