Alluxio at Strata + Hadoop World San Jose 2017
Calvin Jia introduces Alluxio, explain how Alluxio can help Spark be more effective, show benchmark results with Spark RDDs and DataFrames, and describe production deployments with both Alluxio and Spark working together.
Tags: alluxio engineering, apache spark, aws s3, ceph, conference, data, data engineering, data orchestration, Gluster, Google Cloud Storage, hdfs, NFS, performance, scale, spark, storage