Presentation Slides from our Latest Talks - Archives

On Demand Video

How to Build a new Under Filesystem in Alluxio: Apache Ozone as an Example

In Alluxio, an Under File System is the plugin that connects to any file system or object store, so users can mount different storage systems …

On Demand Video

Bursting Spark or Presto Jobs to AWS using Alluxio

In this office hour, we demonstrate how a “zero-copy burst” solution helps to speed up Spark and Presto queries in the public cloud while …

On Demand Video

Tech Talk: Build a hybrid data lake and burst processing to Google Cloud Dataproc with Alluxio

Join us for this tech talk where we will show you how Alluxio can help burst your private computing environment to Google Cloud, minimizing …

On Demand Video

Optimizing Latency-Sensitive Queries for Presto at Facebook: A Collaboration Between Presto & Alluxio

For many latency-sensitive SQL workloads, Presto is often bound by retrieving distant data. In this talk, Rohit Jain, James Sun from Facebook and Bin …

On Demand Video

Burst Presto & Spark workloads to AWS EMR with no data copies

In this talk, we will show you how to leverage any public cloud (AWS, Google Cloud Platform, or Microsoft Azure) to scale analytics workloads …

On Demand Video

Ultra Fast Deep Learning in Hybrid Cloud Using Intel Analytics Zoo & Alluxio

Today, many people run deep learning applications with training data from separate storage such as object storage or remote data centers. This presentation will …

On Demand Video

Scalable and Highly-available Distributed File System Metadata Service Using gRPC, RocksDB and RAFT

Alluxio (alluxio.io) is an open-source data orchestration system that provides a single namespace federating multiple external distributed storage systems. It is critical for Alluxio …

On Demand Video

Optimizing Query Performance by Decoupling Presto and Hive Data Warehouse

Ideally, Presto would access data independently from how the data was originally stored or managed. Alluxio, as a data orchestration layer, provides the physical …

On Demand Video

Bursting Apache Spark Workloads to the Cloud on Remote Data

Accessing data to run analytic workloads in Spark across data centers and/or clouds can be challenging. Additionally, network I/O can bottleneck Spark jobs that …