Alluxio Resources

Find our rich collection of White Papers, Case Studies, Presentations, and Videos here.

Resources
Blog
Efficient Model Training in the Cloud with Kubernetes, TensorFlow, and Alluxio

Alibaba, Alluxio, and Nanjing University collaborated to tackle the problems of Deep Learning model training in the cloud. Our goal was to reduce the cost and complexity of data access for Deep Learning training in a hybrid environment, and the work resulted in a reduction of over 40% in training time and cost.

Blog
Everything you want to know about how to decouple SQL engines from Hive Data Warehouse

Are you using SQL engines, such as Presto, to query an existing Hive data warehouse? Are you running into challenges such as an overloaded Hive Metastore with slow and unpredictable access, unoptimized data formats and layouts (for example, too many small files), or a lack of influence over the existing Hive system and other Hive applications?

White Papers
Apache Spark DataFrame caching with Alluxio

Many organizations deploy Alluxio together with Spark for performance gains and data manageability benefits. Qunar recently deployed Alluxio in production, and their Spark streaming …

On-Demand Videos
Alluxio at Spark Summit EU 2017

We briefly introduce Alluxio and present different ways Alluxio can help Spark jobs, along with best practices. We also discuss how Alluxio can be …

Slides from our latest talks
Alluxio Product Overview

In the past year, the Alluxio project experienced significant improvement in performance and scalability and was extended with key new features including tiered storage, …