Alluxio Resources

Find our rich collection of White Papers, Case Studies, Presentations, and Videos here.

On Demand Video

Real-Time Analytics: Going Beyond Stream Processing With Apache Pinot

Streaming systems form the backbone of the modern data pipeline, as stream processing capabilities provide insights on events as they arrive. But what … Continued

On Demand Video

ML-Based SQL Query Resource Usage Prediction

With the advent of the Big Data era, it is usually computationally expensive to calculate the resource usage of a SQL query. Can we … Continued

On Demand Video

The Architecture Overview of OceanBase DataBase

OceanBase Database is an open-source, distributed Hybrid Transactional/Real-time Operational Analytics (HTAP) database management system that has set new world records in both the TPC-C … Continued

On Demand Video

Deconstructing a Machine Learning Pipeline with Virtual Data Lake

As more and more companies turn to AI / ML / DL to unlock insight, AI has become this mythical word that adds unnecessary … Continued

Case Study

Achieving Hybrid and Multi-Cloud Architecture With Application Portability

By Fortune 50 Technology Company

A Fortune 50 technology company has successfully implemented Alluxio to achieve a hybrid-cloud strategy, become multi-cloud ready, cut costs, and boost agility. … Continued

On Demand Video

Architecting a Heterogeneous Data Platform Across Clusters, Regions, and Clouds

Alluxio foresaw the need for agility when accessing data across silos separated from compute engines like Spark, Presto, TensorFlow, and PyTorch. Embracing the separation … Continued

Case Study

When AI Meets Alluxio at bilibili | Building an Efficient AI Platform for Data Preprocessing and Model Training

Lei Li, AI Platform Lead, and Zifan Ni, Senior Software Engineer from Bilibili, share how they applied Alluxio to their AI platform to increase … Continued

On Demand Video

Alluxio and Apache Ranger Best Practices

As data stewards and security teams provide broader access to their organization’s data lake environments, having a centralized way to manage fine-grained access policies … Continued