Alluxio AI Infra Day 2024

AI Infra Day | The AI Infra in the Generative AI Era

AI Infra Day | Accelerate Your Model Training and Serving with Distributed Caching

AI Infra Day | Model Lifecycle Management Quality Assurance at Uber Scale

AI Infra Day | Composable PyTorch Distributed with PT2 @ Meta

AI Infra Day | The Generative AI Market And Intel AI Strategy and Product Update

AI Infra Day | Hands-on Lab: CV Model Training with PyTorch & Alluxio on Kubernetes

Blog

Blog

Integrate Alluxio With Your Existing Data Stack Without Redefining Hive Tables

On Demand Videos

On Demand Videos

Alluxio 2.9 Release Overview

Blog

Blog

Whats New in Alluxio 2.9: MultiAlluxio Synchronization Kubernetes Operator and Flexible S3 Access Control

Blog

Blog

Architecting Data Orchestration Four Use Cases

Modern analytics projects rely on a hodgepodge of compute clusters, data stores, and pipelines, flung across countries and continents. Enterprises struggle to meet performance SLAs without replicating lots of data or moving and re-coding applications.

‍

On Demand Videos

On Demand Videos

Building a Distributed File System For The Cloud-Native Era

Big Data Bellevue Meetup

Blog

Blog

Tutorial of Building MultiCloud Data Lake using Delta Lake and Alluxio

On Demand Videos

On Demand Videos

Zookeeper vs Raft: Stateful Distributed Coordination with HA and Fault Tolerance

Big Data Bellevue & Cloudy With a Chance of Data Meetup

Case Study

Case Study

Achieving Hybrid and Multi-Cloud Architecture With Application Portability

A Fortune 50 technology company that serves over 1 billion users successfully implemented Alluxio to achieve a hybrid cloud strategy, become multi-cloud ready, cut costs, and boost agility.

Case Study

Case Study

Expedia Group

Unify Data Lakes Across Multiple Geographic Regions in the Cloud

On Demand Videos

On Demand Videos

Architecting Data Platform Across Regions and Clouds for Analytics and AI

Blog

Blog

Data Orchestration Simplifying Data Access for Analytics

The problem with data modernization initiatives is that they result in distributed datasets that impede analytics projects. As enterprises start their cloud migration journey, adopt new types of applications, data stores, and infrastructure, they still leave residual data in the original location. This results in far-flung silos that can be slow, complex and expensive to analyze. As business demands for analytics rise—along with cloud costs—enterprises need to rationalize how they access and process distributed data. They cannot afford to replicate entire datasets or rewrite software every time they study data in more than one location.

‍

Presentation

Presentation

Unified Data API for Distributed Cloud Analytics and AI

ALLUXIO DAY x APAC Modern Data Stack 2022

Alluxio (www.alluxio.io) is an open-source virtual distributed file system that provides a unified data access layer for hybrid and multi-cloud deployments. It enables distributed compute engines like Spark, Presto or Machine Learning frameworks like TensorFlow to transparently access different persistent storage systems (including HDFS, S3, Azure and etc) while actively leveraging in-memory cache to accelerate data access. Developed originally from UC Berkeley AMPLab as research project “Tachyon”, Alluxio has more than 1200 contributors and is used by over 100 companies worldwide with the largest production deployment over 1000 nodes.