Alluxio AI Infra Day 2024

AI Infra Day | The AI Infra in the Generative AI Era

AI Infra Day | Accelerate Your Model Training and Serving with Distributed Caching

AI Infra Day | Model Lifecycle Management Quality Assurance at Uber Scale

AI Infra Day | Composable PyTorch Distributed with PT2 @ Meta

AI Infra Day | The Generative AI Market And Intel AI Strategy and Product Update

AI Infra Day | Hands-on Lab: CV Model Training with PyTorch & Alluxio on Kubernetes

Blog
From ZooKeeper to Raft: How Alluxio Stores File System State with High Availability and Fault Tolerance
Raft is an algorithm for state machine replication that provides high availability (HA) and fault tolerance. This blog shares how Alluxio moved to a ZooKeeper-less, built-in, Raft-based journal system as its HA implementation.
Large Scale Analytics Acceleration

Blog
Recommendations to Level Up Your Machine Learning Platform
As machine learning (ML) and artificial intelligence (AI) applications become more business-critical, organizations are racing to advance their AI/ML capabilities. Realizing the full potential of AI/ML requires the right underlying machine learning platform.
Data Migration
GPU Acceleration
Model Training Acceleration

Blog
Orchestrating Data for Machine Learning Pipelines
This article will discuss a new solution to orchestrating data for end-to-end machine learning pipelines that addresses the above questions. I will outline common challenges and pitfalls, followed by proposing a new technique, data orchestration, to optimize the data pipeline for machine learning.
GPU Acceleration
Model Training Acceleration

Blog
Improving Presto Architectural Decisions with Alluxio Shadow Cache at Meta (Facebook)
In a collaboration between Meta (Facebook), Princeton University, and Alluxio, we have developed "Shadow Cache", a lightweight Alluxio component that tracks the working set size and the infinite cache hit ratio. Shadow Cache dynamically tracks the working set size over a past time window and is implemented as a series of Bloom filters. It is deployed in Presto at Meta (Facebook), where it is used to understand system bottlenecks and to help with routing design decisions.
Large Scale Analytics Acceleration
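
As a rough illustration of the idea in the Shadow Cache summary above, the following Python sketch keeps one Bloom filter per time slice and drops the oldest slice as the window advances. It is a minimal sketch under stated assumptions, not Alluxio's implementation: the class names (BloomFilter, ShadowCache), the parameters (num_slices, num_bits, num_hashes), and the rotate() method are all hypothetical, and the working set size is estimated from the fraction of set bits in the union of the filters.

```python
import hashlib
import math
from collections import deque


class BloomFilter:
    """Small Bloom filter backed by a Python int used as a bit array."""

    def __init__(self, num_bits=1 << 16, num_hashes=4):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = 0

    def _positions(self, key):
        # Double hashing: derive num_hashes bit positions from one digest.
        digest = hashlib.sha256(key.encode("utf-8")).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1
        return [(h1 + i * h2) % self.num_bits for i in range(self.num_hashes)]

    def add(self, key):
        for p in self._positions(key):
            self.bits |= 1 << p

    def __contains__(self, key):
        return all((self.bits >> p) & 1 for p in self._positions(key))


class ShadowCache:
    """Hypothetical sketch: tracks accesses over a sliding window of slices."""

    def __init__(self, num_slices=4, num_bits=1 << 16, num_hashes=4):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        # deque(maxlen=...) discards the oldest filter automatically.
        self.filters = deque([BloomFilter(num_bits, num_hashes)],
                             maxlen=num_slices)
        self.requests = 0
        self.hits = 0

    def access(self, key):
        """Record one access; it is a hit if the key was seen in the window."""
        self.requests += 1
        if any(key in f for f in self.filters):
            self.hits += 1
        self.filters[-1].add(key)

    def rotate(self):
        """Start a new time slice (call this on a timer in a real system)."""
        self.filters.append(BloomFilter(self.num_bits, self.num_hashes))

    def working_set_size(self):
        """Estimate distinct keys in the window from the union's set bits."""
        union = 0
        for f in self.filters:
            union |= f.bits
        set_bits = bin(union).count("1")
        if set_bits >= self.num_bits:
            return float("inf")  # filter saturated; estimate is unbounded
        return -(self.num_bits / self.num_hashes) * math.log(
            1 - set_bits / self.num_bits)

    def hit_ratio(self):
        """Hit ratio of a cache large enough to hold the whole window."""
        return self.hits / self.requests if self.requests else 0.0
```

Because Bloom filters admit false positives, both numbers are estimates: the hit ratio can be slightly overstated, and the size estimate degrades as the filters fill up, so num_bits would need to be sized for the expected number of distinct keys per window.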


Blog
Accelerate Auto Data Tagging with Alluxio and Spark in Hybrid Cloud: A Practice in WeRide
This blog shares the practice of using Alluxio and Spark to accelerate the auto data tagging system in WeRide, an autonomous driving technology company.
Hybrid Multi-Cloud
GPU Acceleration
Large Scale Analytics Acceleration
Model Training Acceleration

Blog
Pair Spark with Alluxio to Modernize Your Data Platform
Alluxio is the data orchestration platform to unify data silos across heterogeneous environments. This is the last article in a series to give you the basics of Alluxio’s architecture and solution.
Data Platform Modernization
GPU Acceleration


Blog
Using Consistent Hashing in Presto to Improve Caching Data Locality in Dynamic Clusters
Running Presto with Alluxio is gaining popularity in the community. It avoids the long latency of reading data from remote storage by using SSD or memory to cache hot datasets close to Presto workers. Presto supports hash-based soft affinity scheduling to enforce that only one or two copies of the same data are cached in the entire cluster, which improves cache efficiency by allowing more hot data to be cached locally. The current hashing algorithm, however, does not work well when the cluster size changes. This article introduces a new hashing algorithm for soft affinity scheduling, consistent hashing, to address this problem.
Large Scale Analytics Acceleration
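
To make the difference concrete, here is a minimal Python sketch that compares modulo-based assignment with a consistent hash ring. It is an illustration under assumptions, not Presto's scheduler code: ConsistentHashRing, modulo_assign, the virtual_nodes parameter, and the worker and key names are all hypothetical.

```python
import bisect
import hashlib


def stable_hash(key):
    """Stable 64-bit hash (Python's built-in hash() is salted per process)."""
    return int.from_bytes(hashlib.md5(key.encode("utf-8")).digest()[:8], "big")


def modulo_assign(key, workers):
    """Baseline: the owner changes for most keys when len(workers) changes."""
    return workers[stable_hash(key) % len(workers)]


class ConsistentHashRing:
    """Hash ring with virtual nodes to even out the load per worker."""

    def __init__(self, nodes=(), virtual_nodes=100):
        self.virtual_nodes = virtual_nodes
        self._hashes = []  # sorted virtual-node positions on the ring
        self._owners = []  # owner of the position at the same index
        for node in nodes:
            self.add_node(node)

    def add_node(self, node):
        for i in range(self.virtual_nodes):
            h = stable_hash(f"{node}#{i}")
            idx = bisect.bisect_left(self._hashes, h)
            self._hashes.insert(idx, h)
            self._owners.insert(idx, node)

    def remove_node(self, node):
        kept = [(h, n) for h, n in zip(self._hashes, self._owners) if n != node]
        self._hashes = [h for h, _ in kept]
        self._owners = [n for _, n in kept]

    def get_node(self, key):
        if not self._hashes:
            raise ValueError("ring has no nodes")
        # First virtual node clockwise from the key, wrapping at the end.
        idx = bisect.bisect_right(self._hashes, stable_hash(key))
        return self._owners[idx % len(self._hashes)]


def moved_fraction(before, after):
    """Fraction of keys whose assigned worker differs between two mappings."""
    changed = sum(1 for k in before if before[k] != after[k])
    return changed / len(before)
```

With the ring, adding a worker only reassigns the keys that now fall on the new worker's virtual nodes (roughly 1/N of them for N workers), so the cached data on the other workers stays useful. With modulo assignment, most keys change owner whenever the worker count changes.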

Blog
Alluxio and Apache Ranger Best Practices
As data stewards and security teams provide broader access to their organization’s data lake environments, having a centralized way to manage fine-grained access policies becomes increasingly important. Alluxio can use Apache Ranger’s centralized access policies in two ways: 1) directly controlling access to virtual paths in the Alluxio virtual file system or 2) enforcing existing access policies for the HDFS under stores.

Blog
Thousand-Node Alluxio Cluster Powers Game AI Platform: A Production Case Study from Tencent
To give model training the best experience, Tencent has implemented a 1000-node Alluxio cluster and designed a scalable, robust, and performant architecture to speed up Ceph storage for game AI training. This blog gives insight into how Alluxio has been implemented and optimized at Tencent.
Model Training Acceleration