Products
Blog

Alluxio's Strong Q2: Sub-Millisecond AI Latency, 50%+ Customer Growth, and Industry-Leading MLPerf Results
Alluxio's strong Q2 featured Enterprise AI 3.7 launch with sub-millisecond latency (45× faster than S3 Standard), 50%+ customer growth including Salesforce and Geely, and MLPerf Storage v2.0 results showing 99%+ GPU utilization, positioning the company as a leader in maximizing AI infrastructure ROI.

How Blackout Power Trading Achieved Multi-Join Double-Digit Millisecond Latency Offline Feature Store Performance with Alluxio Low Latency Caching
In this blog, Greg Lindstrom, Vice President of ML Trading at Blackout Power Trading, an electricity trading firm in North American power markets, shares how they leverage Alluxio to power their offline feature store. This approach delivers multi-join query performance in the double-digit millisecond range, while maintaining the cost and durability benefits of Amazon S3 for persistent storage. As a result, they achieved a 22 to 37x reduction in large-join query latency for training and a 37 to 83x reduction in large-join query latency for inference.
.png)
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
.jpeg)
Speed Up Ubers Presto with Alluxio A collaboration between Uber and Alluxio part 1
This article shares how Uber and Alluxio collaborated to design and implement Presto local cache to reduce HDFS latency.
Hybrid Multi-Cloud
Large Scale Analytics Acceleration
.jpeg)
Deep Dive into the Implementation of Alluxio Metadata Storage
This article introduces the design and implementation of metadata storage in Alluxio Master, either on heap and off heap (based on RocksDB).
No items found.

Whats New in Alluxio 2.8: Enhanced S3 API Functionality Enterprise-grade Security and Data Migration With Better Usability and Low Cost
No items found.
.jpeg)
From Zookeeper to Raft: How Alluxio Stores File System State with High Availability and Fault Tolerance
Raft is an algorithm for state machine replication as a way to ensure high availability (HA) and fault tolerance. This blog shares how Alluxio has moved to a Zookeeper-less, built-in Raft-based journal system as a HA implementation.
Large Scale Analytics Acceleration
.jpeg)
Recommendations to Level Up Your Machine Learning Platform
With machine learning (ML) and artificial intelligence (AI) applications becoming more business-critical, organizations are in the race to advance their AI/ML capabilities. To realize the full potential of AI/ML, having the right underlying machine learning platform is a prerequisite.
Data Migration
GPU Acceleration
Model Training Acceleration
.jpeg)
Orchestrating Data for Machine Learning Pipelines
This article will discuss a new solution to orchestrating data for end-to-end machine learning pipelines that addresses the above questions. I will outline common challenges and pitfalls, followed by proposing a new technique, data orchestration, to optimize the data pipeline for machine learning.
GPU Acceleration
Model Training Acceleration
.jpeg)
From Cache to Cash: Introducing NFT for Data Orchestration
Today, we are excited to announce the launch of Non-fungible token (NFT) as a new feature in our leading data orchestration platform.
No items found.
.jpeg)
Improving Presto Architectural Decisions with Alluxio Shadow Cache at Meta Facebook
With the collaboration between Meta (Facebook), Princeton University, and Alluxio, we have developed "Shadow Cache" – a lightweight Alluxio component to track the working set size and infinite cache hit ratio. Shadow cache can keep track of the working set size over the past window dynamically and is implemented by a series of bloom filters. Shadow cache is deployed in Meta (Facebook) Presto and is being leveraged to understand the system bottleneck and help with routing design decisions.
Large Scale Analytics Acceleration

Accelerate Auto Data Tagging with Alluxio and Spark in Hybrid Cloud A Practice in WeRide
This blog shares the practice of using Alluxio and Spark to accelerate the auto data tagging system in WeRide, an autonomous driving technology company.
Hybrid Multi-Cloud
GPU Acceleration
Large Scale Analytics Acceleration
Model Training Acceleration
.jpeg)
Pair Spark with Alluxio to Modernize Your Data Platform
Alluxio is the data orchestration platform to unify data silos across heterogeneous environments. This is the last article in a series to give you the basics of Alluxio’s architecture and solution.
Data Platform Modernization
GPU Acceleration
Your selections don't match any items.