Blog

Inferless Slashes AI Model Loading Time by 10x in LLM Serving Infrastructure Using Alluxio

Inferless solved critical I/O bottlenecks in LLM inference infrastructure by implementing Alluxio, achieving 10x faster model loading (from ~200 Mbps to 2+ Gbps), reducing cold start times from minutes to seconds, and significantly improving customer experience.

New Features in Alluxio Enterprise AI 3.6

Learn about the latest features in Alluxio AI 3.6, including Accelerated AI Cold Starts for inference servers, pushdown parquet query acceleration, and more!

How Coupang Leverages Distributed Cache to Accelerate Machine Learning Model Training

Coupang, a Fortune 200 technology company, manages a multi-cluster GPU architecture for their AI/ML model training. This architecture introduced significant challenges, including:

Time-consuming data preparation and data copy/movement
Difficulty utilizing GPU resources efficiently
High and growing storage costs
Excessive operational overhead maintaining storage for localized data silos

To resolve these challenges, Coupang’s AI platform team implemented a distributed caching system that automatically retrieves training data from their central data lake, improves data loading performance, unifies access paths for model developers, automates data lifecycle management, and extends easily across Kubernetes environments. The new distributed caching architecture has improved model training speed, reduced storage costs, increased GPU utilization across clusters, lowered operational overhead, enabled training workload portability, and delivered 40% better I/O performance compared to parallel file systems.

Alipay: Optimizing Alluxio for Efficient Large-Scale Training on Billions of Files

Cross Cluster Synchronization in Alluxio: Part 2 - Mechanism

This is part 2 of the blog series talking about the design and implementation of the Cross Cluster Synchronization mechanism in Alluxio. In the previous blog, we discussed the scenario, background and how metadata sync is done with a single Alluxio cluster. This blog will describe how metadata sync is built upon to provide metadata consistency in a multi-cluster scenario.

‍

Cross Cluster Synchronization in Alluxio: Part 3 - Discussions and Conclusion

Following part 1 and part 2, this final blog of the series discusses some design decisions and details, as well as certain future work.

Cross Cluster Synchronization in Alluxio: Part 1 - Scenarios and Background

This is a blog series talking about the design and implementation of the Cross Cluster Synchronization mechanism in Alluxio. This mechanism ensures that the metadata is consistent when running multiple Alluxio clusters. Part 1 of this blog series discusses the scenario and background.

Data Access as a Service at Shopee: Using Alluxio to Accelerate Interactive Queries and Enhance Developer Experience with Flexible APIs

Get Started with Trino and Alluxio in 5 Minutes

Hopping into the Year of Rabbit with Alluxio Community

Whats Next for Data Analytics AI and Cloud in 2023

Integrate Alluxio With Your Existing Data Stack Without Redefining Hive Tables

Architecting Data Orchestration: Four Use Cases

Modern analytics projects rely on a hodgepodge of compute clusters, data stores, and pipelines, flung across countries and continents. Enterprises struggle to meet performance SLAs without replicating lots of data or moving and re-coding applications.

‍

Whats New in Alluxio 2.9: Multi-Alluxio Synchronization, Kubernetes Operator and Flexible S3 Access Control

Tutorial of Building Multi-Cloud Data Lake using Delta Lake and Alluxio

Your selections don't match any items.

Sign-up for a Live Demo or Book a Meeting with a Solutions Engineer

Request a demo