Alluxio Products and Releases - Community and Enterprise Editions

What’s New In Alluxio Enterprise AI 3.2: GPU Acceleration, Python Filesystem API, Write Checkpointing and More!

July 9, 2024 By Shouwei Chen and Adit Madan

Performance, cache operability, and cost efficiency are key considerations for AI platform teams supporting large scale model training and distribution. In 2023, we launched Alluxio Enterprise AI, for managing AI training and model distribution I/O across diverse environments, whether in a single storage with diverse computing clusters or in a more complex multi-cloud, multi-data center … Continued

How Can AI Platforms Adapt to Hybrid or Multi-Cloud Environments?

May 20, 2024 By Hope Wang

This article was originally published on Spiceworks. https://www.spiceworks.com/tech/artificial-intelligence/guest-article/adapting-ai-platform-to-hybrid-cloud/ This blog discusses the challenges of implementing AI platforms in hybrid and multi-cloud environments and shares examples of organizations that have prioritized security and optimized cost management using the data access layer. In recent years, AI platforms have undergone significant transformations as GenAI and AI continue to … Continued

Maximize GPU Utilization for Model Training

April 3, 2024 By Hope Wang

GPU utilization or GPU usage, is the percentage of GPUs’ processing power being used at a particular time. As GPUs are expensive resources, optimizing their utilization and reducing idle time is essential for enterprise AI infrastructure. This blog explores bottlenecks hindering GPU utilization during model training and provides solutions to maximize GPU utilization. 1. Why … Continued

Why Adding NAS/NFS on Object Storage May not Solve Your Data Access Problem of AI

November 28, 2023 By Tarik Bennett, Beinan Wang and Hope Wang

In this blog, we discuss the data access challenges in AI and why commonly used NAS/NFS may not be a good option for your organization. 1. Early Architecture of AI/ML According to Gartner, although LLMs are on the hype, most organizations are in the early stages, with some in production. In the early stages of … Continued

Introducing Alluxio Enterprise AI and A Vision Beyond Unintelligent Storage

October 18, 2023 By Adit Madan, Bin Fan and Haoyuan Li

We take great pride in the Alluxio Data Platform serving many of the most critical data-driven applications in the world as we speak today. Each of us interact with platforms empowered by Alluxio on a daily basis, and unknowingly you are as well. From the voice assistant we speak to, the bank we transact with, … Continued

Alluxio Kubernetes Operator Tutorial: Simplifying Deploying and Managing Alluxio Clusters

August 14, 2023 By Shawn Sun, Beinan Wang and Hope Wang

This blog provides a tutorial on using the Kubernetes operator to simplify deploying and managing Alluxio clusters on Kubernetes. Introduction The Alluxio Kubernetes operator makes deploying and managing Alluxio and the datasets on Kubernetes easier. With the operator, Alluxio clusters can be deployed and managed seamlessly like any other native Kubernetes application. The operator handles … Continued

Top Tips and Tricks for PyTorch Model Training Performance Tuning [2023]

July 22, 2023 By Hope Wang, Beinan Wang and Chunxu Tang

Get the latest and greatest tips to accelerate your PyTorch model training for machine learning and deep learning. PyTorch, an open-source machine learning framework, has become the de facto choice for many organizations to develop and deploy deep learning models. Model training is the most compute-intensive phase of the machine learning pipeline. It requires continuous … Continued

What’s New in Alluxio Enterprise 2.10: Radically Resource-efficient for Improved Speed at Lower Cost

June 20, 2023 By Adit Madan

We are pleased to unveil the latest version of Alluxio. This new release represents a significant milestone to enhance system reliability under different kinds of resource limitations or stress scenarios, particularly to get the most out of limited hardware resources to scale at manageable costs. Enhanced Functionality: Dramatic Improvements in High Availability (HA): Mission-critical applications … Continued

Category: Product