ml Archives | Page 2 of 2

Machine Learning Model Training with Alluxio: Part 3 – Benchmarking

January 18, 2022 By Lu Qiu and Bin Fan

This blog is the last one in the machine learning series. Our first blog introduced the what and why of our solution, and the second blog compared traditional and Alluxio solutions. This blog will demonstrate how to set up and benchmark the end-to-end performance of the training process.

Machine Learning Model Training with Alluxio: Part 2 – Comparable Analysis

January 12, 2022 By Lu Qiu and Bin Fan

This blog is the second in the machine learning series following the previous one, which discussed Alluxio’s solution to improve training performance and simplify data management. With the help of Alluxio, loading data from cloud storage, training and caching data can be done in a transparent and distributed way as a part of the training process, thus improving training performance and simplifying data management. In this blog 2 of the series, we focus on comparing traditional solutions with Alluxio’s.

Machine Learning Model Training with Alluxio: Part 1 – Solution Overview

January 6, 2022 By Lu Qiu, Bin Fan and Hope Wang

In this blog, we provide an overview of Alluxio’s AI/ML model training solution. For more details about the reference architecture and benchmarking results, please refer to the full length whitepaper.

Speed up Large-scale ML/DL Offline Inference Jobs with Alluxio at Microsoft Bing

January 6, 2022 By Binyang Li and Qianxi Zhang

Running inference at scale is challenging. In this blog, we will share our observations and the practice to use Alluxio to speed up the I/O performance for large-scale ML/DL offline inference at Microsoft Bing.

Speeding up TensorFlow and PyTorch with Alluxio

September 9, 2021

The Alluxio core engineering team re-designed things to come up with a more efficient and transparent way for users to leverage data orchestration through the POSIX interface. This enables much better performance for ML workloads where data is accessed via the POSIX interface.

Tags: data orchestration, fuse, ml, performance, POSIX, pytorch, tensorflow

Tag: ml