office hour Archives

Speeding up TensorFlow and PyTorch with Alluxio

Alluxio Tech Talk * September 9, 2021

Driven by strong interest from our open-source community, the Alluxio core team started to redesign an efficient and transparent way for users to leverage data orchestration through the POSIX interface.

Introducing what’s new in Alluxio 2.5

April 24, 2021

Alluxio 2.5 focuses on improving interface support to broaden the set of data-driven applications that can benefit from data orchestration. The POSIX and S3 client interfaces have greatly improved in performance and functionality in response to widespread usage and demand from AI/ML workloads and system administration needs. Alluxio is rapidly evolving to meet the needs of enterprises that are deploying it as a key component of their AI/ML stacks.

Tags: alluxio engineering, data orchestration, hybrid cloud, office hour, release

What’s New in Alluxio 2.5

Community Online Office Hour * April 15, 2021

Alluxio 2.5 focuses on improving interface support to broaden the set of data-driven applications that can benefit from data orchestration.

Introduction to what’s new in Alluxio 2.4

Community Online Office Hour * November 5, 2020

Alluxio 2.4.0 focuses on features critical to large-scale production deployments in cloud and hybrid cloud environments. Features such as highly scalable metadata journaling, aggregate cluster metrics monitoring, and automated detection of JVM pauses further improve Alluxio’s suitability for demanding workloads.

What’s New in Alluxio 2.3

Community Online Office Hour * July 14, 2020

Alluxio 2.3 was released at the end of June 2020. Calvin and Bin will go over the new features and integrations available and share lessons learned from the community. Questions about the release and ongoing community feature development are welcome.

Bursting Spark or Presto Jobs to AWS using Alluxio

June 23, 2020

In this office hour, we demonstrate how a “zero-copy burst” solution helps speed up Spark and Presto queries in the public cloud while eliminating the need to manually copy and synchronize data from the on-premises data lake to cloud storage. This approach allows compute frameworks to decouple from on-premises data sources and scale efficiently by leveraging Alluxio and public cloud resources such as AWS.

Tags: aws, cloud storage, compute, hdfs, hybrid cloud, office hour, performance, presto, spark, zero copy bursting

Bursting Spark or Presto Jobs to AWS using Alluxio

Community Online Office Hour * June 23, 2020

Burst Presto & Spark workloads to AWS EMR with no data copies

April 28, 2020

In this talk, we will show you how to leverage any public cloud (AWS, Google Cloud Platform, or Microsoft Azure) to scale analytics workloads directly against on-premises data without copying and synchronizing the data into the cloud.

Tags: analytic workloads, cloud, hdfs, hybrid cloud, office hour, presto, public cloud, spark
