Alluxio meetups, conferences, events and more

The latest Alluxio meetups, webinars, conferences and more

Past Events:

Alluxio Day x APAC Modern Data Stack

Qanvast@OUE, OUE Downtown Gallery 1 * September 22, 2022

Join us for these great talks featuring speakers from RisingWave Labs, Onehouse, Shopee, and Alluxio! Learn how Alluxio helps the big data analytics stack become cloud-native, why the modern data stack is more than a buzzword, and how Shopee leverages Alluxio to accelerate Presto queries, and get an overview of the major community-driven features in Apache Hudi’s open-source community. Attendees can join either in person in Singapore or online via Zoom.

Alluxio Day 15

Community Virtual Event * September 15, 2022

Join fellow Alluxio community users for the 15th Alluxio Community Day virtual event featuring speakers from OceanBase, Twitter, StarTree, and Alluxio!

Deconstructing a Machine Learning Pipeline with Virtual Data Lake

Alluxio Product School * August 25, 2022

As more and more companies turn to AI / ML / DL to unlock insight, AI has become a mythical word that creates unnecessary barriers for new adopters. It is often regarded as a luxury reserved for big tech companies, but this should not be the case.

Architecting a Heterogeneous Data Platform Across Clusters, Regions, and Clouds

TDEA | Alluxio * June 30, 2022

Alluxio foresaw the need for agility when accessing data across silos separated from compute engines like Spark, Presto, TensorFlow, and PyTorch. Embracing the separation of storage from compute, the Alluxio data orchestration platform simplifies adoption of the data lake and data mesh paradigms for analytics and AI/ML.

Domain Specific — Why Data Mesh Works

DM Radio * June 16, 2022

Host @eric_kavanagh will interview legendary analyst Mike Ferguson of Intelligent Business Strategies, Bin Fan of Alluxio, and Adrian Estala of Starburst on the topic of data mesh.

Building a Distributed File System for the Cloud-Native Era

Alluxio Meetup * May 19, 2022

Today, data engineering in modern enterprises has become increasingly complex and resource-consuming, particularly because (1) the wealth of organizational data is often distributed across data centers, cloud regions, or even cloud providers, and (2) the complexity of the big data stack has grown quickly over the past few years with an explosion in big-data analytics and machine-learning engines (such as MapReduce, Hive, Spark, Presto, TensorFlow, and PyTorch, to name a few).

Alluxio and Apache Ranger Best Practices

Alluxio Product School * May 26, 2022

As data stewards and security teams provide broader access to their organization’s data lake environments, having a centralized way to manage fine-grained access policies becomes increasingly important. Alluxio can use Apache Ranger’s centralized access policies in two ways: 1) directly controlling access to virtual paths in the Alluxio virtual file system or 2) enforcing existing access policies for the HDFS under stores. This presentation discusses how the Alluxio virtual filesystem can be integrated with Apache Ranger.

Alluxio Day 12

Community Virtual Event * April 28, 2022

Join us for the 12th Alluxio Day virtual community event featuring speakers from Shopee, Websec, and Alluxio.

Geo-distributed Analytics with NetApp StorageGRID and Alluxio

Alluxio Product School * March 24, 2022

This presentation covers how Alluxio and NetApp StorageGRID help enterprises accelerate the adoption of cloud and optimize their resource spend on a modern hybrid big data architecture. The conversation will cover use cases and architectures from a variety of enterprises, along with some of the high-level technical details of how these business solutions are constructed.