Alluxio meetups, conferences, events and more

The latest Alluxio meetups, webinars, conferences and more

Events

Accelerate and Scale Big Data Analytics and Machine Learning Pipelines with Disaggregated Compute and Storage

Alluxio | SwiftStack Webinar *

Enterprises are increasingly looking towards object stores to power their big data & machine learning workloads in a cost-effective way. The combination of SwiftStack and Alluxio together, enables users to seamlessly move towards a disaggregated architecture. Swiftstack provides a massively parallel cloud object storage and multi-cloud data management system. Alluxio is a data orchestration layer, which sits between compute frameworks and storage systems and enables big data workloads to be deployed directly on SwiftStack. Alluxio provides data locality, accessibility and elasticity via its core innovations. With the Alluxio and Swiftstack solution, Spark, Presto, Tensorflow and Hive and other compute workloads can benefit from 10X performance improvement and dramatically lower costs.

Running Apache Spark with Alluxio for Fast Data Analytics

Alluxio Community Office Hour *

In this Office Hour you’ll learn about:
Using Alluxio as the input/output for Spark applications, Saving and loading Spark RDDs and Dataframes with Alluxio, Open Session for discussion on any topics such as solving the separation of compute and storage problem, unifying multiple storage systems, and more

Achieving Separation of Compute and Storage in a Cloud World

Alluxio Webinar *

In this webinar, we will discuss:
Why leading enterprises are adopting hybrid cloud architectures with compute and storage disaggregated, The new challenges that this new paradigm introduces, An introduction to Alluxio and the unified data solution it provides for hybrid environments

Getting Started with Alluxio Open Source

Alluxio Community Office Hour *

Join us for our first monthly office hour. This month we will focus on:
Installing Alluxio using Docker and Homebrew on your local Linux/Mac machine and accessing data from S3 and HDFS, Understanding Alluxio’s architecture in the data ecosystem, Open Session for discussion on any topics such as solving the separation of compute and storage problem, unifying multiple storage systems, and more.

Unified Big Data Analytics – Any stack, Any Cloud

Boston Meetup *

This presentation focuses on how Alluxio helps the big data analytics stack to be cloud-native. The trending Cloud object storage systems provide more cost-effective and scalable storage solutions but also different semantics and performance implications compared to HDFS. Applications like Spark or Presto will not benefit from the node-level locality or cross-job caching when retrieving data from the cloud object storage. Deploying Alluxio to access cloud solves these problems because data will be retrieved and cached in Alluxio instead of the underlying cloud or object storage repeatedly.

Alluxio+Presto: An Architecture for Fast SQL in the Cloud

Bay Area Meetup *

Cloud object storage systems provide different semantics and performance implications compared to HDFS. Applications like Presto cannot benefit from the node-level locality or cross-job caching when reading from the cloud. Deploying Alluxio with Presto to access cloud solves these problems because data will be retrieved and cached in Alluxio instead of the underlying cloud or object storage repeatedly. Bin will present the architecture to combine Presto with Alluxio with use cases from major internet companies like JD.com and NetEase.com, and their lessons learned to operate this architecture at scale.

Beijing Meetup: Talks from Sogou, Qiniu, JD.com & Alluxio

Beijing Meetup *

The future is the era of data, and the abstraction of efficient management, storage, and access to data is undoubtedly the cornerstone of this era. Open source distributed virtual data system Alluxio is dedicated to providing simple and efficient data abstraction, convenient data sharing and high-speed I/O for big data, machine learning, and artificial intelligence, while keeping applications and data persistent and providing rich Storage system selection. After several years of development, Alluxio was developed from a prototype of a research project involving only a few Ph.D. students and researchers in the AMPLab at the University of California, Berkeley, to more than 800 code contributors (Alluxio 1.8 release data), and deployed in Tencent. Baidu, JD, Two-Sigma, Barclays Bank and other hundreds of Chinese and foreign industry leaders in the production environment, become an important part of the data platform and data infrastructure.

Shanghai Meetup: Talks from Ctrip, Qiniu, Intel & Alluxio

Shanghai Meetup *

The future is the era of data, and the abstraction of efficient management, storage, and access to data is undoubtedly the cornerstone of this era. Open source distributed virtual data system Alluxio is dedicated to providing simple and efficient data abstraction, convenient data sharing and high-speed I/O for big data, machine learning, and artificial intelligence, while keeping applications and data persistent and providing rich Storage system selection.
After several years of development, Alluxio was developed from a prototype of a research project involving only a few Ph.D. students and researchers in the AMPLab at the University of California, Berkeley, to more than 800 code contributors (Alluxio 1.8 release data), and deployed in Tencent. Baidu, JD, Two-Sigma, Barclays Bank and other hundreds of Chinese and foreign industry leaders in the production environment, become an important part of the data platform and data infrastructure.