meetup Archives

AI/ML Infra Meetup – Highlights & Key Takeaways

July 10, 2024 By Chanchan Mao

Co-hosted by Alluxio and Uber on May 23, 2024, AI/ML Infra Meetup was the community event for developers focused on building AI, ML and data infrastructure at scale. We were thrilled by the overwhelming interest and enthusiasm in our meetup! This event brought together over 100 AI/ML infrastructure engineers and enthusiasts to discuss the latest … Continued

Zookeeper vs Raft: Stateful Distributed Coordination with HA and Fault Tolerance

October 21, 2022

Big Data Bellevue & Cloudy With a Chance of Data Meetup October 20, 2022 Distributed systems are made up of many components such as authentication, a persistence layer, stateless services, load balancers, and stateful coordination services. These coordination services are central to the operation of the system, performing tasks such as maintaining system configuration state, … Continued

Tags: big data, distributed systems, fault tolerance, high availability, meetup, raft, zookeeper

Zookeeper vs Raft: Stateful distributed coordination with HA and Fault Tolerance

Alluxio Meetup * October 20, 2022

This talk will go over a generic example of stateful coordination service moving from Zookeeper to Raft.

Integrating Open Source Alluxio in AWS EKS with Terraform

April 1, 2021

The presentation talks about the best practices to set up and techniques to build a cluster with open source Alluxio on AWS EKS, for one of our clients, which made it Scalable, Reliable, and Secure by adapting to Kubernetes RBAC.

Tags: aws, data orchestration, eks, kubernetes, meetup, terraform

Accelerating Data Computation on Ceph Objects using Alluxio

November 11, 2020

In this talk, we will present how using Alluxio computation and storage ecosystems can better interact benefiting the “bringing the data close to the code” approach. Moving away from the complete disaggregation of computation and storage, data locality can enhance the computation performance. During this talk, we will present our observations and testing results that will show important enhancements in accelerating Spark Data Analytics on Ceph Objects Storage using Alluxio.

Tags: ceph, compute, data locality, distributed storage, meetup, object storage, storage

StorageQuery: federated querying on object stores, powered by Alluxio and Presto

August 25, 2020

Alluxio and Presto are a powerful combination to address the compute problem, which is part of the strategy used by Simbiose Ventures to create a product called StorageQuery – A platform to query files in cloud storages with SQL.

Tags: cloud storage, compute storage separation, meetup, object stores, presto, shannondb, sql, storagequery, under filesystem

How to Build a new Under Filesystem in Alluxio: Apache Ozone as an Example

July 7, 2020

In Alluxio, an Under File System is the plugin to connect to any file systems or object stores, so users can mount different storages like AWS S3 or HDFS into Alluxio namespace. This under filesystem is designed to be modular, in order to enable users to easily extend this framework with their own Under File System implementation and connect to a new or customized storage system.

Tags: apache ozone, aws s3, hdfs, meetup, object stores, storage, under filesystem

Optimizing Latency-Sensitive Queries for Presto at Facebook: A Collaboration Between Presto & Alluxio

May 7, 2020

For many latency-sensitive SQL workloads, Presto is often bound by retrieving distant data. In this talk, Rohit Jain, James Sun from Facebook and Bin Fan from Alluxio will introduce their teams’ collaboration on adding a local on-SSD Alluxio cache inside Presto workers to improve unsatisfied Presto latency.

Tags: local cache, meetup, performance, presto, sql workloads

Tag: meetup