kubernetes Archives | Page 3 of 3

Improving Data Locality for Spark Jobs on Kubernetes Using Alluxio

Alluxio Community Office Hour * December 17, 2019

One important performance optimization in Apache Spark is to schedule tasks on nodes with HDFS data nodes locally serving the task input data. However, more users are running Apache Spark natively on Kubernetes where HDFS is not an option. This office hour describes the concept and dataflow with respect to using the stack of Spark/Alluxio in Kubernetes with enhanced data locality even the storage service is outside or remote.

Deep Learning and Gene Computing Acceleration with Alluxio in Kubernetes

November 12, 2019

Learn about Alibaba’s use case in deep learning and gene computing acceleration using Alluxio in Kubernetes.

Tags: conference, data engineering, data orchestration, data orchestration summit, kubernetes

Accelerating Spark with Kubernetes

Alluxio Tech Talk * August 7, 2019

This tech talk gives a quick overview of Alluxio and the use cases it powers for Spark/Presto in Kubernetes. We also show you how to set up Alluxio and Spark/Presto to run in Kubernetes.

Is Alluxio able to create a data grid for Kubernetes?

Alluxio is available via Docker. You can create a cluster of Alluxio within a Kubernetes cluster. Given that we do have these containers, you can either use a daemon set or a replica set within a Kubernetes cluster to create an alluxio cluster itself and have it co-located within your other nodes that may be … Continued

Community Office Hour: Running Spark & Alluxio in Kubernetes

June 25, 2019 by Bin Fan & Adit Madan

The data orchestration layer bridging the gap between data locality with improved performance and data accessibility for analytics workloads in Kubernetes, and enables portability across storage providers.
An overview of Alluxio and the cloud use case with Spark in Kubernetes. Learn how to set up Alluxio and Spark to run in Kubernetes.

Tags: analytics, apache spark, compute, compute storage separation, data, data orchestration, hybrid cloud, kubernetes, locality, multi cloud, office hour, spark, storage

Open Source Fest with Alluxio & Ignite + How to Accelerate Analytic Queries!

Bay Area Meetup * June 24, 2019

Join us June 24 in Menlo Park for our next meetup! We’ll have 3 valuable talks, a delicious BBQ dinner and amazing summertime-themed raffle prizes! This free event is sponsored by GridGain Systems and Oracle.

Recap: Spark+AI Summit 2019

May 2, 2019 By Amelia Wong

Alluxio is a proud sponsor and exhibitor of Spark+AI Summit in San Francisco.
What’s Spark+AI Summit? It’s the world’s largest conference that is focused on Apache Spark – Alluxio’s older cousin open source project from the same lab (UC Berkeley’s AMPLab – now RISElab).

Tag: kubernetes