tech talk Archives | Page 2 of 4

Alluxio Day III

Community Virtual Event * April 27, 2021

Join us for our 3rd Alluxio Day community virtual event featuring speakers from Nvidia, Alibaba, Aspect Analytics, and MSFT.

Tech Talk: Integrating Google Cloud Dataproc with Alluxio for faster performance in the cloud

December 12, 2019

Google Cloud Dataproc is a widely used fully managed Spark and Hadoop service to run big data analytics and compute workloads in the cloud. Services like Dataproc reduce hardware spend, eliminate the need to overbuy capacity, and provide business agility. Yet users still face challenges for performance sensitive workloads or workloads running on remote data.

Alluxio is an open source cloud data orchestration platform that increases performance of analytic workloads running on Dataproc by intelligently caching data and bringing back lost data locality. Alluxio also enables users to run compute workloads against on-prem storage like Hadoop HDFS without any app changes.

Chris Crosbie and Roderick Yao from the Google Dataproc team and Dipti Borkar of Alluxio demo how to set up Google Cloud Dataproc with Alluxio so jobs can seamlessly read from and write to Cloud Storage. They also show how to run Dataproc Spark against a remote HDFS cluster.

Tags: dataproc, google cloud, hdfs, tech talk

Tech Talk: The Path to Migrating off MapR

December 11, 2019

If you’re a MapR user, you might have concerns with your existing data stack. Whether it’s the complexity of Hadoop, financial instability and no future MapR product roadmap, or no flexibility when it comes to co-locating storage and compute, MapR may no longer be working for you.

Alluxio can help you migrate to a modern, disaggregated data stack using any object store with the similar performance of Hadoop plus significant cost savings.

Join us for this tech talk where we’ll discuss how to separate your compute and storage on-prem and architect a new data stack that makes your object store the core. We’ll show you how to offload your MapR/HDFS compute to any object store and how to run all of your existing jobs as-is on Alluxio + object store.

Tags: hdfs, mapr, object stores, tech talk

From limited Hadoop compute capacity to increased data scientist efficiency

Alluxio Tech Talk * October 16, 2019

This tech talk will share approaches to burst data to the cloud along with
how Alluxio can enable “zero-copy” bursting of Spark workloads to cloud data services like EMR and Dataproc. Learn how DBS bank uses Alluxio to solve for limited on-prem compute capacity.

Tech Talk: Accelerating Analytics with EMR on your S3 Data Lake

September 12, 2019

This tech talk gives shows how to set up EMR Spark and Hive with Alluxio to seamlessly read/write to your S3 data lake, along with performance benefits.

Tags: aws s3, emr, spark, tech talk

Tag: tech talk