Join us for the 12th Alluxio Day virtual community event featuring speakers from Shopee, Websec, and Alluxio.
Tag: tech talk
Join us for the 10th Alluxio Day virtual community event featuring speakers from Uber, BiliBili, and Alluxio.
Join us for our 8th Alluxio Day virtual community event featuring speakers from Apache Iceberg and WeRide.
Join us for our 6th Alluxio Day community virtual event featuring speakers from Join us for our 4th Alluxio Day community virtual event featuring speakers from Facebook, Princeton, Apache Hudi, Zendesk, and Uber.
Join us for our 4th Alluxio Day community virtual event featuring speakers from Facebook, TikTok,
Tencent, and Intel.
Join us for our 3rd Alluxio Day community virtual event featuring speakers from Nvidia, Alibaba, Aspect Analytics, and MSFT.
Join us for this tech talk where we’ll introduce the Starburst Presto, Alluxio, and cloud object store stack for building a highly-concurrent and low-latency analytics platform.
Google Cloud Dataproc is a widely used fully managed Spark and Hadoop service to run big data analytics and compute workloads in the cloud. Services like Dataproc reduce hardware spend, eliminate the need to overbuy capacity, and provide business agility. Yet users still face challenges for performance sensitive workloads or workloads running on remote data.
Alluxio is an open source cloud data orchestration platform that increases performance of analytic workloads running on Dataproc by intelligently caching data and bringing back lost data locality. Alluxio also enables users to run compute workloads against on-prem storage like Hadoop HDFS without any app changes.
Chris Crosbie and Roderick Yao from the Google Dataproc team and Dipti Borkar of Alluxio demo how to set up Google Cloud Dataproc with Alluxio so jobs can seamlessly read from and write to Cloud Storage. They also show how to run Dataproc Spark against a remote HDFS cluster.
If you’re a MapR user, you might have concerns with your existing data stack. Whether it’s the complexity of Hadoop, financial instability and no future MapR product roadmap, or no flexibility when it comes to co-locating storage and compute, MapR may no longer be working for you.
Alluxio can help you migrate to a modern, disaggregated data stack using any object store with the similar performance of Hadoop plus significant cost savings.
Join us for this tech talk where we’ll discuss how to separate your compute and storage on-prem and architect a new data stack that makes your object store the core. We’ll show you how to offload your MapR/HDFS compute to any object store and how to run all of your existing jobs as-is on Alluxio + object store.