data Archives | Page 2 of 12

Bay Area Meetup: Alluxio 2.0 Deep Dive and Near Real-time Analytics with Spark

July 23, 2019

This meetup presents an overview of the motivations and design decisions behind the major changes in the Alluxio 2.0 release, and Real-time Data Processing for Sales Attribution Analysis with Alluxio, Spark and Hive at VIPShop.

Tags: alluxio engineering, apache hadoop, apache spark, compute, compute storage separation, data, data orchestration, hadoop, hdfs, meetup, scale, spark, storage

NetEase and Alluxio joint meetup

Hangzhou Meetup * July 26, 2019

Joint meetup in Hangzhou discusses: An introduction to new features of big data storage system Alluxio and optimization of cache performance, Practice & exploration of Spark & Alluxio, and the Interactive query system Impala.

Efficient Data Engineering with Apache Spark, Hive, and Alluxio on S3

Alluxio Meetup | Austin * August 15, 2019

Welcome to the first event of the Cloud, Data, & Orchestration Austin Meetup! This meetup will feature two talks and an opportunity to engage with other data engineers, developers, and Alluxio users. Thanks to Bazaarvoice for hosting!

Alluxio 2.0 Deep Dive | Simplifying data access for cloud workloads

Alluxio Tech Talk * August 6, 2019

We will introduce the key new features and enhancements such as: Support for hyper-scale data workloads, Machine learning and deep learning workloads, and Better storage abstraction.

Recap: AWS Summit New York

July 22, 2019 By Amelia Wong

Alluxio is a proud sponsor and exhibitor at the AWS Summit in New York. If you weren’t able to attend, here are the highlights

Alluxio New York Meetup: Accelerating Analytical Workloads for Public & Hybrid Clouds

July 22, 2019

Joint hosted Alluxio New York meetup with talks to include: Embracing hybrid cloud for data-intensive analytic workloads and Alluxio on AWS EMR (fast storage access and sharing for Spark).

Tags: alluxio engineering, analytics, compute, compute storage separation, data, data orchestration, hybrid cloud, meetup, performance, storage

The Practice of Alluxio in Ctrip Real-Time Computing Platform

July 19, 2019 By Jianhua Guo

Today, real-time computation platform is becoming increasingly important in many organizations. In this article, we will describe how ctrip.com applies Alluxio to accelerate the Spark SQL real-time jobs and maintain the jobs’ consistency during the downtime of our internal data lake (HDFS). In addition, we leverage Alluxio as a caching layer to dramatically reduce the workload pressure on our HDFS NameNode.

Democratizing Data Orchestration | Haoyuan (H.Y.) Li – Alluxio

July 18, 2019

TFiR – Open Source & Emerging Technologies In this interview we spoke to Haoyuan (H.Y.) Li, Founder, Chairman and CTO of Open Source Alluxio, a company that is democratizing data in the cloud.

Tags: alluxio engineering, analytics, compute, compute storage separation, data, data orchestration

Cybersecurity and fraud detection at ING Bank using Presto & Alluxio on S3

Alluxio Global Online Meetup * August 1, 2019

This event features leading financial services company ING Bank’s user story on how they leverage open source technologies like Presto and Alluxio with S3.

Tag: data