compute storage separation Archives | Page 2 of 11

Powering Data Science and AI with Apache Spark, Alluxio, and IBM

Alluxio Global Online Meetup * October 15, 2019

In this online meetup, we will present the benefits of the fast analytics stack of Spark on Alluxio, and dive into China Unicom’s use case of leveraging Spark and Alluxio to process massive amounts of mobile data.

Bay Area Meetup: Alluxio 2.0 Deep Dive and Near Real-time Analytics with Spark

July 23, 2019

This meetup presents an overview of the motivations and design decisions behind the major changes in the Alluxio 2.0 release, and Real-time Data Processing for Sales Attribution Analysis with Alluxio, Spark and Hive at VIPShop.

Tags: alluxio engineering, apache hadoop, apache spark, compute, compute storage separation, data, data orchestration, hadoop, hdfs, meetup, scale, spark, storage

Recap: AWS Summit New York

July 22, 2019 By Amelia Wong

Alluxio is a proud sponsor and exhibitor at the AWS Summit in New York. If you weren’t able to attend, here are the highlights

Alluxio New York Meetup: Accelerating Analytical Workloads for Public & Hybrid Clouds

July 22, 2019

Joint hosted Alluxio New York meetup with talks to include: Embracing hybrid cloud for data-intensive analytic workloads and Alluxio on AWS EMR (fast storage access and sharing for Spark).

Tags: alluxio engineering, analytics, compute, compute storage separation, data, data orchestration, hybrid cloud, meetup, performance, storage

The Practice of Alluxio in Ctrip Real-Time Computing Platform

July 19, 2019 By Jianhua Guo

Today, real-time computation platform is becoming increasingly important in many organizations. In this article, we will describe how ctrip.com applies Alluxio to accelerate the Spark SQL real-time jobs and maintain the jobs’ consistency during the downtime of our internal data lake (HDFS). In addition, we leverage Alluxio as a caching layer to dramatically reduce the workload pressure on our HDFS NameNode.

Democratizing Data Orchestration | Haoyuan (H.Y.) Li – Alluxio

July 18, 2019

TFiR – Open Source & Emerging Technologies In this interview we spoke to Haoyuan (H.Y.) Li, Founder, Chairman and CTO of Open Source Alluxio, a company that is democratizing data in the cloud.

Tags: alluxio engineering, analytics, compute, compute storage separation, data, data orchestration

Tech Talk: Accelerate and Scale Big Data Analytics with Disaggregated Compute and Storage

July 17, 2019

The ever increasing challenge to process and extract value from exploding data with AI and analytics workloads makes a memory centric architecture with disaggregated storage and compute more attractive. This decoupled architecture enables users to innovate faster and scale on-demand. Enterprises are also increasingly looking towards object stores to power their big data & machine learning workloads in a cost-effective way. However, object stores don’t provide big data compatible APIs as well as the required performance.

In this webinar, the Intel and Alluxio teams will present a proposed reference architecture using Alluxio as the in-memory accelerator for object stores to enable modern analytical workloads such as Spark, Presto, Tensorflow, and Hive. We will also present a technical overview of Alluxio.

Tags: big data, compute storage separation, hive, intel, object stores, spark, tech talk, tensorflow

Tag: compute storage separation