data orchestration Archives

Optimizing Query Performance by Decoupling Presto and Hive Data Warehouse

March 24, 2020

Ideally, Presto would access data independently of how the data was originally stored or managed. Alluxio, as a data orchestration layer, provides physical data independence, allowing Presto to interact with the data more efficiently. In addition to caching for I/O acceleration, Alluxio also provides a catalog service that abstracts the metadata in the Hive Metastore, and transformations that expose the data in a compute-optimized way. In this talk, we describe some of the challenges of using Presto with Hive and introduce Alluxio data orchestration as a way to solve those challenges.

Tags: alluxio engineering, catalog service, data orchestration, hive, office hour, performance, presto, structured data services

Optimizing Query Performance by Decoupling Presto and Hive Data Warehouse

Community Online Office Hour * March 24, 2020

Alluxio, as a data orchestration layer, provides physical data independence, allowing Presto to interact with the data more efficiently. In addition to caching for I/O acceleration, Alluxio also provides a catalog service that abstracts the metadata in the Hive Metastore, and transformations that expose the data in a compute-optimized way. In this talk, we describe some of the challenges of using Presto with Hive and introduce Alluxio data orchestration as a way to solve those challenges.

Open source data orchestration for a disaggregated analytics stack

Bangalore Presto Meetup * January 11, 2020

The rise of compute-intensive workloads and the adoption of the cloud have driven organizations to adopt a decoupled architecture for modern workloads, one in which compute scales independently from storage. While this enables elastic scaling, it introduces new problems: how do you co-locate data with compute, how do you unify data across multiple remote clouds, how do you keep storage and I/O service costs down, and many more.

Open Source Panel: How to create an open source project

November 12, 2019

In this panel, creators of open source projects share their stories, from why they started their projects to the challenges they encountered along the way.

Tags: conference, data orchestration, data orchestration summit, open source

Orchestrate a Data Symphony

November 12, 2019

In this talk, HY discusses the key challenges and trends impacting data engineering and explores the concept of data orchestration.

Tags: conference, data engineering, data orchestration, data orchestration summit
