Architecting Data Orchestration: Four Use Cases

Originally published on Eckerson.com: https://www.eckerson.com/articles/architecting-data-orchestration-four-use-cases ABSTRACT: This blog explores four use cases for data orchestration and examples of the supporting architectural elements. Modern analytics projects rely on a hodgepodge of compute clusters, data stores, and pipelines, flung across countries and continents. Enterprises struggle to meet performance SLAs without replicating lots of data or moving and re-coding … Continued

Data Orchestration: Simplifying Data Access for Analytics

Originally published on Eckerson.com: https://www.eckerson.com/articles/data-orchestration-simplifying-data-access-for-analytics   The problem with data modernization initiatives is that they result in distributed datasets that impede analytics projects. As enterprises start their cloud migration journey, adopt new types of applications, data stores, and infrastructure, they still leave residual data in the original location. This results in far-flung silos that can be … Continued

Avoid Data Silos in Presto in Meta: the journey from Raptor to RaptorX

This blog was originally published in the Presto blog: https://prestodb.io/blog/2022/01/28/avoid-data-silos-in-presto-in-meta Alluxio: Rongrong Zhong Meta: James Sun, Ke Wang Raptor is a Presto connector (presto-raptor) that is used to power some critical interactive query workloads in Meta (previously Facebook). Though referred to in the ICDE 2019 paper Presto: SQL on Everything, it remains somewhat mysterious to many Presto users … Continued

A Year with Alluxio Community 2021

2021 marked accelerated growth for the Alluxio Open Source Project. We could not be more grateful for what the community has achieved together in this past year. This blog provides a glimpse of the year long summary of our community growth.