object storage Archives

Accelerating Data Computation on Ceph Objects using Alluxio

November 11, 2020

In this talk, we will present how using Alluxio computation and storage ecosystems can better interact benefiting the “bringing the data close to the code” approach. Moving away from the complete disaggregation of computation and storage, data locality can enhance the computation performance. During this talk, we will present our observations and testing results that will show important enhancements in accelerating Spark Data Analytics on Ceph Objects Storage using Alluxio.

Tags: ceph, compute, data locality, distributed storage, meetup, object storage, storage

Ultra-fast SQL Analytics using PAS (Presto on Alluxio Stack)

November 22, 2019

This talk describes a stack of open-source projects to serve high-concurrent and low-latency SQL queries using Presto with Alluxio on big data in the cloud. Deploying Alluxio as a data orchestration layer to access cloud storage object storage (e.g., AWS S3), this architecture greatly enhances the data locality of Presto with distributed and cross-query caching, thus avoids reading same data repeatedly from the cloud storage.

Tags: caching, cloud storage, data locality, meetup, object storage, presto

Tag: object storage