Optimizing Latency-Sensitive Queries for Presto at Facebook: A Collaboration Between Presto & Alluxio


ALLUXIO GLOBAL ONLINE MEETUP

For many latency-sensitive SQL workloads, Presto is often bound by retrieving distant data. In this talk, Rohit Jain and James Sun from Facebook and Bin Fan from Alluxio will introduce their teams’ collaboration on adding a local on-SSD Alluxio cache inside Presto workers to improve Presto query latency.

This talk will focus on:

  • Insights into the Presto workloads at Facebook with respect to cache effectiveness
  • API and internals of the Alluxio local cache, from design trade-offs (e.g., caching granularity and concurrency level) to performance optimizations
  • Initial performance analysis and the timeline for delivering this feature to general Presto users
  • Discussion on our future work to optimize cache performance with deeper integration with Presto

Speakers:

Rohit Jain is a software engineer at Facebook. He is currently developing solutions to support low-latency queries in Presto at Facebook.


Yutian “James” Sun is a software engineer at Facebook working on large-scale distributed database systems. His major interests are query optimization, data federation, and low-latency query execution. James received his Ph.D. in Computer Science from the University of California, Santa Barbara, focusing on data integration and data-centric processes.

Bin Fan is the founding engineer and VP of Open Source at Alluxio, Inc. Prior to Alluxio, he worked for Google to build the next-generation storage infrastructure. Bin received his Ph.D. in Computer Science from Carnegie Mellon University on the design and implementation of distributed systems.

Questions? Slack with the speakers, users, and many other community members!
Join the Alluxio Global Online Meetup Group to attend online meetups like this!

Video:

Slides: