On Demand Video

StorageQuery: federated querying on object stores, powered by Alluxio and Presto


Over the last few years, organizations have worked towards the separation of storage and compute for a number of benefits in the areas of cost, data duplication and data latency. Cloud resolves most of these issues but comes to the expense of needing a way to query data on remote storages. Alluxio and Presto are a powerful combination to address the compute problem, which is part of the strategy used by Simbiose Ventures to create a product called StorageQuery – A platform to query files in cloud storages with SQL.

This talk will focus on:

  • How Alluxio fits StorageQuery’s tech stack;
  • Advantages of using Alluxio as a cache layer and its unified filesystem
  • Development of new under file system for Backblaze B2 and fine-grained code documentation;
  • ShannonDB remote storage mode.


Abner Ferreira is a backend developer at Simbiose Ventures. He is currently working on implementing fine-grained logs and code-level documentation for Alluxio.

Caio Pavanelli is a team lead backend developer at Simbiose Ventures. He is currently focused on customizing PrestoSQL and Alluxio for building StorageQuery’s platform. He has a M.Sc. in Electrical Engineering from Centro Universit√°rio da FEI on cognitive robotics.

Bin Fan is the founding engineer and VP of Open Source at Alluxio, Inc. Prior to Alluxio, he worked for Google to build the next-generation storage infrastructure. Bin received his Ph.D. in Computer Science from Carnegie Mellon University on the design and implementation of distributed systems.

Questions? Slack with the speakers, users, and many other community members!
Welcome to join Alluxio Global Online Meetup Group to attend online meetups like this!