Cloud analytics Caching

featured use case

Is data in the public cloud slowing your compute down? With Alluxio, get in-memory data access for Spark, Presto, Hive and other analytics frameworks on AWS S3, Google Cloud, or Microsoft Azure.

are analytics workloads in the cloud getting slow and expensive?

Running compute on top of S3/GCS/Azure comes with its own set of challenges.

Performance is variable and consistent query SLAs are hard to achieve

Metadata operations like list and rename are expensive, so workloads run longer


Egress costs, particularly cross-region, can add up, making the solution expensive

Eventual consistency on writes makes it hard to predict query results


alluxio accelerates analytics workloads in the cloud and saves costs

Co-locate your data with your compute for in-memory data access so you can cache data in the cloud in the same instance as your compute.

Reading S3/GCS/Azure data into Spark, Presto, Hive, or any other compute framework and enabling data sharing is automated and transparent with Alluxio, which serves data to compute to improve the end-to-end model development efficiency. Alluxio can be deployed colocated with your compute cluster, exposing the data through Alluxio POSIX or HDFS compatible interfaces and backed by a mounted remote storage like S3.

Want help getting your analytics workloads faster and less expensive? Schedule a meeting with an Alluxio solutions engineer.

alluxio on aws

Get started with Alluxio on AWS.

alluxio on s3

Alluxio supports access to different storage systems through its unified namespace. Configure S3 as Alluxio’s under storage system in six steps.

alluxio on ec2

Deploy Alluxio on AWS EC2 in five steps with the AWS Vagrant plugin. To run an Alluxio cluster on EC2, you’ll need to sign up for an EC2 account first.

alluxio on emr + s3

Run clusters on-demand for compute workloads with AWS EMR. Alluxio on EMR and S3 provides more functionality than EMRFS.

alluxio on google cloud

Get started with Alluxio on Google.

alluxio on GCS

Alluxio supports access to different storage systems through its unified namespace. Configure GCS as Alluxio’s under storage system in four steps.

alluxio on gce

Deploy Alluxio on GCE in five steps with the Google Vagrant plugin. To run an Alluxio cluster on GCE, you’ll need to sign up for a Google Cloud account first.

alluxio on microsoft azure

Get started with Alluxio on Azure.

alluxio on Azure blob store

Alluxio supports access to different storage systems through its unified namespace. Configure Azure Blob Store as Alluxio’s under storage system in three steps.

alluxio + presto and azure

Run Presto to query Alluxio as a distributed cache layer with Azure Blob Store. Alluxio will allow Presto tp access data from Azure and transparently cache the data frequently accessed.