Alluxio Blog

Accelerating and Scaling Big Data Analytics with Alluxio and Intel® Optane™ Persistent Memory

Testing Methodology: A decision support workload is a typical workload that models multiple aspects of a decision support system, including queries and data maintenance. We selected 54 queries that represent typical SQL query behavior in Hadoop for the test. The tests include three different configurations: without Alluxio, Alluxio on PMem, and Alluxio on DRAM. The … Continued

Serving Structured Data in Alluxio: Concept

This article introduces Structured Data Management available in the latest Alluxio 2.2.0 release, a new effort to provide further benefits to SQL and structured data workloads using Alluxio.

Kubernetes, Alluxio and the Disaggregated Analytics Stack

TL;DR: First, the news: Alluxio support for K8s Helm charts is now available! K8s is a certified environment for Alluxio. Now the takeaway: Alluxio brings back data locality for the disaggregated analytics stack in K8s. How? Read on. There’s no arguing the rise of containers in real-world … Continued

Data Orchestration Summit Recap and Highlights!

We are delighted by the success of the inaugural Data Orchestration Summit on Nov. 7, 2019! Organized by Alluxio, this one-day event sold out with nearly 400 attendees! Data engineers, cloud engineers, and data scientists joined talks by 24 industry leaders from all over the globe to share their experiences building cloud-native data and … Continued

Improving Spark Memory Resource with Off-Heap In-Memory Storage

In the previous tutorial, “Getting Started with Spark Caching using Alluxio in 5 Minutes”, we demonstrated how to get started with Spark and Alluxio. To share more thoughts and experiments on how Alluxio enhances Spark workloads, this article focuses on how Alluxio helps optimize the memory utilization of Spark applications. For users who are … Continued