This blog was authored by Madan Kumar and Alex Ma and originally posted on Medium.
As the data ecosystem grows more complex and increasingly disaggregated, data analysts and end users struggle to adapt to and work with hybrid environments. The proliferation of compute applications and storage mediums leads to a hybrid model that we are simply not accustomed to.
In this disaggregated system, data engineers now face a multitude of problems that they must overcome in order to get meaningful insights:

- Enabling connections between the various compute engines and storage systems becomes increasingly complex.
- Performance often suffers from a lack of data locality for compute, a challenge that did not exist in collocated environments (storage and compute together).
- Costs run high, mainly because teams create multiple copies of data whenever they need it closer to compute, which leaves storage unoptimized and increasingly saturated.
In this new 2.0 ecosystem, data engineers need a way to leverage and work with hybrid environments while keeping application code changes minimal and using all available storage systems to their fullest.
Today, data engineers working in these hybrid environments have no easy, transparent way to deal with these issues. We often make multiple copies of data across environments in the hope of achieving locality, and we cannot adopt more efficient compute engines because of API incompatibility. We end up overloading storage while failing to take full advantage of other, cheaper solutions.
Handling these modern workloads requires a solution that solves a few different problems, but most of all one that can serve as a virtualization layer between compute and storage. Just as orchestration frameworks exist for technologies like containers, there needs to be an orchestration framework for data. One such open source system is Alluxio (formerly Tachyon), which provides the capabilities needed to function as a modern data orchestration solution.

Alluxio provides several features that a data orchestration framework needs to succeed in hybrid environments. It gives engineers unified access to data regardless of the storage system it resides on. This becomes increasingly necessary when adopting newer compute engines that may not natively integrate with a particular storage system; a common interface removes that concern. Alluxio's API translation lets users keep bringing new technologies into their ecosystem while ensuring a durable, consistent way of connecting them. Alluxio's tiering capability also helps solve the slow-data-access problem while letting you leverage lower-cost storage.
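To make the unified-access idea concrete, here is a minimal conceptual sketch of a namespace layer that maps one logical path space onto multiple backends. This is illustrative only: `UnifiedNamespace`, `mount`, and `read` are hypothetical names, and plain dicts stand in for storage systems such as S3 or HDFS; this is not Alluxio's actual API.

```python
# Conceptual sketch: one logical namespace over multiple storage backends.
# Dicts stand in for real stores (S3, HDFS, ...); names are illustrative.

class UnifiedNamespace:
    def __init__(self):
        self._mounts = {}  # logical prefix -> backend (a dict here)

    def mount(self, prefix, backend):
        """Attach a storage backend under a logical path prefix."""
        self._mounts[prefix] = backend

    def read(self, path):
        """Resolve a logical path to whichever backend it is mounted on."""
        # Try the longest prefix first so nested mounts resolve correctly.
        for prefix, backend in sorted(
            self._mounts.items(), key=lambda kv: len(kv[0]), reverse=True
        ):
            if path.startswith(prefix):
                return backend[path[len(prefix):].lstrip("/")]
        raise FileNotFoundError(path)

ns = UnifiedNamespace()
ns.mount("/warehouse", {"events.parquet": b"s3-bytes"})  # e.g. an S3 bucket
ns.mount("/archive", {"old.csv": b"hdfs-bytes"})         # e.g. an HDFS dir

payload = ns.read("/warehouse/events.parquet")  # one path scheme, any backend
```

The point of the sketch is that compute-side code addresses a single logical path, and the layer decides which physical store serves it; the real system adds caching, consistency, and the actual storage connectors on top of this idea.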
While working in hybrid environments can be challenging, it is something we must come to grips with in today's rapidly evolving data ecosystem. Modern data orchestration frameworks, while not solving the entire problem, have come a long way in making the adoption of hybrid that much easier!

Coupang, a Fortune 200 technology company, manages a multi-cluster GPU architecture for its AI/ML model training. This architecture introduced significant challenges, including:
- Time-consuming data preparation and data copy/movement
- Difficulty utilizing GPU resources efficiently
- High and growing storage costs
- Excessive operational overhead maintaining storage for localized data silos
To resolve these challenges, Coupang’s AI platform team implemented a distributed caching system that automatically retrieves training data from their central data lake, improves data loading performance, unifies access paths for model developers, automates data lifecycle management, and extends easily across Kubernetes environments. The new distributed caching architecture has improved model training speed, reduced storage costs, increased GPU utilization across clusters, lowered operational overhead, enabled training workload portability, and delivered 40% better I/O performance compared to parallel file systems.
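The read-through pattern at the heart of such a caching layer can be sketched schematically. This is a toy illustration, not Coupang's system: `ReadThroughCache` is a hypothetical name, and an in-memory dict stands in for the central data lake.

```python
# Minimal read-through cache sketch: on a miss, fetch from the backing
# "data lake" and keep a local copy; subsequent reads are served locally.

class ReadThroughCache:
    def __init__(self, fetch_fn):
        self._fetch = fetch_fn  # called on a cache miss
        self._local = {}        # local copies of fetched objects
        self.hits = 0
        self.misses = 0

    def get(self, key):
        if key in self._local:
            self.hits += 1
        else:
            self.misses += 1
            self._local[key] = self._fetch(key)  # pull from the data lake
        return self._local[key]

# A dict standing in for the central data lake.
data_lake = {"train/shard-0": b"batch0", "train/shard-1": b"batch1"}
cache = ReadThroughCache(data_lake.__getitem__)

cache.get("train/shard-0")  # miss: fetched from the lake
cache.get("train/shard-0")  # hit: served from the local copy
```

Because data is pulled on first access rather than copied ahead of time, training jobs see one access path and the expensive data-preparation and copy steps disappear; a production system would add eviction, sharding across nodes, and lifecycle management on top.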

Suresh Kumar Veerapathiran and Anudeep Kumar, engineering leaders at Uptycs, recently shared their experience of evolving their data platform and analytics architecture to power analytics through a generative AI interface. In their post on Medium, titled Cache Me If You Can: Building a Lightning-Fast Analytics Cache at Terabyte Scale, Veerapathiran and Kumar provide detailed insights into the challenges they faced (and how they solved them) in scaling an analytics solution that collects and reports on terabytes of telemetry data per day as part of the Uptycs Cloud-Native Application Protection Platform (CNAPP).