compute storage separation Archives

Tech Talk: Achieving Separation of Compute and Storage in a Cloud World

February 12, 2019

The rise of compute-intensive workloads and the adoption of the cloud have driven organizations to adopt a decoupled architecture for modern workloads, one in which compute scales independently of storage. While this enables elastic scaling, it introduces new problems: how do you co-locate data with compute, how do you unify data across multiple remote clouds, and how do you keep storage and I/O service costs down, among others?

Enter Alluxio, a virtual unified file system that sits between compute and storage and allows you to realize the benefits of a hybrid cloud architecture with the same performance and lower costs.

Tags: compute storage separation, hybrid cloud, tech talk

Unified Big Data Analytics: Any Stack, Any Cloud

January 22, 2019 by Bin Fan

This presentation focuses on how Alluxio enables the big data analytics stack to be cloud-native. Today’s cloud object storage systems provide more cost-effective and scalable storage than HDFS, but they also come with different semantics and performance implications.

Tags: architecture, compute storage separation, meetup, presto, sql

Alluxio – Virtual Unified File System

December 6, 2018 by Haoyuan Li

Learn more about Alluxio, a virtual unified file system and data orchestration layer for big data and machine learning workloads in the cloud.

Tags: architecture, beginner, compute storage separation

Alluxio Overview: Open Source Data Orchestration Technology

October 17, 2018

Alluxio is an open source data orchestration platform for large-scale analytics and AI/ML applications. It provides a unified namespace for accessing data distributed across private data centers and clouds, along with advanced caching to address issues with data locality, performance, and data egress costs. Alluxio provides the data accessibility, locality, and elasticity needed to reduce complexity and improve performance for analytics and AI/ML workloads.

Tags: architecture, beginner, compute storage separation

Alluxio Architecture and Data Flow

October 16, 2018

Alluxio was created because we saw a need for innovation at the data layer, arising from the growing complexity of connecting multiple compute frameworks to an ever-expanding mix of storage systems and formats. Our approach uses a memory-centric architecture that abstracts files and objects in underlying persistent storage systems and provides a shared data access layer for compute applications.

Alluxio is not a persistent storage system. Instead, Alluxio serves as a data access layer, residing between any persistent storage system (such as Amazon S3, Microsoft Azure Object Store, Apache HDFS or OpenStack Swift) and computation frameworks (such as Apache Spark, Presto or Hadoop MapReduce). This whitepaper provides a technical overview of the Alluxio architecture and describes the data flow for common read and write scenarios.

Tags: architecture, beginner, compute storage separation, overview

Data EcoSystem 2.0

October 1, 2018 by Haoyuan Li

Learn more about data unification for the digital economy and how Alluxio’s data orchestration brings your data to your compute, wherever it’s located.

Tags: architecture, compute storage separation, overview

Intel: How to Use Alluxio to Accelerate Big Data Analytics on the Cloud and New Opportunities with Persistent Memory

October 1, 2018 by Yuan Zhou

Learn how Intel uses Alluxio to accelerate big data analytics in the cloud, and explore new opportunities for persistent memory when compute and storage are separated.

Tags: apache spark, aws s3, benchmark, compute storage separation, partner
