architecture Archives | Page 4 of 6

Achieving Separation of Compute and Storage in a Cloud World

Alluxio Tech Talk * February 12, 2019

In this tech talk, we will discuss why leading enterprises are adopting hybrid cloud architectures with compute and storage disaggregated, the new challenges that this new paradigm introduces, and the unified data solution Alluxio provides for hybrid environments.

Unified Big Data Analytics: Any Stack, Any Cloud

January 22, 2019 by Bin Fan

This presentation focuses on how Alluxio enables the big data analytics stack to be cloud-native. Today’s cloud object storage systems provide more cost-effective and scalable storage solutions but also different semantics and performance implications compared to HDFS.

Tags: architecture, compute storage separation, meetup, presto, sql

Alluxio – Virtual Unified File System

December 6, 2018 by Haoyuan Li

Learn more about Alluxio, a virtual unified file system and data orchestration layer for big data and machine learning workloads in the cloud.

Tags: architecture, beginner, compute storage separation

Alluxio Overview: Open Source Data Orchestration Technology

October 17, 2018

Alluxio is an open source data orchestration platform for large-scale analytics and AI/ML applications. It provides a unified namespace for accessing data distributed across private data centers and clouds, and also provides advanced caching to address issues with data locality, performance, and data egress costs. Alluxio provides the data accessibility, locality, and elasticity needed to reduce complexity and improve the performance for analytics and AI/ML workloads.

Tags: architecture, beginner, compute storage separation

Alluxio Architecture and Data Flow

October 16, 2018

Alluxio was created because we saw a need for innovation at the data layer rising from the growing complexity of connecting multiple compute frameworks to an ever-expanding mix of storage systems and formats. Our approach uses a memory-centric architecture that abstracts files and objects in underlying persistent storage systems and provides a shared data access layer for compute applications.

Alluxio is not a persistent storage system. Instead, Alluxio serves as a data access layer, residing between any persistent storage system (such as Amazon S3, Microsoft Azure Object Store, Apache HDFS or OpenStack Swift) and computation frameworks (such as Apache Spark, Presto or Hadoop MapReduce). This whitepaper provides a technical overview of the Alluxio architecture and describes the data flow for common read and write scenarios.

Tags: architecture, beginner, compute storage separation, overview

Data EcoSystem 2.0

October 1, 2018 by Haoyuan Li

Learn more about data unification for the digital economy and how Alluxio’s data orchestration brings your data to your compute, wherever it’s located.

Tags: architecture, compute storage separation, overview

Tag: architecture