alluxio Archives

Architecting a Heterogeneous Data Platform Across Clusters, Regions, and Clouds

TDEA | Alluxio * June 30, 2022

Alluxio foresaw the need for agility when accessing data across silos separated from compute engines like Spark, Presto, Tensorflow and PyTorch. Embracing the separation of storage from compute, the Alluxio data orchestration platform simplifies adoption of the data lake and data mesh paradigm for analytics and AI/ML.

Intel Innovation 2021

Virtual Event * October 27, 2021

Join us at Intel Innovation, the latest digital educational conference for developers and industry insiders. You’ll hear from the experts who deliver advanced AI, 5G, edge, cloud, and client technologies with speed and real-world scale. Exclusive sessions include product launches, demos, hands-on workshops, keynotes, and a sneak peek at Intel’s road map. Secure your spot at Intel Innovation.

Building High-Performance Data Lake Using Apache Hudi and Alluxio at T3Go

November 20, 2020 By Trevor Zhang (T3Go), Vino Yang (T3Go), Jasmine Wang and Bin Fan

How T3Go’s high-performance data lake using Apache Hudi and Alluxio shortened the time for data ingestion into the lake by up to a factor of 2. Data analysts using Presto, Hudi, and Alluxio in conjunction to query data on the lake saw queries speed up by 10 times faster.

Accelerating Data Computation on Ceph Objects using Alluxio

Alluxio Global Online Meetup * November 10, 2020

In this talk, we will present how using Alluxio computation and storage ecosystems can better interact benefiting of the “bringing the data close to the code” approach. Moving away from the complete disaggregation of computation and storage, data locality can enhance the computation performance.

Accelerating Queries on Cloud Data Lakes

ITPro Today Webinar * August 20, 2020

Join us for this webinar where Alex Ma of Alluxio, an open source data orchestration platform, will discuss how a data orchestration approach offers a solution for connecting traditional on-prem data centers with the cloud, data centers with other data centers, and clouds with other clouds. With Alluxio’s “zero-copy” burst solution, companies can bridge remote data centers with computing frameworks in other locations, enabling them to offload compute and leverage the flexibility, scalability, and power of the cloud for their remote data.

Enabling Hybrid Cloud Analytics and AI with Data Orchestration

IoT World Today Webinar * August 5, 2020

Adit Madan and Parviz Peiravi offer an overview of the Alluxio data orchestration layer that provides a unified data access layer for hybrid and multi cloud deployments, leveraging Intel® Optane™ Persistent Memory for higher performance caching at reduced cost. The data access layer enables distributed compute engines like Presto, TensorFlow, and PyTorch to transparently access data from various storage systems (including S3, HDFS, and Azure) while actively leveraging a multi-tier cache to accelerate data access.

Ultra Fast Deep Learning in Hybrid Cloud Using Intel Analytics Zoo & Alluxio

April 23, 2020

Today, many people run deep learning applications with training data from separate storage such as object storage or remote data centers. This presentation will demo the Intel Analytics Zoo + Alluxio stack, an architecture that enables high performance while keeping cost and resource efficiency balanced without network being I/O bottlenecked.

Tags: alluxio, analytics zoo, deep learning applications, high performance, hybrid cloud, intel, meetup, storage

The Practice of Presto & Alluxio in E-Commerce Big Data Platform

November 15, 2019

JD.com is China’s largest online retailer. It uses Alluxio to provide support for ad hoc and real-time stream computing, using Alluxio-compatible HDFS URLs and Alluxio as a pluggable optimization component.

Tags: alluxio, big data, performance, presto, use case

Workshop: Presto on Alluxio Hands-On Lab

November 12, 2019

Get started with Presto and Alluxio – Hands-on experience launching the EC2 instance, explore the Alluxio filesystem and cluster status, and run queries with Presto on Alluxio

Tags: alluxio, compute storage separation, conference, data orchestration, data orchestration summit, presto

Tag: alluxio