spark Archives | Alluxio

Integrate Alluxio With Your Existing Data Stack Without Redefining Hive Tables

December 20, 2022 By Greg Palmer and Hope Wang

One of the key features of Alluxio Enterprise Edition is Transparent URI, which lets you integrate Alluxio with your existing data stack without changing the location metadata in the Hive Metastore. This article is a tutorial on using the Alluxio Transparent URI capability with Trino, Hive Metastore and Spark, and with … Continued

Tutorial of Building Multi-Cloud Data Lake using Delta Lake and Alluxio

October 25, 2022 By Zijian Zhu and Hope Wang

This article introduces how to read and write Delta Lake tables on Alluxio. You can build a multi-cloud data lake using Delta Lake and Alluxio, reducing your data storage costs and increasing flexibility. 1. Overview 1.1 About Delta Lake Delta Lake is an open source storage framework that enables building a Lakehouse architecture and brings reliability … Continued

Unifying Cross-region Access in the Cloud at Expedia Group — The Path Toward Data Mesh in the Brand World

July 29, 2022 By Jian Li (Senior Software Engineer @ Expedia Group)

This article shares the data platform practice at Expedia for federating data lakes that span multiple geographic regions in the cloud. 1. Background Expedia Group (NASDAQ: EXPE) is an American online travel shopping company for consumer and small business travel. Expedia powers travel for everyone, everywhere through our global platform, with industry-leading technology solutions to … Continued

Accelerate Auto Data Tagging with Alluxio and Spark in Hybrid Cloud – A Practice in WeRide

March 14, 2022 By Feifei Cai and Hao Zhu

This blog shares the practice of using Alluxio and Spark to accelerate the auto data tagging system in WeRide, an autonomous driving technology company.

Alluxio + Spark: Accelerating Auto Data Tagging in WeRide

December 14, 2021

WeRide provides an overview of its Alluxio + Spark use case, which has been deployed and is running in production to accelerate auto data tagging in autonomous driving development.

Tags: alluxio day, data tagging, spark, use case
