Join us for these great talks featuring speakers from RisingWave Labs, Onehouse, Shopee, and Alluxio! Learn how Alluxio helps the big data analytics stack become cloud-native, why the modern data stack is more than a buzzword, which major community-driven features have come out of Apache Hudi’s open-source community, and how Shopee leverages Alluxio to accelerate Presto queries. Attendees can join either in person in Singapore or online via Zoom.
Tag: data orchestration
Join fellow Alluxio community users for the 15th Alluxio Community Day virtual event featuring speakers from OceanBase, Twitter, StarTree, and Alluxio!
Join us for the 12th Alluxio Day virtual community event featuring speakers from Shopee, Websec, and Alluxio.
Today, we are excited to announce the launch of non-fungible tokens (NFTs) as a new feature in our leading data orchestration platform.
Alluxio is the data orchestration platform that unifies data silos across heterogeneous environments. This is the last article in a series covering the basics of Alluxio’s architecture and solution.
By bringing Alluxio together with Spark, you can modernize your data platform in a scalable, agile, and cost-effective way. In this post, we provide an overview of the Spark + Alluxio stack. We explain the architecture, discuss real-world examples, describe deployment models, and showcase performance and cost benchmarking.
Join us for the 10th Alluxio Day virtual community event featuring speakers from Uber, BiliBili, and Alluxio.
As data stewards and security teams provide broader access to their organization’s data lake environments, having a centralized way to manage fine-grained access policies becomes increasingly important. Alluxio can use Apache Ranger’s centralized access policies in two ways: 1) directly controlling access to virtual paths in the Alluxio virtual file system or 2) enforcing existing access policies for the HDFS under stores.
Data platform teams are increasingly challenged with accessing multiple data stores that are separated from compute engines such as Spark, Presto, TensorFlow, or PyTorch. Whether your data is distributed across multiple datacenters, multiple clouds, or both, a successful heterogeneous data platform requires efficient data access. Alluxio enables you to embrace the separation of storage from compute and use data orchestration to simplify adoption of the data lake and data mesh paradigms for analytics and AI/ML workloads.