Blog

Modernize your analytics workloads with NetApp and Alluxio
This blog was originally published on the NetApp website: https://www.netapp.com/blog/modernize-analytics-workloads-netapp-alluxio/
Imagine, as an IT leader, having the flexibility to choose any service available in the public cloud or on premises. And imagine being able to scale the storage for your data lakes while keeping control over data locality and protection for your organization. With these goals in mind, NetApp and Alluxio are joining forces to help customers adapt to new requirements for modernizing their data architecture, with low-touch operations for analytics, machine learning, and artificial intelligence workflows.
Hybrid Multi-Cloud
Data Platform Modernization
Large Scale Analytics Acceleration

Designing the Presto Local Cache at Uber: A Collaboration Between Uber and Alluxio, Part 2
In the previous blog, we introduced Uber’s Presto use cases and how we collaborated to implement the Alluxio local cache to overcome different challenges in accelerating Presto queries. This second part discusses the improvements to the local cache metadata.
Large Scale Analytics Acceleration


What's New in Alluxio 2.8: Enhanced S3 API Functionality, Enterprise-Grade Security, and Data Migration with Better Usability and Low Cost
Alluxio 2.8 focuses on the S3 API, enterprise-grade security, and scalability and observability in data migration. The enhanced S3 API makes managing Alluxio easier than ever. Features such as encryption at rest and policy-driven data management further improve Alluxio’s functionality to support enterprise customers.

From ZooKeeper to Raft: How Alluxio Stores File System State with High Availability and Fault Tolerance
Raft is a consensus algorithm for state machine replication, used to provide high availability (HA) and fault tolerance. This blog shares how Alluxio has moved to a built-in, Raft-based journal system as its HA implementation, removing the dependency on ZooKeeper.
Large Scale Analytics Acceleration

Recommendations to Level Up Your Machine Learning Platform
With machine learning (ML) and artificial intelligence (AI) applications becoming more business-critical, organizations are racing to advance their AI/ML capabilities. Realizing the full potential of AI/ML requires the right underlying machine learning platform.
Data Migration
GPU Acceleration
Model Training Acceleration

Orchestrating Data for Machine Learning Pipelines
This article discusses a new approach to orchestrating data for end-to-end machine learning pipelines. I outline common challenges and pitfalls, then propose a technique, data orchestration, to optimize the data pipeline for machine learning.
GPU Acceleration
Model Training Acceleration

Improving Presto Architectural Decisions with Alluxio Shadow Cache at Meta (Facebook)
In a collaboration between Meta (Facebook), Princeton University, and Alluxio, we have developed "Shadow Cache", a lightweight Alluxio component that tracks the working set size and the infinite cache hit ratio. Shadow Cache dynamically keeps track of the working set size over a past time window and is implemented as a series of Bloom filters. It is deployed in Presto at Meta (Facebook), where it is used to understand system bottlenecks and to inform routing design decisions.
Large Scale Analytics Acceleration


Accelerate Auto Data Tagging with Alluxio and Spark in Hybrid Cloud: A Practice at WeRide
This blog shares how WeRide, an autonomous driving technology company, uses Alluxio and Spark to accelerate its auto data tagging system.
Hybrid Multi-Cloud
GPU Acceleration
Large Scale Analytics Acceleration
Model Training Acceleration

Pair Spark with Alluxio to Modernize Your Data Platform
Alluxio is a data orchestration platform that unifies data silos across heterogeneous environments. This is the last article in a series covering the basics of Alluxio’s architecture and solution.
Data Platform Modernization
GPU Acceleration