Trino Optimization With Distributed Caching on Data Lakes: Trino Fest 2023 Session Recap

Originally published on trino.io: https://trino.io/blog/2023/07/21/trino-fest-2023-alluxio-recap.html By 2025, there will be 100 zetabytes stored in the cloud. That’s 100,000,000,000,000,000,000,000 bytes – a huge, eye-popping number. But only about 10% of that data is actually used on a regular basis. At Uber, for example, only 1% of their disk space is used for 50% of the data they access … Continued

The Trino Optimization Handbook

Your 🐰 queries are slow 🐢 … you’re frustrated 😩 … Don’t let suboptimal Trino performance hold you back any longer! Unlock the full potential of Trino and transform your data analytics game. Discover the secrets behind Trino’s query engine and learn how to overcome bottlenecks to achieve⚡ blazing-fast  query performance. In this comprehensive guide, … Continued

Tags: , , , ,

Alluxio Product School | Alluxio 2.9 Release Overview

Alluxio Product School *

In November’s Product School, Adit Madan, Director of Product Management at Alluxio, will highlights new features, enhanced manageability, improved security and performance in Alluxio 2.9 release.

Thousand-Node Alluxio Cluster Powers Game AI Platform – A Production Case Study from Tencent

Tencent is one of the largest technology companies in the world and a leader in the gaming sector. The game AI platform supports AI research and development at Tencent. To provide model training with the best experience, Tencent has implemented a 1000-node Alluxio cluster and designed a scalable, robust, and performant architecture to accelerate the game AI training.

Tags: , , , , , , ,

Machine Learning Model Training with Alluxio: Part 2 – Comparable Analysis

This blog is the second in the machine learning series following the previous one, which discussed Alluxio’s solution to improve training performance and simplify data management. With the help of Alluxio, loading data from cloud storage, training and caching data can be done in a transparent and distributed way as a part of the training process, thus improving training performance and simplifying data management. In this blog 2 of the series, we focus on comparing traditional solutions with Alluxio’s.