The insideBIGDATA IMPACT 50 List for Q4 2023

InsideBigData

The team here at insideBIGDATA is deeply entrenched in keeping the pulse on the big data ecosystem of companies from around the globe. We’re in close contact with the movers and shakers making waves in the technology areas of big data, data science, machine learning, AI and deep learning.

GPUs Are Fast, I/O is Your Bottleneck

SDTimes/ITOpsTimes

Unless you’ve been living off the grid, the hype around Generative AI has been impossible to ignore. A critical component fueling this AI revolution is the underlying computing power, GPUs. The lightning-fast GPUs enable speedy model training. But a hidden bottleneck can severely limit their potential – I/O.

A deep dive into caching in Presto

Infoworld

Presto is a popular, open source, distributed SQL engine that enables organizations to run interactive analytic queries on multiple data sources at a large scale. Caching is a typical optimization technique for improving Presto query performance. It provides significant performance and efficiency improvements for Presto platforms.

Uber takes the fast lane with Alluxio

Blocks & Files

Uber is using Alluxio’s virtual distributed file system to speed its Hadoop-based analytics processing by caching hot read data.

Heard on the Street – 8/24/2023

InsideBigData

Welcome to insideBIGDATA’s “Heard on the Street” round-up column! In this regular feature, we highlight thought-leadership commentaries from members of the big data ecosystem. Each edition covers the trends of the day with compelling perspectives that can provide important insights to give you a competitive advantage in the marketplace.