Author: Hope Wang at Alluxio

How Can AI Platforms Adapt to Hybrid or Multi-Cloud Environments?

May 20, 2024 By Hope Wang

This article was originally published on Spiceworks. https://www.spiceworks.com/tech/artificial-intelligence/guest-article/adapting-ai-platform-to-hybrid-cloud/ This blog discusses the challenges of implementing AI platforms in hybrid and multi-cloud environments and shares examples of organizations that have prioritized security and optimized cost management using the data access layer. In recent years, AI platforms have undergone significant transformations as GenAI and AI continue to … Continued

Maximize GPU Utilization for Model Training

April 3, 2024 By Hope Wang

GPU utilization, or GPU usage, is the percentage of GPUs’ processing power being used at a particular time. As GPUs are expensive resources, optimizing their utilization and reducing idle time is essential for enterprise AI infrastructure. This blog explores bottlenecks hindering GPU utilization during model training and provides solutions to maximize GPU utilization. 1. Why … Continued

IWD 2024: Empower Women Developers in the Open-Source Community

March 29, 2024 By Hope Wang

This article was originally published on ITBrief. The author is Hope Wang, Developer Advocate, Alluxio. As we celebrate International Women’s Day, it is important to reflect on the progress we have made toward gender equality in the tech industry, particularly in open-source software (OSS). While there is still much work to be done, I am … Continued

Setting the Stage for Alluxio Community to Soar in the Year of the Dragon: 2023 Recap and 2024 Outlook

January 9, 2024 By Hope Wang, Chanchan Mao, Bin Fan, Shouwei Chen, Tango Tian, Tianyu Wang, Shun Lv and Allan Sha

As we step into 2024, we look back and celebrate an incredible 2023 for the Alluxio community. First and foremost, thank you to all of our contributors and the broader community! Together, we have achieved remarkable milestones. 💖 📈 Highlights by Numbers Let’s take a look at Alluxio in 2023 by the numbers. … Continued

Why Adding NAS/NFS on Object Storage May not Solve Your Data Access Problem of AI

November 28, 2023 By Tarik Bennett, Beinan Wang and Hope Wang

In this blog, we discuss the data access challenges in AI and why commonly used NAS/NFS may not be a good option for your organization. 1. Early Architecture of AI/ML According to Gartner, although LLMs are heavily hyped, most organizations are in the early stages, with some in production. In the early stages of … Continued

GPUs Are Fast, I/O is Your Bottleneck

November 7, 2023 By Hope Wang

This article was initially posted on ITOpsTimes. Unless you’ve been living off the grid, the hype around Generative AI has been impossible to ignore. A critical component fueling this AI revolution is the underlying computing power, GPUs. The lightning-fast GPUs enable speedy model training. But a hidden bottleneck can severely limit their potential – I/O. If … Continued

A Deep Dive into Caching in Presto

October 11, 2023 By Hope Wang and Beinan Wang

This article was initially posted on InfoWorld. Understand the caching mechanisms for the popular distributed SQL engine and how to use them to improve query speed and efficiency. Presto is a popular, open source, distributed SQL engine that enables organizations to run interactive analytic queries on multiple data sources at a large scale. Caching is a typical optimization … Continued

Alluxio Kubernetes Operator Tutorial: Simplifying Deploying and Managing Alluxio Clusters

August 14, 2023 By Shawn Sun, Beinan Wang and Hope Wang

This blog provides a tutorial on using the Kubernetes operator to simplify deploying and managing Alluxio clusters on Kubernetes. Introduction The Alluxio Kubernetes operator makes deploying and managing Alluxio and the datasets on Kubernetes easier. With the operator, Alluxio clusters can be deployed and managed seamlessly like any other native Kubernetes application. The operator handles … Continued

Speed Trino Queries with These Performance-Tuning Tips

August 2, 2023 By Hope Wang and Beinan Wang

Originally published at The New Stack: https://thenewstack.io/speed-trino-queries-with-these-performance-tuning-tips/ In this article, we will discuss how data engineers and data infrastructure engineers can make Trino, a widely used query engine, faster and more efficient. An open source distributed SQL query engine, Trino is widely used for data analytics on distributed data storage. Optimizing Trino to make it faster … Continued