Author: Bin Fan at Alluxio

Setting the Stage for Alluxio Community to Soar in the Year of the Dragon: 2023 Recap and 2024 Outlook

January 9, 2024 By Hope Wang, Chanchan Mao, Bin Fan, Shouwei Chen, Tango Tian, Tianyu Wang, Shun Lv and Allan Sha

As we step into 2024, we look back and celebrate an incredible year of 2023 for the Alluxio community. First and foremost, thank you to all of our contributors and the broader community! Together, we have achieved remarkable milestones. 💖 📈 Highlights by Numbers Let’s take a look at the Alluxio in 2023 by numbers. … Continued

Introducing Alluxio Enterprise AI and A Vision Beyond Unintelligent Storage

October 18, 2023 By Adit Madan, Bin Fan and Haoyuan Li

We take great pride in the Alluxio Data Platform serving many of the most critical data-driven applications in the world as we speak today. Each of us interact with platforms empowered by Alluxio on a daily basis, and unknowingly you are as well. From the voice assistant we speak to, the bank we transact with, … Continued

Introducing DORA: The Next-generation Alluxio Architecture

October 18, 2023 By Beinan Wang, Bin Fan, Bowen Ding, Jiaming Mai, Hua Huang, Lu Qiu, Jianjian Xie, Shawn Sun, Lucy Ge, Chunxu Tang, Kai Zhang and Hope Wang

Today, we are thrilled to launch the Alluxio Enterprise AI product. One of the key innovations is the introduction of the next-generation architecture DORA – a Decentralized Object Repository Architecture. This blog talks about our development of the DORA architecture, including our motivation, design decisions, and implementation. 1. Moving from Data Analytics to the AI … Continued

Millions Saved Annually: Unleashing the Power of Alluxio + HDFS at Uber

May 29, 2023 By Bin Fan, Beinan Wang, Shouwei Chen, Bowen Ding, Jiaming Mai, Jianjian Xie and Hope Wang

In October 2022, Uber’s Presto team shared in a blog post using the Alluxio SDK cache to boost Presto query performance and cost efficiency. This achievement is a major milestone in the collaboration between Alluxio and Uber. Thus far, the Uber Presto team has implemented the Alluxio SDK cache in three production clusters spanning over … Continued

Announcing Our First AI 🤖 PMC Member: CacheGPT

April 1, 2023 By Bin Fan, Yuyang Wang, Beinan Wang and Hope Wang

We are thrilled to announce that CacheGPT, a state-of-the-art natural language generation model, has joined the Alluxio Project Management Committee (PMC) as our newest member! CacheGPT has been an active contributor to Alluxio since the beginning of this year. It reviews pull requests and draft documentation using only emojis! See our new emoji-enriched documentation here! … Continued

Hopping into the Year of Rabbit with Alluxio Community

January 26, 2023 By Bin Fan, Jasmine Wang, Hope Wang and Chanchan Mao

As we close out the Year of Tiger and welcome the Year of Rabbit, we are filled with gratitude for the support and contributions of the members of Alluxio Open Source Community. Thanks to your dedication and trust, the Alluxio Open Source project and community has continued to thrive and grow in ways we never … Continued

What’s Next for Data Analytics, AI, and Cloud in 2023?

December 27, 2022 By Bin Fan

Originally published on vmblog.com: https://vmblog.com/archive/2022/12/27/alluxio-2023-predictions-what-s-next-for-data-analytics-ai-and-cloud-in-2023.aspx As we enter 2023, the world of analytics, AI, and cloud is entering an exciting new phase, with a wide range of innovations and developments set to reshape the landscape. Below are some trends that will have the most impact in the coming year. Trend 1: Cloud cost optimization is … Continued

Recommendations to Level Up Your Machine Learning Platform

April 12, 2022 By Bin Fan

With machine learning (ML) and artificial intelligence (AI) applications becoming more business-critical, organizations are in the race to advance their AI/ML capabilities. To realize the full potential of AI/ML, having the right underlying machine learning platform is a prerequisite.

Orchestrating Data for Machine Learning Pipelines

April 8, 2022 By Bin Fan

This article will discuss a new solution to orchestrating data for end-to-end machine learning pipelines that addresses the above questions. I will outline common challenges and pitfalls, followed by proposing a new technique, data orchestration, to optimize the data pipeline for machine learning.

Bin Fan

VP Open Source and Founding Engineer, Alluxio