Alluxio Will Present at ApacheCon about Data Orchestration for Machine Learning
September 15, 2021

SAN MATEO, CA – September 15, 2021 - Alluxio, the developer of open source data orchestration software for large-scale analytics and AI/ML workloads, today announced that two of its senior engineers will lead a session at ApacheCon being held virtually on September 21-23, 2021.

Alluxio Session Details:

Session Title: “Alluxio Data Orchestration for Machine Learning” under the Big Data track

Session Time: Wednesday, September 22 at 15:50 UTC / 11:50 AM ET / 8:50 AM PT

Session Presenters: Alluxio’s Founding Engineer and VP of Open Source Bin Fan; and Software Engineer Lu Qiu

Session Overview: Alluxio’s capabilities as a Data Orchestration framework have encouraged users to onboard more of their data-driven applications to an Alluxio powered data access layer. Driven by strong interests from our open-source community, the core team of Alluxio started to re-design an efficient and transparent way for users to leverage data orchestration through the POSIX interface. This effort has a lot of progress with the collaboration with engineers from Microsoft, Alibaba and Tencent. Particularly, we have introduced a new JNI-based FUSE implementation to support POSIX data access, created a more efficient way to integrate Alluxio with FUSE service, as well as many improvements in relevant data operations like more efficient distributedLoad, optimizations on listing or calculating directories with a massive amount of files, which are common in model training. We will also share our engineering lessons and roadmap in future releases to support Machine Learning applications.

ApacheCon is the official global conference series of The Apache Software Foundation (ASF). Since 1998 – before the ASF’s incorporation – ApacheCon has been drawing participants at all levels to explore ”Tomorrow’s Technology Today” across 300+ Apache projects and their diverse communities. ApacheCon showcases the latest developments in Apache projects and emerging innovations through hands-on sessions, keynotes, real-world case studies, trainings, hackathons, and more.To register for ApacheCon, visit here (https://www.apachecon.com/acah2021/register.html).

Tweet this:  @Alluxio will present at @ApacheCon on #DataOrchestration for #MachineLearning https://bit.ly/3zGUhVy #cloud #opensource

About Alluxio

Alluxio is a leading provider of accelerated data access platforms for AI workloads. Alluxio’s distributed caching layer accelerates AI and data-intensive workloads by enabling high-speed data access across diverse storage systems. By creating a global namespace, Alluxio unifies data from multiple sources—on-premises and in the cloud—into a single, logical view, eliminating the need for data duplication or complex data movement.

Designed for scalability and performance, Alluxio brings data closer to compute frameworks like TensorFlow, PyTorch, and Spark, significantly reducing I/O bottlenecks and latency. Its intelligent caching, data locality optimization, and seamless integration with modern data platforms make it a powerful solution for teams building and scaling AI pipelines across hybrid and multi-cloud environments. Backed by leading investors, Alluxio powers technology, internet, financial services, and telecom companies, including 9 out of the top 10 internet companies globally. To learn more, visit www.alluxio.io.

Media Contact:
Beth Winkowski
Winkowski Public Relations, LLC for Alluxio
978-649-7189
beth@alluxio.com

News & Press

Sign-up for a Live Demo or Book a Meeting with a Solutions Engineer