Sessions Include: Architecting data platforms with Presto and Alluxio; Microservice to analyze Presto queries panel with ByteDance; and Speeding up Presto at Uber with Alluxio Caching
SAN MATEO, CA – July 13, 2022 – Alluxio, the developer of the open source data orchestration platform for data-driven workloads such as large-scale analytics and AI/ML, today announced its participation in PrestoCon Day, a day dedicated to all things Presto, taking place virtually on Thursday, July 21, 2022. Alluxio will also be hosting a Presto Committer Virtual Office Hour to answer any questions related to Presto and Alluxio.
Alluxio Sessions at PrestoCon
July 21 at 11:40 am PT – “Architecting your Data Platform with Presto and Alluxio in Heterogenous Environments,” by Adit Madan, director of product management at Alluxio
As the cloud evolves and adoption of hybrid-cloud and multi-cloud approaches grows, data architectures must adapt to heterogeneous environments. In this talk, Adit Madan shares insights on how to architect a data platform with Presto and Alluxio that provides agility and simplicity to your data team.
July 21 at 11:45 am PT – “Dynamic UDF Framework and its Applications,” by Rongrong Zhong at Alluxio and Yanbing Zhang, software engineer at ByteDance
In this talk, Rongrong and Yanbing will discuss a microservice that they built at Uber to analyze Presto queries. The Presto query engine does not provide endpoints for query analysis; one has to either execute the query or gather insights from the query's explain plan. They will cover: (1) the work required to perform query analysis in a microservice using Presto as a library; (2) predicate analysis on queries to produce data formatting recommendations that improve query performance; and (3) use of the analysis service for query result cache invalidation, where the analysis determines whether the results from a previous run of a query are still valid and can be reused.
July 21 at 1:15 pm PT – “Speeding up Presto at Uber with Alluxio Caching,” by Chen Liang, senior software engineer at Uber, and Beinan Wang, software engineer at Alluxio
At Uber, Presto is heavily used as one of the primary data analytics tools, and its query performance has a profound impact on production. As part of the Presto optimization effort, Uber explored Alluxio as a caching solution. Alluxio is an open source data orchestration platform used by many compute frameworks as a caching layer. Alluxio caching is currently enabled on ~2,000 nodes across 6 clusters at Uber. This session will present Uber’s journey integrating the Alluxio cache into Presto. It will review the specific challenges encountered and how they were addressed, and share the resulting performance improvements. Lastly, the session will discuss plans and next steps, as well as potential future collaboration opportunities with the community.
View all the sessions in the full program schedule. PrestoCon Day is a free virtual event and registration is open.
Tweet this: @AlluxioIO announces its participation in #PrestoCon Day #cloud #opensource #analytics #presto https://bit.ly/3ORA32q
About Alluxio
Alluxio is a leading provider of accelerated data access platforms for AI workloads. Alluxio’s distributed caching layer accelerates AI and data-intensive workloads by enabling high-speed data access across diverse storage systems. By creating a global namespace, Alluxio unifies data from multiple sources—on-premises and in the cloud—into a single, logical view, eliminating the need for data duplication or complex data movement.
Designed for scalability and performance, Alluxio brings data closer to compute frameworks like TensorFlow, PyTorch, and Spark, significantly reducing I/O bottlenecks and latency. Its intelligent caching, data locality optimization, and seamless integration with modern data platforms make it a powerful solution for teams building and scaling AI pipelines across hybrid and multi-cloud environments. Backed by leading investors, Alluxio powers technology, internet, financial services, and telecom companies, including 9 out of the top 10 internet companies globally. To learn more, visit www.alluxio.io.
Media Contact:
Beth Winkowski
Winkowski Public Relations, LLC for Alluxio
978-649-7189
beth@alluxio.com