
We are delighted by the success of the inaugural Data Orchestration Summit on Nov. 7, 2019! Organized by Alluxio, this one-day event sold out with nearly 400 attendees! Data engineers, cloud engineers, and data scientists joined the talks of 24 industry leaders from all over the globe, who shared their experiences building cloud-native data and AI platforms. All session recordings and slides are now available.
Key Announcements
Haoyuan Li, founder and CTO of Alluxio, opened the summit with his talk, Orchestrate a Data Symphony, in which he discusses the key challenges and trends affecting data engineering when building modern data and AI platforms, and explores the concept of Data Orchestration.
In the Alluxio tech talks, founding engineers Calvin Jia, Bin Fan, and Gene Pang dive into the key open source features of the Alluxio 2 series, community updates, and the latest innovations bringing Alluxio open source into the world of structured data.
Session highlights
The featured talks at the Summit highlighted how leading companies architect their data and AI platforms through the data orchestration approach, leveraging open source technologies such as Alluxio, Apache Spark, and Presto. Some session highlights include:
- Orchestrate a Data Symphony - Haoyuan Li, Alluxio
- Enterprise Distributed Query Service powered by Presto & Alluxio across clouds at WalmartLabs - Ashish Tadose, Walmart
- How to Run Fast Presto Analytics with Alluxio in Cloud - a Production Experience - Danny Linden, Ryte
- Alluxio tech talks: What’s New in Alluxio 2 - Calvin Jia & Bin Fan, and Alluxio Innovations for Structured Data - Gene Pang
- Open Source Panel: how to create an open source project - Ben Lorica, O’Reilly; Tobi Knaup, D2iQ; Maxime Beauchemin, Preset; Haoyuan Li, Alluxio
- Data Orchestration for Analytics and AI workloads at DBS Bank - Carlos Queiroz, Development Bank of Singapore (recording will soon be available here)
What's next?
- Join the conversation on the community Slack channel!
- Given the strong interest, we’re bringing back the hands-on lab, so stay tuned!

Cheers!
Amelia and Bin
Data Orchestration Summit Co-Chairs
