A collaboration among Alibaba, Alluxio, and Nanjing University to tackle the problems of Deep Learning model training in the cloud. Our goal was to reduce the cost and complexity of data access for Deep Learning training in a hybrid environment, and the work resulted in a reduction of over 40% in training time and cost.
Find our rich collection of White Papers, Case Studies, Presentations, and Videos here.
This article describes how Alluxio can accelerate the training of deep learning models in a hybrid cloud environment when using Intel’s Analytics Zoo open source platform, powered by oneAPI. It covers the new architecture and workflow, as well as Alluxio’s performance benefits and benchmark results.
Are you using SQL engines, such as Presto, to query an existing Hive data warehouse and running into challenges such as an overloaded Hive Metastore with slow and unpredictable access, unoptimized data formats and layouts (for example, too many small files), or a lack of influence over the existing Hive system and other Hive applications?
In this online presentation, we present how ING is leveraging Presto (interactive query), Alluxio (data orchestration & acceleration), S3 (massive storage), and DC/OS (container … Continued
EMR has become a widely used service to run big data analytics in the public cloud. But issues around slow/inconsistent EMR performance due to … Continued
This tech talk shows how to set up EMR Spark and Hive with Alluxio to seamlessly read/write to your S3 data lake, along … Continued
The 360 & Alluxio joint meetup in Beijing covers topics on distributed storage and Alluxio application practice. … Continued
Learn how to set up EMR Spark with Alluxio so Spark jobs can seamlessly read from and write to S3. See the performance comparison … Continued
Bay Area Meetup that includes presentations on the architecture of Presto, its separation of compute and storage, cloud-readiness, recent advancements in the project such … Continued
Alluxio's first cloud, data & orchestration Austin meetup featuring talks and demos on efficient data engineering with Apache Spark, Hive and Alluxio on S3. … Continued