A collaboration of Alibaba, Alluxio, and Nanjing University in tackling the problems of Deep Learning model training in the cloud. Our goal was to reduce the cost and complexity of data access for Deep Learning training in a hybrid environment, which resulted in over 40% reduction in training time and cost.
Find our rich collection of White Papers, Case Studies, Presentations, and Videos here.
This article describes how Alluxio can accelerate the training of deep learning models in a hybrid cloud environment when using Intel’s Analytics Zoo open source platform, powered by oneAPI. Details on the new architecture and workflow, as well as Alluxio’s performance benefits and benchmarks results will be discussed.
Are you using SQL engines, such as Presto, to query existing Hive data warehouse and experiencing challenges including overloaded Hive Metastore with slow and unpredictable access, unoptimized data formats and layouts such as too many small files, or lack of influence over the existing Hive system and other Hive applications?
Introduction The exponential growth of the raw computational power, communication bandwidth, and storage capacity results in continuous innovation in how data is processed and … Continued
Strata+Hadoop World 2016 - Baidu deployed Alluxio to accelerate its big data analytics workload. Bin Fan and Haojun Wang explain why Baidu chose Alluxio, … Continued
Strata+Hadoop World 2016 - Tachyon, a memory-centric fault-tolerant distributed storage system. An introduction of architecture, performance evaluation, and real world use cases. … Continued
Barclays Data Scientist Gianmario Spacagna and Harry Powell, Head of Advanced Analytics, describe how they iteratively process raw data directly from the central data … Continued
ODSC West 2015 - Tachyon, a memory-centric fault-tolerant distributed storage system. An introduction of architecture, performance evaluation, and real world use cases. … Continued
Tachyon: A reliable memory-centric distributed storage system presentation by founder Haoyuan Li. … Continued
Memory is the key to fast big data processing. This has been realized by many, and frameworks such as Spark and Shark already leverage … Continued