Learn about Alibaba's use case in deep learning and gene computing acceleration using Alluxio in Kubernetes. … Continued
On-Demand Videos
This talk includes why Netflix needed to build Iceberg, the project’s high-level design, and will highlight the details that unblock better query performance. … Continued
This talk covers an overview of the project and highlight best practices for creating performant input pipelines. … Continued
Learn why leading companies are moving towards a decoupled compute and storage architecture, and the associated challenges and requirements. Hear about how Spark and … Continued
Want to leverage your existing investments in Hadoop with your data on-premise and still benefit from the elasticity of the cloud? Like other Hadoop … Continued
Learn more about Bazaarvoice's use case leveraging Apache Spark, Hive, and Alluxio on S3. Along with how to set up Hive with Alluxio so … Continued
Haoyuan Li offers an overview of a data orchestration layer that provides a unified data access and caching layer for single cloud, hybrid, and … Continued
In this online presentation, we present how ING is leveraging Presto (interactive query), Alluxio (data orchestration & acceleration), S3 (massive storage), and DC/OS (container … Continued
EMR has become a widely used service to run big data analytics in the public cloud. But issues around slow/inconsistent EMR performance due to … Continued