This talk includes why Netflix needed to build Iceberg, the project’s high-level design, and will highlight the details that unblock better query performance. … Continued
Slides from our latest talks
This talk covers an overview of the project and highlight best practices for creating performant input pipelines. … Continued
ODSC WEST 2019 Cloud storage brings great flexibility in management and cost-efficiency to data scientists, but also introduces new challenges related to data accessibility … Continued
Learn why leading companies are moving towards a decoupled compute and storage architecture, and the associated challenges and requirements. Hear about how Spark and … Continued
Want to leverage your existing investments in Hadoop with your data on-premise and still benefit from the elasticity of the cloud? Like other Hadoop … Continued
Vitaliy and Dipti dive into how DBS Bank built a modern big data analytics stack, leveraging an object store as persistent storage even for … Continued
This online meetup shows why and how we solve some challenging technical issues, improve the speed, and reduce the costs of our AWS EMR … Continued
In this talk, we present: trends and challenges in the data ecosystem in cloud era; Data engineering in the cloud with data orchestration; Use … Continued
Learn more about Bazaarvoice's use case leveraging Apache Spark, Hive, and Alluxio on S3. Along with how to set up Hive with Alluxio so … Continued