Google Cloud Dataproc is a widely used fully managed Spark and Hadoop service to run big data analytics and compute workloads in the cloud. … Continued
On-Demand Videos
If you’re a MapR user, you might have concerns with your existing data stack. Whether it’s the complexity of Hadoop, financial instability and no … Continued
Many Spark users may not be aware of the differences in memory utilization between caching data directly in-memory into the Spark JVM versus storing … Continued
The DBS team was tasked to solve their compute capacity problem. They wanted to provide faster insights and analyze data for a range of … Continued
In this panel, creators of open source projects share their stories from why they started the project to the challenges they encountered on the … Continued
In this talk, HY discussed the key challenges and trends impacting data engineering, and explores the concept of Data Orchestration. … Continued
This session talks about challenges associated with querying diverse data sources at Walmart and how those are tackled using Presto & Alluxio. … Continued
In this talk, we share our lessons in building and rebuilding our monitoring systems and data platforms at Electronic Arts (EA). … Continued
Best use cases for Presto from the Data Engineer’s perspective. Also hear about recent Presto advancements such as Cost-Based Optimizer, Kubernetes-native deployment and the … Continued