data orchestration Archives

Integrating Open Source Alluxio in AWS EKS with Terraform

April 1, 2021

This presentation covers best practices and techniques for setting up a cluster with open source Alluxio on AWS EKS for one of our clients. The resulting deployment was made scalable, reliable, and secure by adapting it to Kubernetes RBAC.

Tags: aws, data orchestration, eks, kubernetes, meetup, terraform

Accelerate Analytics and ML in the Hybrid Cloud Era

February 23, 2021

Many companies we talk to have on-premises data lakes and use the cloud(s) to burst compute. Many are now establishing new object data lakes as well. As a result, analytics engines such as Hive, Spark, and Presto, along with machine learning workloads, experience sluggish response times when data and compute sit in multiple locations. We also know there is an immense and growing data management burden to support these workflows.

Tags: analytics, data orchestration, hybrid cloud, machine learning, overview, webinar

Accelerate Analytics and ML in the Hybrid Cloud Era

Alluxio Tech Talk * April 6, 2021

In this talk, we will walk through what Alluxio’s Data Orchestration for the hybrid cloud era is and how it solves the performance and data management challenges we see.

Accelerate Analytics and ML in the Hybrid Cloud Era

Alluxio Tech Talk * February 23, 2021

In this talk, we will walk through what Alluxio’s Data Orchestration for the hybrid cloud era is and how it solves the performance and data management challenges we see.

Alluxio Architecture and Scaling Performance

December 13, 2020

In this talk, I will introduce the high-level architecture of the current system and present the various components of Alluxio. I will also discuss some of the main challenges of large-scale Alluxio deployments and the lessons we learned from those environments. This talk will detail some of the major scalability improvements added in the past several months and how users can benefit from the changes.

Tags: architecture, data orchestration, data orchestration summit, scalability

Modernizing Global Shared Data Analytics Platform and our Alluxio Journey

December 13, 2020

In this keynote, you will learn about the evolution of the global data platform at Rakuten, which is spread across multiple regions and clouds. You will also hear about the journey over the years and the use of data orchestration for multiple use cases.

Tags: data analytics, data orchestration, data orchestration summit, rakuten

Introducing the Hub for Data Orchestration

December 13, 2020

We introduce Data Orchestration Hub, a management service that makes it easy to build an analytics or machine learning platform on data sources across regions to unify data lakes. Easy-to-use wizards connect compute engines, such as Presto or Spark, to data sources across data centers or from a public cloud to a private data center. In this session, you will see “The Hub” used to connect a compute cluster in the cloud with data sources on-premises using Alluxio. This new service allows you to build a hybrid cloud on your own, without the expertise needed to manage or configure Alluxio.

Tags: data orchestration, data orchestration summit, hub

The Future of Computing is Distributed

December 13, 2020

Distributed applications are not new. The first distributed applications were developed over 50 years ago with the arrival of computer networks, such as ARPANET. Since then, developers have leveraged distributed systems to scale out applications and services, including large-scale simulations, web serving, and big data processing. Until recently, however, distributed applications have been the exception rather than the norm. This is changing quickly.

Tags: data orchestration, data orchestration summit, distributed applications

The Pandemic Changes Everything, The need for speed and resiliency

December 13, 2020

This is an open source community conference focused on the key data engineering challenges and solutions around building cloud-native data and AI platforms using the latest technologies, such as Alluxio, Apache Spark, Apache Airflow, Presto, TensorFlow, and Kubernetes.

Tags: data orchestration, data orchestration summit, intel
