compute storage separation Archives | Page 4 of 11

Hybrid Environments for Data Analytics is a Possibility

June 21, 2019 By Madan Kumar and Adit Madan

As the data ecosystem becomes massively complex and more and more disaggregated, data analysts and end users have trouble adapting and working with hybrid environments. The proliferation of compute applications along with storage mediums leads to a hybrid model that we are just not accustomed to.
With this disaggregated system data engineers now come across a multitude of problems that they must overcome in order to get meaningful insights.

Building fast and scalable big data and ML platforms at Pinterest and JD.com

Bay Area Meetup * June 19, 2019

This Alluxio Meetup features a chance to interact with other Alluxio users and developers, as well as three talks. Thanks to our joint host Data Council!

Decoupling Compute and Storage for Data Workloads

May 31, 2019 by Carlos Queiroz, DBS

Carlos Queiroz of DBS presents on how to decouple compute and storage for data workloads using Alluxio.

Tags: compute storage separation, hadoop, meetup

Running Spark & Alluxio in Kubernetes

Alluxio Community Office Hour * June 25, 2019

The latest advances in container orchestration by Kubernetes bring cost savings and flexibility to compute workloads in public or hybrid cloud environments. On the other hand, it introduces new challenges such as how to move data to compute efficiently, how to unify data across multiple or remote clouds, how to co-locate data with compute and many more. Alluxio approaches these problems in a new way. It helps elastic compute workloads realize the true benefits of the cloud, while bringing data locality and data accessibility to workloads orchestrated by Kubernetes

Evolution of big data stacks under computational and storage separation architecture

Shanghai * May 19, 2019

A new generation of open source big data, represented by Alluxio, born at the University of California at Berkeley, looks at this issue. Different from systems such as designing storage tight coupling to achieve low-cost reliable storage HDFS, by providing a virtual data storage layer defined and implemented by software for data applications, abstracting and integrating cloudy, hybrid cloud, multi-data center and other environments The underlying files and objects, and through intelligent workload analysis and data management, make data close to computing and provide data locality, big data and machine learning applications can be achieved with the same performance and lower cost.

Meetup: Data Transformation in Financial Services, Featuring DBS Bank

Singapore * May 21, 2019

Hear how DBS Bank is taking a new approach to making data-intensive compute independent of the storage. They will share the challenges as well as the new technology stack that includes technologies like Spark, Alluxio and object stores.

Running Presto with Alluxio on Amazon EMR

Alluxio Community Office Hour - May * May 21, 2019

Many organizations are leveraging EMR to run big data analytics on public cloud. However, reading and writing data to S3 directly can result in slow and inconsistent performance. Alluxio is a data orchestration layer for the cloud, and in this use case it caches data for S3, ensuring high and predictable performance as well as reduced network traffic.

Tag: compute storage separation