sql Archives | Page 4 of 4

MOMO: Accelerating Ad Hoc Analysis with Spark SQL and Alluxio

March 20, 2018 By MOMO Team

Alluxio clusters act as a data access accelerator for remote data in connected storage systems. Temporarily storing data in memory, or other media near compute, accelerates access and provides local performance from remote storage. This capability is even more critical with the movement of compute applications to the cloud and data being located in object stores separate from compute. Caching is transparent to users, using read/write buffering to maintain continuity with persistent storage. Intelligent cache management utilizes configurable policies for efficient data placement and supports tiered storage for both memory and disk (SSD/HDD).

Whitepaper: MOMO – Accelerating Ad Hoc Analysis with Spark SQL and Alluxio

March 19, 2018

From our friends at MOMO The hadoop ecosystem makes many distributed system/algorithms easier to use and generally lowers the cost of operations. However, enterprises and vendors are never satisfied with that, so higher performance becomes the next issue. We considered several options to address our performance needs and focused our efforts on Alluxio, which improves performance … Continued

Tags: analytics, apache hadoop, apache spark, caching, case study, sql

Tag: sql