alluxio learning center

Beginner to advanced topics on analytics, AI/ML, storage, and cloud concepts


Introduction to Amazon EMR and MapReduce
Amazon Elastic MapReduce (EMR) is a tool for processing and analyzing big data quickly. Using query tools like Spark, Hive, HBase, and Presto along with storage (like S3) and compute capacity (like EC2).

FAQ on Amazon EMR and EC2
The key differences between Amazon EMR and EC2, and how EMR works.


Introduction to Presto and commonly asked questions
Presto was originally designed at Facebook to run interactive queries against large data warehouses in Hadoop and run fast queries against data warehouses storing petabytes of data.