Some compute frameworks, such as Apache Spark, have single-node caching. How is Alluxio different from single-node caching?
If you have a hybrid cloud architecture connected over either a VPN or a dedicated high-speed circuit, does network speed become a bottleneck in the hybrid data use case?
How does replication in Alluxio happen across worker nodes? Is the unit of replication a file or a block?