Production Spark and Tachyon Use Cases

Spark Summit Europe *

During the past several years, Spark has significantly changed the landscape of big data computing. It improves performance of various applications dramatically. However, in certain Spark use cases, the bottleneck is in the I/O stack. In this talk, we will introduce Tachyon, a distributed memory-centric storage system. In addition, we will talk about several production use cases where Tachyon further improves Spark applications’ performance by orders of magnitude.

Fast big data analytics and machine learning using Alluxio and Spark in Baidu

Strata+Hadoop World San Jose *

A few months ago, Baidu deployed Alluxio to accelerate its big data analytics workload. Bin Fan and Haojun Wang explain why Baidu chose Alluxio, as well as the details of how they achieved a 30x speedup with Alluxio in their production environment with hundreds of machines. Based on the success of the big data analytics engine, Baidu is currently expanding the Alluxio and Spark infrastructure to accelerate other applications, such as machine learning.

Alluxio (formerly Tachyon): New Features and Demos

Bay Area Meetup *

The big data ecosystem is moving with massive energy: customers in healthcare, retail, transportation, and other fields are benefiting significantly from the business insights derived from their data. As data growth continues, storage technologies and distributed memory systems are becoming even more important for real-time decision making and insight discovery. Intel is excited to work with developer communities on Alluxio and to optimize Alluxio solutions on Intel platforms. In this talk, Ziya will discuss Intel’s optimization work in this area, its open source contributions, and industry use cases.

Alluxio (formerly Tachyon): Open Source Memory Speed Virtual Distributed Storage System

Data by the Bay San Francisco *

The goal is to make Alluxio accessible to an even wider set of users through a focus on security, new language bindings, and further increased stability. In addition, the team is working on new APIs to allow applications to access data more efficiently and manage data across different under storage systems.

1st Beijing Alluxio (Formerly Tachyon) Meetup

Beijing Meetup *

Through active community development over the past year, Alluxio has greatly improved its read and write performance, scalability, and user experience. In terms of functionality, Alluxio has also added a number of new features, such as scalable tiered storage, transparent UFS data reading and writing, and unified namespaces. These features bring more value to Alluxio users and make cluster storage management more efficient and convenient.

Past, Present and Future of Alluxio [Chinese]

Nanjing Big Data Meetup *

The Alluxio project has greatly improved system performance, scalability, and user experience, and has added a series of new features, including scalable tiered storage, transparent UFS data reading and writing, and unified namespaces, all of which make Alluxio easier to use. At the same time, the Alluxio ecosystem has expanded to support different storage systems and computing frameworks. Alluxio now supports a variety of storage systems, including Amazon S3, Google Cloud Storage, Gluster, Ceph, HDFS, NFS, and OpenStack Swift, as well as big data processing frameworks such as Spark, MapReduce, and Flink. These integrations allow Alluxio to manage increasingly complex data.

Past, Present and Future of Alluxio [Chinese]

Shanghai Meetup *

The Alluxio project has greatly improved system performance, scalability, and user experience, and has added a series of new features, including scalable tiered storage, transparent UFS data reading and writing, and unified namespaces, all of which make Alluxio easier to use. At the same time, the Alluxio ecosystem has expanded to support different storage systems and computing frameworks. Alluxio now supports a variety of storage systems, including Amazon S3, Google Cloud Storage, Gluster, Ceph, HDFS, NFS, and OpenStack Swift, as well as big data processing frameworks such as Spark, MapReduce, and Flink. These integrations allow Alluxio to manage increasingly complex data.

Alluxio (formerly Tachyon): The journey thus far and the road ahead

Strata+Hadoop World New York *

The goal is to make Alluxio accessible to an even wider set of users through a focus on security, new language bindings, and further increased stability. In addition, the team is working on new APIs to allow applications to access data more efficiently and manage data across different under storage systems.