Alluxio Blog

Alluxio Launches Industry’s First System to Unify Data at Memory Speed

Our mission at Alluxio is to unify data at memory speed. Today we’re excited to unveil our first products, which enable organizations to turn data into value with unprecedented ease, flexibility, and speed. We believe these products will substantially advance Alluxio for both the community and our enterprise customers.
In this blog, I will share with you the big data challenges application developers and business line owners face today, and show how Alluxio addresses these challenges.

Accelerating Data Analytics on Ceph Object Storage with Alluxio

This is an excerpt from the Accelerating Data Analytics on Ceph Object Storage with Alluxio whitepaper.
As the volume of data collected by enterprises has grown, there is a continual need to find efficient storage solutions. Owing to its simplicity, scalability, and cost-efficiency, object storage, including Ceph, has become an increasingly popular alternative to traditional file systems. In most cases the object storage system, whether on-premises or in the cloud, is decoupled from the compute nodes where analytics run. There are several benefits to this separation.

Using Alluxio to Improve the Performance and Consistency of HDFS Clusters

Alluxio is the world’s first memory-speed virtual distributed storage system that bridges applications and underlying storage systems, providing unified data access orders of magnitude faster than existing solutions. The Hadoop Distributed File System (HDFS) is a distributed file system for storing large volumes of data. HDFS popularized the paradigm of bringing computation to data and the co-located compute and storage architecture.
In this blog, we highlight two key benefits Alluxio brings to a compute cluster co-located with HDFS.

Accelerating On-Demand Data Analytics with Alluxio

This is an excerpt from the Accelerating On-Demand Data Analytics with Alluxio whitepaper, which includes a detailed implementation guide in addition to this high-level overview.
In the Big Data world, it is often the case that only a subset of the total data is relevant for answering the question at hand. As a result, the total cost of ownership for long-running analytics clusters is high while utilization is low, especially in an architecture that co-locates compute and storage.

What’s new in Alluxio 1.1 Release

The Alluxio 1.1 release includes many great features and improvements from the community. Alluxio would not be what it is today without the growing open source community, and we would like to thank everyone involved in this project.
In this post, we highlight a few of the developments in this release: performance improvements, access control features, and usability and integration improvements.

Introducing Alluxio Open Source Project Governance

We’re excited to announce that the Alluxio open source project is adopting the Benevolent Dictator For Life (BDFL) model. The day-to-day management of the project will be carried out by the Project Management Committee (PMC). Within the PMC, there are Maintainers, who are responsible for upholding the quality of the code in their respective components.
We believe that adopting a project management mechanism will further accelerate the project’s growth and enable contributors around the world to easily collaborate to bring exciting new features and improvements to Alluxio.

Founder Blog | Alluxio, formerly Tachyon, is Entering a New Era with 1.0 release

Today, we are very excited to announce the 1.0 release of Alluxio, the world’s first memory-centric virtual distributed storage system, which unifies data access and bridges computation frameworks and underlying storage systems. Applications only need to connect with Alluxio to access data stored in any underlying storage system. Additionally, Alluxio’s memory-centric architecture enables data access orders of magnitude faster than existing solutions.