big data Archives

Recap: AWS Summit New York

July 22, 2019 By Amelia Wong

Alluxio was a proud sponsor and exhibitor at the AWS Summit in New York. If you weren’t able to attend, here are the highlights.

Tech Talk: Accelerate and Scale Big Data Analytics with Disaggregated Compute and Storage

July 17, 2019

The ever-increasing challenge of processing and extracting value from exploding data volumes with AI and analytics workloads makes a memory-centric architecture with disaggregated storage and compute more attractive. This decoupled architecture enables users to innovate faster and scale on demand. Enterprises are also increasingly looking to object stores to power their big data and machine learning workloads cost-effectively. However, object stores provide neither big-data-compatible APIs nor the required performance.

In this webinar, the Intel and Alluxio teams will present a proposed reference architecture using Alluxio as the in-memory accelerator for object stores to enable modern analytical workloads such as Spark, Presto, TensorFlow, and Hive. We will also present a technical overview of Alluxio.
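To make the "in-memory accelerator" idea concrete, here is a toy read-through cache placed in front of a slow key/value store. It is only a sketch of the general caching pattern, not Alluxio's implementation or API: the class name `ReadThroughCache`, the `fetch` callable, and the byte-based LRU eviction are all assumptions made for illustration.

```python
from collections import OrderedDict


class ReadThroughCache:
    """Toy in-memory cache in front of a slower key/value store.

    Hypothetical illustration only; not Alluxio's implementation.
    """

    def __init__(self, fetch, capacity_bytes):
        self._fetch = fetch              # callable: key -> bytes (the slow store)
        self._capacity = capacity_bytes  # maximum bytes held in memory
        self._used = 0                   # bytes currently cached
        self._entries = OrderedDict()    # key -> bytes, least recently used first

    def get(self, key):
        if key in self._entries:
            # Cache hit: mark the entry as most recently used.
            self._entries.move_to_end(key)
            return self._entries[key]

        # Cache miss: read from the backing store.
        data = self._fetch(key)

        # Objects larger than the whole cache are returned but not cached.
        if len(data) <= self._capacity:
            # Evict least recently used entries until the new object fits.
            while self._used + len(data) > self._capacity:
                _, evicted = self._entries.popitem(last=False)
                self._used -= len(evicted)
            self._entries[key] = data
            self._used += len(data)
        return data
```

Repeated reads of the same key are served from memory, so the backing store is contacted only on a miss. A real system would also need to handle concurrency, partial reads, and invalidation, which this sketch leaves out.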

Tags: big data, compute storage separation, hive, intel, object stores, spark, tech talk, tensorflow

Turn cloud storage or HDFS into your local file system for faster AI model training with TensorFlow

July 3, 2019 By Lu Qiu and Bin Fan

This article presents a different approach to making distributed file systems like HDFS or cloud storage systems look like a local file system to data processing frameworks: the Alluxio POSIX API. To explain the approach better, we use the TensorFlow + Alluxio + AWS S3 stack as an example.
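The practical upshot of a POSIX-style mount is that training code can use ordinary file calls instead of a storage-specific client. The sketch below assumes the remote data has already been mounted at a local directory; the path `/mnt/alluxio-fuse`, the `.tfrecord` suffix, and both helper functions are hypothetical and chosen only for illustration. It uses nothing but the Python standard library.

```python
from pathlib import Path

# Hypothetical mount point; the real path depends on how the mount was set up.
MOUNT_POINT = "/mnt/alluxio-fuse"


def list_training_files(mount_point, suffix=".tfrecord"):
    """Return the sorted paths of regular files under mount_point ending in suffix."""
    root = Path(mount_point)
    return sorted(str(p) for p in root.rglob("*" + suffix) if p.is_file())


def read_file(path, chunk_size=1 << 20):
    """Read a whole file in fixed-size chunks with plain open() and read()."""
    chunks = []
    with open(path, "rb") as f:
        while True:
            chunk = f.read(chunk_size)
            if not chunk:  # an empty read means end of file
                break
            chunks.append(chunk)
    return b"".join(chunks)
```

Because these are plain local paths, the list returned by `list_training_files(MOUNT_POINT)` could be handed to whatever input pipeline the training framework provides, with no change to how files are opened.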

Recap: Presto Summit SF 2019

July 1, 2019 By Amelia Wong

Alluxio is a proud sponsor and exhibitor at the Presto Summit in San Francisco.
What’s Presto Summit? It’s the leading Presto conference co-organized by our partner Starburst Data and the Presto Software Foundation.

O’Reilly AI Conference Keynote: Data Orchestration for AI, Big Data, and Cloud

June 28, 2019

Haoyuan Li’s keynote at O’Reilly Beijing discusses open source data orchestration and the value of Alluxio as rising trends drive the need for a new architecture. Four big trends drive this need: separation of compute and storage, hybrid and multi-cloud environments, the rise of object stores, and self-service data across the enterprise.

Tags: big data, cloud, cloud object storage, cloud storage, compute storage separation, conference, data, data orchestration, hybrid cloud, multi cloud, on-prem object storage, storage

Accelerate and Scale Big Data Analytics with Disaggregated Compute and Storage

Alluxio Tech Talk * July 16, 2019

In this tech talk, the Intel and Alluxio teams will present a proposed reference architecture using Alluxio as the in-memory accelerator for object stores to enable modern analytical workloads such as Spark, Presto, TensorFlow, and Hive.

Alluxio at Beijing Meetup

June 25, 2019

Haoyuan Li presents at the Beijing Meetup on open source data orchestration and the value of Alluxio as rising trends drive the need for a new architecture. Four big trends drive this need: separation of compute and storage, hybrid and multi-cloud environments, the rise of object stores, and self-service data across the enterprise.

Tags: big data, cloud, cloud storage, compute storage separation, data, data orchestration, hybrid cloud, meetup, multi cloud, storage

RocksDB Meetup at Twitter

Bay Area Meetup * July 11, 2019

Twitter SF is hosting 2019’s half-yearly RocksDB Meetup on July 11th, with speakers from Twitter, Facebook, and the community.

Open Source Fest with Alluxio & Ignite + How to Accelerate Analytic Queries!

Bay Area Meetup * June 24, 2019

Join us June 24 in Menlo Park for our next meetup! We’ll have 3 valuable talks, a delicious BBQ dinner and amazing summertime-themed raffle prizes! This free event is sponsored by GridGain Systems and Oracle.

Tag: big data