DATA ORCHESTRATION SUMMIT


DECEMBER 8-9, 2020 | VIRTUAL

The virtual event for everyone building cloud-native data and AI platforms.

This is an open source community conference focused on the key data engineering challenges and solutions around building cloud-native data and AI platforms using the latest technologies such as Alluxio, Apache Spark, Apache Airflow, Presto, TensorFlow, and Kubernetes. The Summit brings together data engineers, architects, cloud engineers, data scientists, and industry thought leaders who are solving data problems at the intersection of cloud, AI/ML, and data.

SPEAKERS

Ion Stoica

RISELab

Professor, EECS Dept. at UC Berkeley

Haoyuan Li

Alluxio

Founder, CEO

Rohit Jain

Facebook

Software Engineer

Mike Fagan

Comcast

Distinguished Architect

Sandipan Chakraborty

Rakuten

Director of Engineering

Katarzyna Orzechowska

ING Bank

Data Scientist

Serena Wang

Electronic Arts

Software Engineer

Dmytro Dermanskyi

WalkMe

Data Engineering Lead

Jiawei Zhang

Robinhood

Data Infrastructure Engineer

Roderick Yao

Google Cloud

Strategic Cloud Engineer

Frank Hu

ByteDance

Data Platform Tech Lead

Juraj Pohanka

Datasapiens

CTO

Calvin Jia

Alluxio

Founding Engineer, Product

Baolong Mao

Tencent

Sr. System Engineer

Yang Che

Alibaba

Staff Engineer

Tom Panozzo

Aunalytics

Analytics Cloud Chief Technology Officer

Bin Fan

Alluxio

Founding Engineer, VP of Open Source

Yichuan Huang

Robinhood

Data Platform Engineer

Adit Madan

Alluxio

Product Manager

Gabriel Menegatti

Simbiose Ventures

Director

Mariusz Derela

ING Bank

DevOps Engineer

Wenjun Tao

JD.com

Danny Linden

Ryte

Chapter Lead Software Engineer

Trevor Zhang

T3Go

Big Data Sr. Engineer

Speak at our conference! CFP is open.

SCHEDULE-AT-A-GLANCE

More program details coming soon.

Times are listed in Pacific Time (PT)

Welcome to our second annual Data Orchestration Summit!

KEYNOTES

Details coming soon…
Speakers:
Haoyuan (H.Y.) Li is the Founder and CEO of Alluxio. He graduated with a Computer Science Ph.D. from the AMPLab at UC Berkeley. At the AMPLab, he co-created and led Alluxio (formerly Tachyon), an open source virtual distributed file system. Before UC Berkeley, he received an M.S. from Cornell University and a B.S. from Peking University, both in Computer Science.

Details coming soon…
Speakers:
TBD
Details coming soon…
Speakers:
TBD

Refill your coffee and get ready for what’s next!

CLOUD NATIVE JOURNEYS – MODERNIZING DATA PLATFORMS

Details coming soon…
Speakers:
After receiving her Ph.D. from UMass Boston in 2019, Teng started her career as a software engineer at Electronic Arts in the Data Platform & AI Department. Teng mainly focuses on building high-efficiency data processing platforms and high-impact business intelligence services, drawing on strong hands-on skills in architecture, design, implementation, and operations.

Details coming soon…
Speakers:
Juraj leads technical development, covering application development, data engineering, and data science. He studied pure and applied mathematics at the Czech Technical University in Prague. Juraj’s past experience includes Deloitte, where he was a financial modeler, and Deutsche Boerse, where he was a software developer. Juraj is passionate about modern technologies and mathematical models.

Koen Michiels
Details coming soon…
Speakers:
TBD
Details coming soon…
Speakers:
Katarzyna Orzechowska
Mariusz Derela is a DevOps Engineer at ING focused on security. As a member of the Hunt Squad, he is responsible for providing new solutions that can improve security processes at ING.
Details coming soon…
Speakers:
Dima Dermanskyi is a Data Engineering Lead at WalkMe, where he is responsible for the development and operation of the data-warehousing and computation infrastructure powering WalkMe’s analytics platform. He is obsessed with building data applications and has a long track record of developing distributed systems in domains such as telecom and e-commerce. Dima holds a master’s degree in computer science from Kyiv Polytechnic Institute.

Lunch, it’s time to grub!

ORCHESTRATING DATA FOR MACHINE LEARNING IN KUBERNETES

Details coming soon…
Speakers:
Frank Hu works as Tech Lead for the Data Platform team at ByteDance US, with a focus on OLAP distributed systems. Before joining ByteDance, Frank led the Data Infrastructure team at Optimizely Inc and worked as the founding engineer at MailTime Inc. He holds a Bachelor of Engineering in Information Engineering from The Chinese University of Hong Kong.

Details coming soon…
Speakers:
Yang Che
Details coming soon…
Speakers:
TBD
Join us for some virtual festivities!

KEYNOTES

Details coming soon…
Speakers:
Ion Stoica is a Professor in the EECS Department at the University of California at Berkeley. He currently leads the RISELab. He does research on cloud computing and networked computer systems. Past work includes Apache Spark, Apache Mesos, Tachyon, Chord DHT, and Dynamic Packet State (DPS). He is an ACM Fellow and has received numerous awards, including the SIGOPS Hall of Fame Award (2015), the SIGCOMM Test of Time Award (2011), and the ACM Doctoral Dissertation Award (2001). He co-founded Anyscale in 2019 to commercialize technologies for distributed Python, especially for AI applications; Databricks in 2013 to commercialize technologies for Big Data processing; and Conviva Networks in 2006 to commercialize technologies for large-scale video distribution.

Alluxio core maintainer Calvin Jia will share some of the hottest use cases in Alluxio 2 and discuss the future directions of the project being pioneered by Alluxio and the community.
Speakers:
Calvin Jia is the top contributor to the Alluxio project. He has been involved as a core maintainer and release manager since the early days, when the project was known as Tachyon. Calvin has a B.S. from UC Berkeley.
Details coming soon…
Speakers:
Sandipan Chakraborty works as Director of Engineering in the Global Data Office of Rakuten. He and his team are responsible for developing and maintaining the “Global Shared Data Analytics Platform” for Rakuten Group. The platform today provides analytics and data services to more than 80 businesses in Rakuten Group. Sandi has spent more than 20 years working on various aspects of data, including data integration, distributed systems, big data, and BI.

Grab some coffee and get ready for what’s next!

HYBRID CLOUD ANALYTICS AND AI

Alluxio Product Manager Adit Madan introduces the newly launched Data Orchestration Hub, a management console that unifies data lakes by enabling analytics and machine learning on data sources across regions. Easy-to-use wizards connect compute engines, such as Presto or Spark, to data sources across data centers or from a public cloud to a private data center. The new service provides a central management view for configuration and monitoring.
Speakers:
Adit is a product manager at Alluxio. He is also a core maintainer and PMC member of the Alluxio open source project. Before joining Alluxio, he was a research engineer at Hewlett-Packard Laboratories. His experience is in distributed systems, storage systems, and large-scale data analytics. He has an M.S. from Carnegie Mellon University and a B.S. from IIT.

Describe the benefits of Alluxio and the methods by which it enables secure data access in Comcast’s dx hybrid data cloud.
  • Review the data access challenges and tradeoffs in hybrid cloud
  • Review our hybrid architecture and the important role Alluxio plays
  • Provide performance metrics to highlight the benefits
Speakers:
Mike Fagan
Details coming soon…
Speakers:
TBD

Details coming soon…
Speakers:
TBD

Lunch, it’s time to grub!

HIGH PERFORMANCE SQL ANALYTICS

Details coming soon…
Speakers:
TBD

Details coming soon…
Speakers:
Jiawei Zhang
Yichuan Huang
Details coming soon…
Speakers:
Danny Linden

Details coming soon…
Speakers:
TBD

Details coming soon…
Speakers:
TBD

This talk introduces T3Go’s solution for building an enterprise-level data lake based on Apache Hudi and Alluxio, and shows how to use Alluxio to accelerate reads and writes on the data lake when compute and storage are separated.
Speakers:
Trevor Zhang is a Big Data Sr. Engineer at T3. His work at T3 focuses on the data lake and the surrounding big data ecosystem. He has extensive experience in big data analysis and computing. Trevor is also a contributor to many open source projects, including Apache Hudi, Apache Zeppelin, and Alluxio.
JD.com is one of the largest e-commerce corporations. The JD.com big data platform has tens of thousands of nodes and tens of petabytes of offline data, which require millions of Spark and MapReduce jobs to process every day. Presto is the main query engine: thousands of machines run as Presto nodes, and Presto plays an important role in in-place analysis and BI tools. Meanwhile, Alluxio is deployed to improve the performance of Presto. The combination of Presto and Alluxio at JD.com benefits many engineers and analysts.
Speakers:
Wenjun Tao
Thank you for joining us virtually this year!

SPONSORS

SPONSORSHIP

This event is a great opportunity to enhance your visibility as a thought leader, showcase your company’s technology leadership, reach potential customers, and recruit top technical talent.

The Instant sponsor tier includes your logo on the website and acknowledgement on social channels. Click sponsor to access now. For all other tiers, please contact organizers@alluxio.com.