ApacheCon NA 2015 has ended
Thursday, April 16 • 9:00am - 9:50am
Apache Spark in 2015 and Beyond - Reynold Xin, Databricks

In this talk, I will give a quick introduction to Apache Spark, one of the most widely used cluster compute engines and Big Data frameworks. I will cover some of the important developments in the project, including:
  • our efforts to scale up Spark, which enabled us to set a new world record in 100TB sorting, beating the previous Hadoop MapReduce record by 3x while using 1/10 of the nodes
  • our efforts to expand the Spark API to make it easier to use for data scientists and application developers
  • and last but not least, a number of efforts, including Spark Packages, aimed at facilitating better community contribution at scale


Reynold Xin

co-founder of Databricks
Reynold is a co-founder and the Chief Architect of Databricks.

Thursday April 16, 2015 9:00am - 9:50am CDT
Texas VI