ApacheCon NA 2015 has ended
Back To Schedule
Wednesday, April 15 • 10:00am - 10:50am
From MapReduce to Spark with Apache Crunch - Micah Whitacre, Cerner Corporation

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

With companies having made heavy investments in MapReduce the emergence of Apache Spark as a new processing platform is both tempting and daunting. Refactoring code or altering processing steps can be a significant investment. The Apache Crunch project can help with the transition utilizing its built in support for reusing code in both execution environments. Teams can make incrementally migrate their processing workflows or utilize the appropriate execution engine depending on their use case while still utilizing a common set of concepts provided by Apache Crunch. The presentation will cover the basics of Apache Spark, how to reuse the same code in both MapReduce and Spark, as well as differences with using Apache Crunch over plain Apache Spark.

avatar for Micah Whitacre

Micah Whitacre

Software Architect, Cerner Corporation
Micah is a committer on the Apache Crunch project as well as a Software Architect for Cerner Corporation, a leading provider of healthcare technology. For almost a decade he has worked on building infrastructure and reusable assets. In the last few years his focus has shifted towards... Read More →

Wednesday April 15, 2015 10:00am - 10:50am CDT
Texas VI

Attendees (0)