Wednesday, April 15 • 10:00am - 10:50am
From MapReduce to Spark with Apache Crunch - Micah Whitacre, Cerner Corporation

With companies having made heavy investments in MapReduce, the emergence of Apache Spark as a new processing platform is both tempting and daunting. Refactoring code or altering processing steps can be a significant investment. The Apache Crunch project can ease the transition through its built-in support for reusing code in both execution environments. Teams can incrementally migrate their processing workflows, or choose the appropriate execution engine for each use case, while still relying on the common set of concepts that Apache Crunch provides. The presentation will cover the basics of Apache Spark, how to reuse the same code in both MapReduce and Spark, and the differences between using Apache Crunch and plain Apache Spark.

Speakers
Micah Whitacre

Software Architect, Cerner Corporation
Micah is a committer on the Apache Crunch project as well as a Software Architect for Cerner Corporation, a leading provider of healthcare technology. For almost a decade he has worked on building infrastructure and reusable assets. In the last few years his focus has shifted towards enabling the adoption of Big Data technologies at Cerner, helping to build infrastructure for ingestion of Big Data and efficient processing in both a batch and...


Wednesday April 15, 2015 10:00am - 10:50am
Texas VI