Loading…
This event has ended. View the official site or create your own event → Check it out
This event has ended. Create your own
View analytic
Thursday, April 16 • 10:00am - 10:50am
Hive Now Sparks - Chao Sun, Cloudera

Sign up or log in to save this to your schedule and see who's attending!

Apache Hive has become de facto standar SQL on big data in Hadoop ecosystem. With its open architecture and backend neutrality, Hive queries can run on MapReduce and Tez. On the other hand, Apache Spark as an open-source data analytics cluster computing framework has gained significant momentum recently. Marrying the two, that is, providing a new execution engine to Hive, has many benefits for Spark users and Hive users.
Hive on Spark (HIVE-7292) is probably the most watched project in Hive with 100+ watchers. The effort has attracted developers from both communities, around globe, and from brand companies such as Intel, IBM, Cloudera, and MapR. This presentation will talk about the motivation, design principles, architecture, challenges, and current status of the project followed by a live demo.

Speakers
CS

Chao Sun

Chao Sun is currently a Software Engineer at Cloudera, Inc. He has been working on Hive on Spark project since joining the company in mid 2014. Prior to that, he was a PhD student in Computer Science at U​W-Milwaukee, focusing on type systems​ and ​mechanized proofs​.​


Thursday April 16, 2015 10:00am - 10:50am
Texas VI

Attendees (27)