ApacheCon NA 2015 has ended
Back To Schedule
Thursday, April 16 • 10:00am - 10:50am
Hive Now Sparks - Chao Sun, Cloudera

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Apache Hive has become de facto standar SQL on big data in Hadoop ecosystem. With its open architecture and backend neutrality, Hive queries can run on MapReduce and Tez. On the other hand, Apache Spark as an open-source data analytics cluster computing framework has gained significant momentum recently. Marrying the two, that is, providing a new execution engine to Hive, has many benefits for Spark users and Hive users.
Hive on Spark (HIVE-7292) is probably the most watched project in Hive with 100+ watchers. The effort has attracted developers from both communities, around globe, and from brand companies such as Intel, IBM, Cloudera, and MapR. This presentation will talk about the motivation, design principles, architecture, challenges, and current status of the project followed by a live demo.


Chao Sun

Chao Sun is currently a Software Engineer at Cloudera, Inc. He has been working on Hive on Spark project since joining the company in mid 2014. Prior to that, he was a PhD student in Computer Science at U​W-Milwaukee, focusing on type systems​ and ​mechanized proofs​.​

Thursday April 16, 2015 10:00am - 10:50am CDT
Texas VI

Attendees (0)