Apache Hadoop and its large family of related projects are a critical part of large-scale Big Data analytics in the cloud. In this talk we will discuss feedback from users of HDInsight, our Apache Hadoop-based service. We will pull back the covers on this service and delve into how we’ve integrated open source components such as Hadoop, Pig, Hive, Oozie, Tez, ZooKeeper and others.
Users of HDInsight have provided valuable feedback, which has led our team to contribute directly to some of these project communities. As you would expect, our users have also benefited from the contributions of others. We’ll look at some of our contributions to projects like Hadoop, Hive and YARN and, more importantly, we’ll examine how you might leverage this work in your Hadoop workloads.
Finally, we’ll look at how our partnerships with other community members have resulted in both easier deployment of Big Data apps to the cloud and accelerated innovation in these Apache projects.