ApacheCon NA 2015 has ended
Back To Schedule
Tuesday, April 14 • 11:40am - 12:30pm
Pulsar: Realtime Analytics at Scale Leveraging Kafka, Hadoop and Kylin - Tony Ng, eBay

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Enterprises are Increasingly demanding realtime analytics and insights to power use cases like personalization, monitoring and marketing. We will present Pulsar, a realtime streaming system used at eBay which can scale to millions of events per second with high availability and SQL-like language support, enabling realtime data enrichment, filtering and multi-dimensional metrics aggregation.

We will discuss how Pulsar integrates with a number of open source Apache technologies like Kafka, Hadoop and Kylin (Apache incubator) to achieve the high scalability, availability and flexibility. We use Kafka to replay unprocessed events to avoid data loss and to stream realtime events into Hadoop enabling reconciliation of data between realtime and batch. We use Kylin to provide multi-dimensional OLAP capabilities.


Tony Ng

Tony Ng is a Director of Engineering at eBay, Inc where he leads the User Behavior Analytics, Experimentation and Marketing Platform products. At eBay, Tony has been involved in building eBay's core platforms and services, including cloud, big data analytics, real-time streaming... Read More →

Tuesday April 14, 2015 11:40am - 12:30pm CDT
Texas V

Attendees (0)