ApacheCon NA 2015 has ended
Tuesday, April 14 • 5:20pm - 6:10pm
Data Stream Algorithms in Apache Storm and R - Radek Maciaszek, Data Mine Lab

Streaming data presents new challenges for statistics and machine learning on extremely large data sets. Tools such as Apache Storm, a stream processing framework, can power a range of data analytics but lack advanced statistical capabilities. In this talk I will discuss developing streaming algorithms that combine the flexibility of Storm with that of R, a statistical programming language.

I will address the critical issues of why and how to use Storm and R to develop streaming algorithms; in particular I will focus on:
• Streaming algorithms
• Online machine learning algorithms
• Use cases showing how to process hundreds of millions of events a day in (near) real time


Radek Maciaszek

Data Mine Lab
I am a founder of Data Mine Lab, a big-data consultancy. The company specialises in large-scale data number crunching and cloud computing. Currently I work as a contract data scientist with a London-based hedge fund. I share my passion for data science by leading a number of training...

Tuesday April 14, 2015 5:20pm - 6:10pm CDT
Texas II
