ApacheCon NA 2015 has ended

Science
Monday, April 13
 

10:45am

Getting Started with Apache OODT - Tom Barber, Meteorite Consulting
Apache OODT is a modular, distributed data processing framework designed to help collect, process, and catalogue data.
In our getting started tutorial, we will take a look at OODT, its various modules, how to build and deploy OODT using Radix or Docker, and how to modify and extend OODT to fit your data processing and storage needs. We will also look at how to process data, and distribute it across multiple servers.

Speakers
Tom Barber

Technical Director, Spicule LTD
Tom Barber is the director of Meteorite BI and Spicule BI. A member of the Apache Software Foundation and regular speaker at ApacheCon, Tom has a passion for simplifying technology. The creator of Saiku Analytics and open source stalwart, when not working for NASA, Tom currently deals...


Monday April 13, 2015 10:45am - 11:35am
Texas II

2:00pm

Apache Tika: Cool Insights into Polar Data - Annie Burgess, USC
Climate change is amplified in the Polar Regions. Polar amplification is captured via space and airborne remote sensing, in-situ measurement, and climate modeling. While simply finding these data is often a challenge, this talk will focus on what to do with the data (and metadata) once it is found! Here we present our current efforts using Apache Tika to help us ask some big questions about Arctic and Antarctic data. Apache Tika is an open source framework for metadata exploration, automatic text mining, and information retrieval. Over the past year, we have expanded Apache Tika to parse, extract, and analyze common data formats used in Arctic and Antarctic research making them more easily accessible, searchable, and retrievable by all major content management systems. Come to this talk to hear about how we’ve expanded Tika and what cool new insights we have into polar data!

Speakers
Annie Burgess

Lab Director, ESIP


Monday April 13, 2015 2:00pm - 2:50pm
Texas II
 
Tuesday, April 14
 

11:40am

Userfriendly Workflows with Apache OODT - Tom Barber, Meteorite Consulting
Apache OODT is a data processing platform made up of a number of modules. While data is being ingested, or once it has been ingested, you can process it through OODT workflows.

Workflows are the OODT data transformation pipeline and allow you to pre- or post-process the data.

We'll be looking at OODT workflows: how to build, extend, deploy, and optimise them so that the wider community can use them to process data in both large-scale and small-scale data processing pipelines.

Speakers
Tom Barber

Technical Director, Spicule LTD


Tuesday April 14, 2015 11:40am - 12:30pm
Texas II

2:00pm

Pharmacovigilance - Big Data for RealTime Drug Monitoring - Pei Chen, Apache cTAKES & Jay Vyas, Red Hat
Real-Time Drug Safety Monitoring in the Cloud: collecting and harnessing knowledge from large public data sources in real time to monitor and detect adverse drug effects. In this presentation, Pei and Jay will demonstrate an entire system that uses Apache Bigtop, OpenStack, Spark, cTAKES, and Cassandra to proactively monitor and detect adverse drug events from Twitter data.

Speakers
Pei Chen

Pei Chen is VP of the Apache cTAKES project. He is also a lead application development specialist at the Informatics Program at Boston Children’s Hospital/Harvard Medical School and Co-Founder of Wired Informatics. Mr. Pei’s interests lie in building practical applications...

Jay Vyas

Cloud Native Engineering, VMware
Jay Vyas is a Kubernetes Engineer at VMware, and has worked on K8s since its inception in 2015 as an open source project. Jay has also worked on the business side of Kubernetes, moving Black Duck Software's 100 million dollar business to an 80% SaaS-based model, entirely powered by Kubernetes...


Tuesday April 14, 2015 2:00pm - 2:50pm
Texas II

5:20pm

Data Stream Algorithms in Apache Storm and R - Radek Maciaszek, Data Mine Lab
Streaming data presents new challenges for statistics and machine learning on extremely large data sets. Tools such as Apache Storm, a stream processing framework, can power a range of data analytics but lack advanced statistical capabilities. In this talk I will discuss developing streaming algorithms with the flexibility of both Storm and R, a statistical programming language.

I will address the critical issues of why and how to use Storm and R to develop streaming algorithms; in particular I will focus on:
• Streaming algorithms
• Online machine learning algorithms
• Use cases showing how to process hundreds of millions of events a day in (near) real time

Speakers
Radek Maciaszek

Data Mine Lab
I am a founder of Data Mine Lab, a big-data consultancy. The company specialises in large-scale data number crunching and cloud computing. Currently I work as a data scientist contractor with a London-based hedge fund. I share my passion for data science by leading a number of training...


Tuesday April 14, 2015 5:20pm - 6:10pm
Texas II
 
Wednesday, April 15
 

9:00am

Apache Airavata Overview and Roadmap - Suresh Marru, Apache Software Foundation
Apache Airavata is software for providing services to manage scientific applications on a wide range of remote computing resources. Airavata can be used both by individual scientists to run scientific workflows and by communities of scientists through Web browser interfaces. Airavata is composed of several components (Registry, Orchestrator, Application Factory, Workflow Interpreter, Messenger, Credential Store) that implement these capabilities. The Airavata community is in the process of rearchitecting the Airavata software to serve as the basis of a multi-tenanted, elastically scalable, fault-tolerant Platform as a Service for our community. This introduces several challenges to the current architecture as well as opportunities to leverage and collaborate with other Apache projects. We discuss these experiences and future directions.

Speakers
Suresh Marru

Member, Indiana University
Suresh Marru is a Member of the Apache Software Foundation and is the current PMC chair of the Apache Airavata project. He is the deputy director of the Science Gateways Research Center at Indiana University. Suresh focuses on research topics at the intersection of application domain...


Wednesday April 15, 2015 9:00am - 9:50am
Texas II