Back To Schedule
Wednesday, June 7 • 3:00pm - 5:00pm
Data Science Bootcamp at Spark Summit: Analyze Data and Build a Dashboard with Spark, Notebooks and PixieDust

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Feedback form is now closed.


Working with Apache Spark and Jupyter Notebooks, but want to be more efficient? You need a dash of PixieDust, a new open source library. PixieDust speeds data manipulation and display with features like auto-visualization of Spark DataFrames, real-time Spark Job progress monitoring directly from the Notebook, seamless integration to cloud services, and automated local install of Python and Scala kernels running with Spark. 

In this two-hour workshop, IBM Distinguished Engineer David Taieb will walk through how to use PixieDust with Spark and Notebooks to analyze open data around traffic accidents in San Francisco, and then build charts and maps to discover insights. David will then show how to build a dashboard that drills down into specific areas and how to combine multiple data sources like crime or speeding zones to extract even more insights.

Regardless of your skill level with Spark and Notebooks, you'll be able to attend this session. Please bring your own network-enabled computer, as you will need to work out of a web browser for the hands-on portion of the workshop.

Refreshments will be served.

Wednesday June 7, 2017 3:00pm - 5:00pm PDT
Galvanize 44 Tehama St.