Sign up or log in to bookmark your favorites and sync them to your phone or calendar.

Monday, June 5

6:00pm PDT

Let's talk SMACK!

Let's talk some SMACK! Join us for a fantastic evening with some of the experts of the SMACK stack technologies. For this meetup, we'll be doing things a little differently and running it like a live reddit AMA (ask me anything). We'll have a panelist of experts from the technologies in the SMACK stack (Apache Spark,Mesosphere/Apache Mesos, Akka, Apache Cassandra, ApacheKafka) to answer any and all questions you may have, so please come prepared with some good questions!


For full details (full meetup description, who the panelists are, etc) remember to RSVP HERE.  Hope to see you all there!

Monday June 5, 2017 6:00pm - 8:00pm PDT
Mesosphere 88 Stevenson St., San Francisco
Tuesday, June 6

6:00pm PDT

Spark in Adtech
Spark in Adtech meetup, run by SF Spark

Tuesday June 6, 2017 6:00pm - 9:00pm PDT
Chartboost 85 2nd St, San Francisco
Wednesday, June 7

1:00pm PDT

Scala Makers: Office Hours at Galvanize
These are office hours for Scala -- anybody can come and ask questions of any kind. Our experts work with Scala daily and teach it too. Stop by and ask! And at 4pm, walk over to the Leadership Kickoff at IBM Watson. Run as SF Scala meetup, RSVP there: https://www.meetup.com/SF-Scala/events/240552972/


Jacob Baumbach

Data Scientist at Capital One


Software Engineer at Performant Data
avatar for Sergei Winitzki

Sergei Winitzki

Senior Software Engineer, Workday Inc.
Theoretical physicist turned software engineer, passionate for functional programming, functional type theory, and declarative domain-specific languages.

Wednesday June 7, 2017 1:00pm - 4:00pm PDT
Galvanize 44 Tehama St.

3:00pm PDT

Data Science Bootcamp at Spark Summit: Analyze Data and Build a Dashboard with Spark, Notebooks and PixieDust


Working with Apache Spark and Jupyter Notebooks, but want to be more efficient? You need a dash of PixieDust, a new open source library. PixieDust speeds data manipulation and display with features like auto-visualization of Spark DataFrames, real-time Spark Job progress monitoring directly from the Notebook, seamless integration to cloud services, and automated local install of Python and Scala kernels running with Spark. 

In this two-hour workshop, IBM Distinguished Engineer David Taieb will walk through how to use PixieDust with Spark and Notebooks to analyze open data around traffic accidents in San Francisco, and then build charts and maps to discover insights. David will then show how to build a dashboard that drills down into specific areas and how to combine multiple data sources like crime or speeding zones to extract even more insights.

Regardless of your skill level with Spark and Notebooks, you'll be able to attend this session. Please bring your own network-enabled computer, as you will need to work out of a web browser for the hands-on portion of the workshop.

Refreshments will be served.

Wednesday June 7, 2017 3:00pm - 5:00pm PDT
Galvanize 44 Tehama St.

4:00pm PDT

The Future of Cognitive Business
This is the vision of the Cognitive Frameworks Festival events articulated by the industry leaders, driving AI adoption with real data, real customers, and real business, at global scale.

We run this event via Bay Area AI meetup.  You'll need to be registered there to attend!

We are fortunate to have industry leaders from

  • IBM
  • Salesforce
  • Cisco
  • Nvidia
  • Nitro
  • Lightbend
  • Mesosphere


4-5pm: BBQ party on the patio
5-6pm: Lightning talks from the industry leaders

6pm: Tehcnical Overvide by Sumit Gupta

From better fraud detection, to chat bots, to recommendation engines, deep learning is a new neural network method that was first popularized by its use in many everyday mobile services that we use like Google Voice, and Apple Siri. We will present the challenges that enterprises face in building real cognitive applications using deep learning based AI methods and discuss ways to solve them. We will also describe IBM's PowerAI deep learning software distribution that is an enterprise-ready binary distribution of popular frameworks like TensorFlow, CAFFE, Torch, Chainer, etc with support and services, that enables any business to get started with deep learning in minutes, rather than weeks.  

avatar for Mark Brewer

Mark Brewer

CEO of Lightbend, the company behind the Reactive Platform and open source frameworks like Akka, Play and Lagom.
avatar for Jose de Castro

Jose de Castro

CTO for APIs and Integrations, Collaboration Technology Group, Cisco
avatar for Simon Chan

Simon Chan

Senior Director, Einstein, Salesforce
@simonchannet Simon Chan is a Senior Director of Product Management for Salesforce Einstein where he oversees platform development and delivers products that empower anyone to build smarter apps with Salesforce. Simon is a product innovator and serial entrepreneur with more than... Read More →
avatar for Sumit Gupta

Sumit Gupta

VP, Cognitive Systems, IBM
Sumit Gupta is Vice President in the IBM Cognitive Systems, where he leads the product and business strategy for HPC, AI, and Analytics. He joined IBM two years ago from NVIDIA, where he was GM of the GPU accelerator business. 
avatar for Joel Horwitz

Joel Horwitz

Vice President, Ecosystem & Partnership Development, IBM
VP of Ecosystem and Partnerships, Digital Business, IBM
avatar for Devin Loftis

Devin Loftis

VP Engineering, ValiMail
avatar for Brad Murdoch

Brad Murdoch

VP, Corporate & Business Development, Lightbend
VP, Corporate & Business Development, Lightbend
avatar for Will Ramey

Will Ramey

Director, Developer Progams, NVIDIA
Happy to talk about deep learning and GPU-accelerated computing in general.
avatar for Tejada, Willie

Tejada, Willie

Chief Developer Advocate, Developer Ecosystems Group, IBM
As Chief Developer Advocate, Willie Tejada is responsible for IBM’s Developer Ecosystems Group – comprised of Digital Experience, Partnerships, Offer Management and Strategy. Developers are the catalyst for today’s software-enabled businesses, disruptive innovation and art... Read More →

Wednesday June 7, 2017 4:00pm - 7:00pm PDT
IBM West 505 Howard St, San Francisco

6:00pm PDT

Integrated Dataflow Processing with Spark and StreamSets
This event is run as the SF Hadoop meetup:


Join us for a meetup with StreamSets to discuss the latest Spark elements


6:00-6:30pm Food, drinks and networking

6:30-7:30pm Tech talk

7:30-8:00pm Networking 

Big data tools such as Hadoop and Spark allow you to process data at unprecedented scale, but keeping your processing engine fed can be a challenge. Metadata in upstream sources such as relational databases and log files can ‘drift’ due to infrastructure, OS and application changes, causing ETL tools and hand-coded solutions to fail. StreamSets Data Collector (SDC), is Apache 2.0 licensed open source software that allows data scientists and data engineers to build robust big data ingest pipelines using pre-built and custom processing stages via a browser-based UI.

In this session, Hari will explain how SDC integrates with Apache Spark, and how developers can create their own custom reusable processing elements using Spark’s programming model and existing libraries such as GraphX or MLLib. You'll learn how Spark can run SDC pipelines in a wide variety of environments, from standalone systems such as a developer's laptop, to on-premises and in-cloud clusters, allowing developers, data scientists and data engineers to process data at unprecedented scale.

Wednesday June 7, 2017 6:00pm - 8:00pm PDT
General Assembly 225 Bush St. (5th floor), San Francisco

6:00pm PDT

Next-generation Macros for Scala

This event is the SF Scala meetup run at Twitter.  Eventbrite registration is required, ilsted on the meetup!

This talk will present the design and the implementation of new-style macros based on Scalameta 2.0. Developed from the day one to be portable between Scala, Dotty and IntelliJ, new-style macros are planned to supersede existing macros based on scala.reflect. Join this meetup to be the first to try out the new macro system and provide early feedback to its authors.

Eugene is a member of the Scala language committee and founder of Scala macros and Scala meta. After recently getting his PhD from Martin Odersky at EPFL, he's been working at Twitter on open-source foundations of robust semantic tooling for Scala.


• 6:30-7:00 - networking 
• 7:00-7:10 - announcements etc. 
• 7:10-8:00 - talk 
• 8:00-8:30 - networking 

Wednesday June 7, 2017 6:00pm - 9:00pm PDT
Twitter 1355 Market St, San Francisco

7:00pm PDT

From R Script to Production Using rsparkling, Sparkling Water and sparklyr

Join us in San Francisco with Navdeep Gill, Hacker Scientist at H2O.ai for an evening of Spark + H2O + R! 


6:30 - 7:00 PM - Doors open and pizza

7:00 - 7:45 PM - Navdeep's talk  

7:45 - 8:15 PM - Q&A and networking

The rsparkling R package is an extension package for sparklyr (an R interface for Apache Spark) that creates an R front-end for the Sparkling Water Spark package from H2O. This provides an interface to H2O’s high performance, distributed machine learning algorithms on Spark, using R. The main purpose of this package is to provide a connector between sparklyr and H2O’s machine learning algorithms.

In this meetup, Navdeep will introduce the basic architectures of rsparkling, H2O Sparkling Water and sparklyr, and go over how these frameworks work together to build a cohesive machine learning framework. In addition, you’ll learn about various implementations for using rsparkling in production. The session will conclude with a live demo of rsparkling that will display an end-to-end use case of data ingestion, munging and machine learning.

Navdeep Gill is a Hacker Scientist at H2O.ai. He graduated from California State University, East Bay with a M.S. degree in Computational Statistics, B.S. in Statistics, and a B.A. in Psychology (minor in Mathematics). During his education he gained interests in machine learning, time series analysis, statistical computing, data mining, & data visualization.

Previous to H2O.ai he worked at a couple start ups and Cisco Systems, Inc. focusing on data science, software development, and marketing research. Before that, he was a consultant at FICO working with small to mid level banks in the U.S. & South America focusing on risk management.

Wednesday June 7, 2017 7:00pm - 8:15pm PDT
Capital One 201 3rd Street, 5th floor, San Francisco, CA
Thursday, June 8

9:00am PDT

Just Enough Scala for Spark
All-day SF Spark tutorial teaching you Scala for Apache Spark, in which it is written and whose Collections API it implements at scale.

avatar for Chaoran Yu

Chaoran Yu

Software Engineer, Lightbend Inc.

Thursday June 8, 2017 9:00am - 5:00pm PDT
Galvanize 44 Tehama St.

10:00am PDT

Introduction to Sparklyr
This session will cover what sparklyr is, and how it can be used to analyze, visualize and perform machine learning in Spark from R. We will walk through installation, configuration, data wrangling with SQL or dplyr, modeling in MLlib or H2O. You’ll then get a detailed update on new sparklyr features and changes in sparklyr 0.5. We will demonstrate how to model data using popular data frameworks in seamless integration between Spark and R.

avatar for Javier Luraschi

Javier Luraschi

Software Engineer, RStudio
Javier is the author of “Mastering Spark with R”, sparklyr, mlflow, pins and many other R packages for deep learning and data science. He holds a double degree in Math and Software Engineer and decades of industry experience with a focus on data analysis. He currently works in... Read More →

Thursday June 8, 2017 10:00am - 11:00am PDT
Galvanize 44 Tehama St.

6:00pm PDT

Spark, Watson, and Einstein


We have 5 talks for this amazing SF Spark meetup, showcasing the vibrant ecosystem around Apache Spark.

3 main talks, 20 minute each: 

PySpark Beyond Shuffling -- Why it isn't Magic -- but also where there is some really cool magic 
Holden Karau -- Apache Spark Committer, Spark Technology Center, IBM

Apache SystemML: State of the Project and Future Plans 
Frederick Reiss -- Chief Architect, Spark Technology Center, IBM

Kenneth Chen -- Cofounder of PredictionIO and Senior PM on Einstein, Salesforce

2 Lightning talks, about 10min each:

Hyperparameter Optimization -- when scikit-learn meets PySpark
Sven Hafeneger -- Software Engineer, IBM 

avatar for Kenneth Chan

Kenneth Chan

Senior Product Manager, Salesforce
Kenneth Chan is co-founder of PredictionIO (acquired by Salesforce and now Apache incubating project), and also contributed to various machine learning prediction engine templates in the community. He has over 10 years of engineering experience in multiple startups and building products... Read More →

Sven Hafeneger

Software Engineer, IBM
Software Developer | Data Scientist - Data Science Experience at IBM
avatar for Holden Karau

Holden Karau

Developer Advocate, Google
Holden Karau is a transgender Canadian open source developer advocate at Google focusing on Apache Spark, Beam, and related big data tools. Previously, she worked at IBM, Alpine, Databricks, Google (yes, this is her second time), Foursquare, and Amazon. Holden is the coauthor of Learning... Read More →
avatar for Fred Reiss

Fred Reiss

Chief Architect, IBM Spark Technology Center

Thursday June 8, 2017 6:00pm - 8:00pm PDT
IBM West 505 Howard St, San Francisco

6:00pm PDT

Watson Conversation with Node.JS

Chatbots have come a long way since AOL Instant Messenger, and they’re here to help! In this workshop, Nick will cover the basics of chatbot architecture.

Using Node.js and the Watson Conversation service, Nick will discuss dialog tree design, cognitive training models available for smarter UX, and logging API interactions for future analysis. 

The goal is to show attendees how you can build a cognitive chatbot, along with a plan for monitoring and analyzing application usage--all using cloud-based persistence and analytics services.

About the Speaker: Nick Bourdakos, Developer Advocate, IBM

Nick is a developer advocate intern for Watson Developer Cloud. He is currently a senior studying Computer Engineering at the University of Pittsburgh. His first language was Java and started out as an Android developer, but also knows Swift, Python and JavaScript. Nick enjoys painting in his free time.

Thursday June 8, 2017 6:00pm - 8:00pm PDT
Galvanize 44 Tehama St.

7:00pm PDT

Word Embeddings -- Past, Present and Future
This event is run at Bay Area AI meetup, presented and hosted by Uber.


Word Embeddings are both a hot research topic and a useful tool for NLP practitioners, as they provide representations used in many intermediate tasks, like part-of-speech tagging, syntactic parsing or named entity recognition, as well as end to end tasks like text classification, sentiment analysis and question answering.

The recent attention to this topic started in 2013 when the original word2vec paper was published at NIPS alongside with an efficient and scalable implementation, but a lot of research was carried out on the topic since the 50s in computer science, cognitive science, and computational linguistics. 
The Historical part of the talk will focus on this body of work, with the aim of distilling ideas and learned lessons many practitioners and machine learning researchers may not be unaware of.

The second part of the talk will focus on recent developments and novel methods, highlighting interesting directions that are being explored lately, like the compositionality of meaning, representing words as probability distributions and how to learn representations of knowledge graphs.

avatar for Piero Molino

Piero Molino

Research Scientist, Uber AI Labs
Machine Learning researcher at Uber AI Labs with focus on language. Completed a PhD on Question Answering at the University of Bari, Italy. Founded QuestionCube, a startup that built a framework for semantic search and QA. Worked for Yahoo Labs in Barcelona on learning to rank, IBM... Read More →

Thursday June 8, 2017 7:00pm - 8:30pm PDT
Uber 1455 Market St., San Francisco
Friday, June 9

9:00am PDT

Scala from the Ground Up

This concentrated Scala course from the SF Scala co-organizer and O'Reilly author of "Learning Scala" Jason Swartz will get you from 0 to 1!

This event is run as SF Scala meetup hosted by Galvanize, and requires Eventbrite registration -- details to follow here. 

avatar for Jason Swartz

Jason Swartz

Software Developer, Twitch
Building the next generation of scalable edge services at Twitch. Author of Learning Scala (O'Reilly Media, 2014)

Friday June 9, 2017 9:00am - 12:00pm PDT
Galvanize 44 Tehama St.

10:00am PDT

Deep Learning in Production with Deeplearning4j

DeepLearning4J (Deep Learning for Java - DL4J, inception 2013) was specifically designed with Enterprise and Production in mind, as a first-class citizen to the JVM.  The DL4J Stack was designed to integrate well with other components of the Big Data Ecosystem, with the ability to scale.  While there do exist Open Source components, Skymind also develops an additional layer, the Skymind Intelligence Layer (SKIL) as part of the bundled vendor distribution.  For performance functionality, some of the underlying operations of the stack are written natively in C++.


  • Intro to Skymind/DL4J, Core Components, and Framework Capabilities
  • How to create/setup the environment & executing our examples
  • DL4J Deep Learning Workflow + Keras Model Import
  • DL4J Network Demos: VGG-16 Classifier, etc...
  • Distributed Parallel Training on Multi-GPUs via Apache Spark
  • Select Contrasts with Deep Learning Framework

avatar for Ari Kamlani

Ari Kamlani

Principal Data Scientist, ThoughtWorks
Data Scientist and Technology Strategist & Advisor, currently employed as a Deep Learning Consultant with Skymind and Technologist in Residence (TIR) with Techstars IoT. Previously a Data Scientist & Engineering Consultant at Otto (Tyto) for the Connected Home and Research Assistant... Read More →

Friday June 9, 2017 10:00am - 12:00pm PDT
Galvanize 44 Tehama St.

1:00pm PDT

Charting Collections of Connections in Social Media: Creating Maps and Measures with NodeXL

Networks are a data structure commonly found in any social media service that allows populations to author collections of connections.  The Social Media Research Foundation's NodeXL project makes analysis of social media networks accessible to most users of the Excel spreadsheet application.  With NodeXL, network charts become as easy to create as pie charts.  Recent research created by applying the tool to a range of social media networks has already revealed the variations in network structures present in online social spaces.  A review of the tool and images of Twitter, flickr, YouTube, Facebook and email networks will be presented. 

Description: We now live in a sea of tweets, posts, blogs, and updates coming from a significant fraction of the people in the connected world.  Our personal and professional relationships are now made up as much of texts, emails, phone calls, photos, videos, documents, slides, and game play as by face-to-face interactions.  Social media can be a bewildering stream of comments, a daunting fire hose of content. With better tools and a few key concepts from the social sciences, the social media swarm of favorites, comments, tags, likes, ratings, updates and links can be brought into clearer focus to reveal key people, topics and sub-communities.  As more social interactions move through machine-readable data sets new insights and illustrations of human relationships and organizations become possible.  But new forms of data require new tools to collect, analyze, and communicate insights.  

avatar for Marc Smith

Marc Smith

Director, Social Media Research Foundation
Marc Smith is a sociologist specializing in the social organization of online communities and computer mediated interaction. Smith leads the Connected Action consulting group and lives and works in Silicon Valley, California. Smith co-founded and directsthe Social Media Research... Read More →

Friday June 9, 2017 1:00pm - 2:00pm PDT
Galvanize 44 Tehama St.

1:00pm PDT

Optimizing and Deploying Spark and TensorFlow Models with GPUs

Using the latest advancements from TensorFlow including the Accelerated Linear Algebra (XLA) Framework, JIT/AOT Compiler, and Graph Transform Tool , I’ll demonstrate how to optimize, profile, and deploy TensorFlow Models in GPU-based production environment.
This talk is contains many Spark ML and TensorFlow AI demos using PipelineIO's 100% Open Source Community Edition.  All code and Docker images are available to reproduce on your own CPU or GPU-based cluster.

avatar for Chris Fregly

Chris Fregly

Developer Advocate, AI and Machine Learning, AWS

Friday June 9, 2017 1:00pm - 5:00pm PDT
Galvanize 44 Tehama St.

2:00pm PDT

Product-Managing AI
Product Managers need to thoroughly understand AI to build AI-enabled products and businesses.  Mike Feng, an experienced startup product executive, will share his insights from years of building ML-enabled businesses.

avatar for Michael Feng

Michael Feng

Michael Feng was the CEO and co-founder of doxIQ, a Data Collective-backed startup that utilized computer vision to extract structured data from PDF files. In 2015, doxIQ was acquired by Nitro, where Michael led the product management team and served as product manager for the flagship... Read More →

Friday June 9, 2017 2:00pm - 3:00pm PDT
Galvanize 44 Tehama St.

3:00pm PDT

Building a fully open-source stack to analyze the code of +60mm software projects using Deep Learning
In this talk we show the fully open source stack that has been built to discover and fetch all of the world's public git repositories, turn the code into a universal AST, the kinds of a neural network architectures that perform well on source code, the tools that have been built to deal with such a large and unique data set and the future work that needs to be done. We also talk about the impact of AI on code on the future of programming and building software.

avatar for Eiso Kant

Eiso Kant

Founder and CEO, Source{d}
Eiso Kant is the CEO & Co-Founder of source{d}, a company focused on building an open-source stack for ML/AI on code. He has been involved in founding tech companies for the last 10 years. Eiso has a passion for programming, machine learning, AGI and cryptography.

Friday June 9, 2017 3:00pm - 4:00pm PDT
Galvanize 44 Tehama St.

4:00pm PDT

Cisco Spark: Open Platform for Collaborative AI

This event is run as a Bay Area AI meetup:


Learn how Artificial Intelligence (AI) is transforming the collaboration experience and enabling new cognitive capabilities. Cisco Spark’s open platform allows you to build custom apps, intelligent conversational bots, and workflow integrations to drive the workplace of the future. This session will focus on:

  • Current and future trends of of conversational chat and voice bots
  • How AI-powered collaboration solutions will change the workplace

avatar for Jose de Castro

Jose de Castro

CTO for APIs and Integrations, Collaboration Technology Group, Cisco

Friday June 9, 2017 4:00pm - 5:00pm PDT
Galvanize 44 Tehama St.

5:00pm PDT

The Change AI Needs, not the one it Wants
AI and deep learning have taken the world by storm, especially Silicon Valley, with companies in almost every industry creating positions in the field. Currently, there is a sizeable gap between what AI promises to do for companies and what it actually is doing for them. I want to take a deeper look into the state of AI in industry. This talk will address some of the problems and challenges that AI has, how it can help industries when it is working properly, and how to help get AI from where it is today to where it has the potential to be more quickly.

avatar for Pedro Alves

Pedro Alves

Founder and CEO, Ople
Pedro has experience in predicting, analyzing and visualizing data in the fields of: genomics, gene networks, cancer metastasis, insurance fraud/costs, hospital readmissions, soccer strategies, joint injuries, social graphs, human attraction, spam detection, topic modeling and computer... Read More →

Friday June 9, 2017 5:00pm - 6:00pm PDT
Galvanize 44 Tehama St.
Filter sessions
Apply filters to sessions.