Skip to content
Caitlin Casar, PhD edited this page Aug 2, 2022 · 4 revisions

Intro to Apache Spark with 84.51°

Welcome to the Intro to Apache Spark with 84.51° workshop hosted by Northwestern University IT Research Computing Services! In this 3 hour workshop, we'll cover fundamental Spark concepts, basic cloud computing with Azure Databricks, and exercises demonstrating the power of Spark!

Prerequisites: The workshop will assume that you're familiar with basic Python and SQL, as well as developing in a notebook (Jupyter).

Workshop Agenda

  • Intro to Spark and the Databricks UI (30 mins)
  • 10 min break
  • Guided Exercises Part I (25 mins)
  • 5 min break
  • Guided Exercises Part II (25 mins)
  • 10 min break
  • Individual Exercises (1 hour)

Resources

  • Read about the data we're using for the workshop here!
  • Check out our conceptual Spark slides here!

Clone this wiki locally