Causal inference with geospatial data

Materials for a 4-hour hands-on workshop on causal inference with geospatial data. It is aimed at social scientists who are comfortable with regression but want a clearer way to think about causal claims, the assumptions behind them, and why spatial data make those assumptions harder to satisfy. It focuses on breadth and intuition.

The workshop is two short lectures plus a worked Python practical on a real case study: does livestock density raise ammonia (NH₃) concentrations?

The website for the workshop is here. The GitHub repository is at https://github.com/sodascience/workshop_geocausal.

One workshop backbone

Throughout the lectures and the practical we keep returning to the same five questions. This is the whole workshop in one checklist:

What is the treatment?
What is the estimand?
What is the comparison?
What assumption makes that comparison credible?
Why might that assumption fail?

Schedule and materials

Duration	Activity	Content	Link
45 min	Lecture 1	Counterfactuals, estimands, exogenous variation, causal designs	lecture 1
15 min	Break
40 min	Lecture 2	Spatial confounding, spillovers, scale, why spatial models are not causal designs	lecture 2
60 min	Practical	Maps and association → confounders and Moran's I → spatial models → DiD with farm gains → spillover-aware interpretation	practical notebook

The lectures are reveal.js slides — open the .html files directly in a browser, no setup required. Their source is in the matching .qmd files.

The practical

The practical is a worked example, not a hidden causal proof. Working from a single grid dataset, participants move through:

maps and descriptive association
controls and residual spatial clustering (Moran's I)
spatial lag / error / Durbin models
a difference-in-differences with farm-gain vs no-change cells (2020 → 2024)
why spillovers make that DiD fragile, and how to read a Spatial Durbin model
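
To make the Moran's I step in the list above concrete, here is a minimal sketch of the statistic on a toy regular grid. It uses only NumPy and row-standardised rook-contiguity weights; the helper names `rook_weights` and `morans_i` are illustrative, and the practical notebook may well rely on a dedicated spatial library and a different weights definition.

```python
import numpy as np

def rook_weights(n_rows, n_cols):
    """Row-standardised rook-contiguity weights for a regular grid.

    Cells are numbered row by row; neighbours share an edge.
    """
    n = n_rows * n_cols
    W = np.zeros((n, n))
    for r in range(n_rows):
        for c in range(n_cols):
            i = r * n_cols + c
            for dr, dc in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                rr, cc = r + dr, c + dc
                if 0 <= rr < n_rows and 0 <= cc < n_cols:
                    W[i, rr * n_cols + cc] = 1.0
    # Each row sums to 1, so W @ z is the neighbour average of z.
    return W / W.sum(axis=1, keepdims=True)

def morans_i(values, W):
    """Moran's I = (n / S0) * (z' W z) / (z' z), with z the centred values."""
    z = np.asarray(values, dtype=float)
    z = z - z.mean()
    return (z.size / W.sum()) * (z @ W @ z) / (z @ z)

# Toy data (not the workshop data): a smooth gradient and pure noise.
rows, cols = 10, 10
W = rook_weights(rows, cols)
rng = np.random.default_rng(0)
gradient = np.add.outer(np.arange(rows), np.arange(cols)).ravel().astype(float)
noise = rng.normal(size=rows * cols)

i_gradient = morans_i(gradient, W)  # strongly positive: neighbours look alike
i_noise = morans_i(noise, W)        # near zero: no spatial structure
print(f"gradient: {i_gradient:.2f}, noise: {i_noise:.2f}")
```

Applied to regression residuals, a clearly positive value suggests that the controls have not absorbed all spatial structure. That is a diagnostic, not evidence for or against a causal effect.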

The takeaway: a map shows where, a regression shows what correlates, and a design tells you what would need to be true for a causal claim.
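
The difference-in-differences comparison can be sketched on simulated data. Everything below is an assumption made for illustration: the share of farm-gain cells, the common trend, and a true effect of 1.0 are invented numbers, and the variable names are hypothetical rather than columns of the workshop file.

```python
import numpy as np

rng = np.random.default_rng(42)
n_cells = 2000

# Hypothetical set-up: about 30% of cells gain a farm between 2020 and 2024.
treated = rng.random(n_cells) < 0.3

# Farm-gain cells start from a higher NH3 level (a time-invariant confounder).
baseline = 5.0 + 2.0 * treated + rng.normal(0, 1, n_cells)
common_trend = -0.5   # change that hits every cell equally
true_effect = 1.0     # assumed effect of gaining a farm

nh3_2020 = baseline + rng.normal(0, 0.5, n_cells)
nh3_2024 = (baseline + common_trend + true_effect * treated
            + rng.normal(0, 0.5, n_cells))

def did_2x2(y_pre, y_post, treated):
    """Mean change among treated cells minus mean change among controls."""
    change = y_post - y_pre
    return change[treated].mean() - change[~treated].mean()

did_estimate = did_2x2(nh3_2020, nh3_2024, treated)
# Cross-sectional contrast in 2024 only: mixes the effect with baseline gaps.
naive_2024 = nh3_2024[treated].mean() - nh3_2024[~treated].mean()

print(f"DiD estimate: {did_estimate:.2f}")
print(f"Naive 2024 difference: {naive_2024:.2f}")
```

In this simulation the DiD estimate lands close to the assumed effect because parallel trends holds by construction and there are no spillovers. If emissions from a new farm also raised NH₃ in nearby control cells, the control change would be contaminated and the estimate biased, which is the fragility the practical discusses.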

practical/practical_grid_nh3.ipynb is the main notebook used in the practical session — read it rendered online.
practical/practical_grid_nh3_butts.ipynb is an optional, more advanced notebook on design-based spillover DiD (far controls and distance rings) — read it rendered online.

Both practicals are also available as reactive marimo notebooks (practical/*.py) — see Quick start.

Quick start

You need uv, a fast Python package and environment manager. uv reads pyproject.toml and uv.lock and builds the exact environment automatically the first time you run something — there is no separate "create a venv" step.

1. Install uv

macOS:

curl -LsSf https://astral.sh/uv/install.sh | sh
# or, with Homebrew:  brew install uv

Linux (Ubuntu / Debian):

sudo apt update && sudo apt install -y curl git   # only if they are missing
curl -LsSf https://astral.sh/uv/install.sh | sh

Windows (PowerShell):

powershell -ExecutionPolicy ByPass -c "irm https://astral.sh/uv/install.ps1 | iex"

Alternatively, on any platform: pip install uv (or pipx install uv). See the uv install docs for details. Restart your terminal afterwards, then check it works:

uv --version

2. Get the materials

Clone the repository (or download it as a ZIP and unzip it):

git clone https://github.com/sodascience/workshop_geocausal.git
cd workshop_geocausal

3. Open the practical

From the project root, open the main notebook in JupyterLab. This is the recommended route if you have no experience with marimo:

uv run jupyter lab practical/practical_grid_nh3.ipynb

The first run downloads and installs all dependencies (this can take a few minutes); later runs start instantly. JupyterLab opens in your browser — run the cells top to bottom.

Prefer the classic interface? Use uv run jupyter notebook instead of jupyter lab.

Would you rather use marimo?

uv run marimo edit practical/practical_grid_nh3.py

(marimo edit lets you run and change cells; uv run marimo run practical/practical_grid_nh3.py opens it read-only as an app.)

Data

The practical uses a single, ready-to-use file, data/final/workshop_grid_1km.csv:

workshop_grid_1km.csv — the Netherlands on a 1 × 1 km grid, with NH₃ concentrations (2018–2024), livestock and agricultural firm counts per cell, and neighbourhood covariates (population density, urbanity).

It was built from several sources:

RIVM NH₃ concentration maps,
the CBS Wijk- en Buurtkaart and Kerncijfers, and
Bureau van Dijk Orbis firm data. Orbis is proprietary, so only the aggregated per-cell counts appear in the shared file; the raw inputs and the data-building scripts are not redistributed.
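
For a first look at the grid file before opening the notebook, a short pandas sketch like the one below may help. It deliberately assumes nothing about column names and only reports what it finds; `summarise_grid` is a hypothetical helper, not part of the repository.

```python
from pathlib import Path

import pandas as pd

# Path as given in this README, relative to the project root.
GRID_PATH = Path("data/final/workshop_grid_1km.csv")

def summarise_grid(csv_path):
    """Load a grid CSV and return it with a per-column summary.

    The summary lists each column's dtype and its share of missing values.
    """
    df = pd.read_csv(csv_path)
    summary = pd.DataFrame({
        "dtype": df.dtypes.astype(str),
        "share_missing": df.isna().mean(),
    })
    return df, summary

if GRID_PATH.exists():
    grid, summary = summarise_grid(GRID_PATH)
    print(f"{grid.shape[0]} rows, {grid.shape[1]} columns")
    print(summary)
else:
    print(f"{GRID_PATH} not found; run this from the project root.")
```

Running it inside the project environment (for example with `uv run python`) should make pandas available, assuming it is among the locked dependencies.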

Rebuilding the lectures (optional)

The rendered lecture HTML is already included. To rebuild from source you need Quarto and, for the DAG figures, the Python graphviz package plus the Graphviz dot system binary:

quarto render lectures/1_intro_causality/1_intro_causality.qmd --to revealjs
quarto render lectures/2_geocausality/2_geocausality.qmd --to revealjs
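
Before rendering, it can save time to confirm that the prerequisites named above are visible from your environment. The following is a small sketch using only the standard library; `missing_lecture_prerequisites` is a hypothetical helper, and the check only tells you whether the tools can be found, not whether their versions are suitable.

```python
import importlib.util
import shutil

def missing_lecture_prerequisites():
    """Return a list of prerequisites that cannot be found.

    Checks the `quarto` and `dot` executables on PATH and the Python
    `graphviz` package in the current interpreter's environment.
    """
    missing = []
    for binary in ("quarto", "dot"):
        if shutil.which(binary) is None:
            missing.append(f"{binary} (command-line tool)")
    if importlib.util.find_spec("graphviz") is None:
        missing.append("graphviz (Python package)")
    return missing

missing = missing_lecture_prerequisites()
if missing:
    print("Missing: " + ", ".join(missing))
else:
    print("All lecture prerequisites found.")
```

The Python package check reflects whichever interpreter runs the script, so run it in the same environment you will use for `quarto render`.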

Contact

Developed and maintained by the ODISSEI Social Data Science (SoDa) team.

Questions? Email soda@odissei-data.nl, or contact the instructor Javier Garcia-Bernardo (j.garciabernardo@uu.nl) directly.
