Causal Diagrams: Five Elementary Structures

Motivating example: the Salk vaccine

In 1954, investigators first evaluated the polio vaccine with an observational comparison: children whose parents consented versus children whose parents did not.

That comparison suggested higher polio rates among vaccinated children. The design was confounded.

The reversal

A large randomised, double-blind trial assigned vaccine or placebo by chance. The conclusion reversed: the vaccine reduced paralytic polio.

Same question. Different assignment mechanism. Different answer.

Why this week matters

In Week 1 we defined causal questions informally. This week we learn how to represent structural assumptions as directed acyclic graphs (DAGs).

A DAG does not create causal knowledge. It makes assumptions explicit, checkable, and discussable.

Randomisation and exchangeability

Under random assignment, potential outcomes are independent of treatment:

Y(a) \coprod A.

This is unconditional exchangeability. Under this condition, a simple difference in means identifies the ATE:

\widehat{\text{ATE}}=\hat{\mathbb{E}}[Y\mid A=1]-\hat{\mathbb{E}}[Y\mid A=0].

Working definitions

Term	Definition
Internal validity	The study contrast estimates the target causal contrast in the study population
External validity	That causal contrast transports to the target population
Confounding bias	At least one backdoor path from A to Y remains open

These are design properties, not instrument properties.

DAG notation

DAG elements

A: treatment or exposure
Y: outcome
Y(a): potential outcome under intervention level a
L: measured confounder set
U: unmeasured cause
M: mediator

Independence language

Symbol	Meaning
A \coprod Y(a)	Independence
A \cancel\coprod Y(a)	Dependence
A \coprod Y(a) \mid L	Conditional independence given L

Five elementary structures

No causal relation: A \coprod B
Direct causation: A \to B
Fork (common cause): A \leftarrow C \to B
Chain (mediation): A \to C \to B
Collider: A \to C \leftarrow B

Three identification assumptions

Causal consistency: If person i receives A_i = a, then Y_i = Y_i(a).

Conditional exchangeability:

Y(a) \coprod A \mid L.

Positivity:

P(A = a \mid L = l) > 0.

Four rules of confounding control

Condition on common causes (or defensible proxies)
Do not condition on mediators when estimating total effects
Do not condition on colliders
Treat descendants carefully: conditioning can transmit bias

Return to the opening example

The Salk reversal was not a regression trick. It was a structural lesson about the assignment mechanism.

Define the question (does the vaccine prevent polio?)
Draw assumptions (what affects both treatment and outcome?)
Choose an adjustment set (or randomise to avoid the problem)
Estimate

Readings

Required and optional readings for each week are listed on the course readings page.