Causal Diagrams: Five Elementary Structures

PSYC 434 — Week 2

Motivating example: the Salk vaccine

In 1954, investigators first evaluated the polio vaccine with an observational comparison: children whose parents consented versus children whose parents did not.

That comparison suggested higher polio rates among vaccinated children. The design was confounded.

The reversal

A large randomised, double-blind trial assigned vaccine or placebo by chance. The conclusion reversed: the vaccine reduced paralytic polio.

Same question. Different assignment mechanism. Different answer.

Why this week matters

In Week 1 we defined causal questions informally. This week we learn how to represent structural assumptions as directed acyclic graphs (DAGs).

A DAG does not create causal knowledge. It makes assumptions explicit, checkable, and discussable.

Randomisation and exchangeability

Under random assignment, potential outcomes are independent of treatment:

Y(a) \coprod A.

This is unconditional exchangeability. Under this condition, a simple difference in means identifies the ATE:

\widehat{\text{ATE}}=\hat{\mathbb{E}}[Y\mid A=1]-\hat{\mathbb{E}}[Y\mid A=0].

Working definitions

Term Definition
Internal validity The study contrast estimates the target causal contrast in the study population
External validity That causal contrast transports to the target population
Confounding bias At least one backdoor path from A to Y remains open

These are design properties, not instrument properties.

DAG notation

DAG notation conventions

DAG elements

General DAG terminology
  • A: treatment or exposure
  • Y: outcome
  • Y(a): potential outcome under intervention level a
  • L: measured confounder set
  • U: unmeasured cause
  • M: mediator

Independence language

Symbol Meaning
A \coprod Y(a) Independence
A \cancel\coprod Y(a) Dependence
A \coprod Y(a) \mid L Conditional independence given L

Five elementary structures

Five elementary DAG structures
  1. No causal relation: A \coprod B
  2. Direct causation: A \to B
  3. Fork (common cause): A \leftarrow C \to B
  4. Chain (mediation): A \to C \to B
  5. Collider: A \to C \leftarrow B

Three identification assumptions

Causal consistency: If person i receives A_i = a, then Y_i = Y_i(a).

Conditional exchangeability:

Y(a) \coprod A \mid L.

Positivity:

P(A = a \mid L = l) > 0.

Four rules of confounding control

Confounding structures
  1. Condition on common causes (or defensible proxies)
  2. Do not condition on mediators when estimating total effects
  3. Do not condition on colliders
  4. Treat descendants carefully: conditioning can transmit bias

Return to the opening example

The Salk reversal was not a regression trick. It was a structural lesson about the assignment mechanism.

  1. Define the question (does the vaccine prevent polio?)
  2. Draw assumptions (what affects both treatment and outcome?)
  3. Choose an adjustment set (or randomise to avoid the problem)
  4. Estimate

Readings

Required and optional readings for each week are listed on the course readings page.