The Causal Workflow: Ten Steps

This page summarises a ten-step framework for conducting causal inference with observational data. Each step addresses a potential threat to valid causal interpretation. The framework draws on Hernan and Robins (What If, 2024), VanderWeele (2022), and Bulbulia (2024).

How to use this page

This is a reference resource. Use it as a checklist when planning your research report. Each step links back to the lecture where it was introduced.

Step 0: Define the target population

Say exactly who the answer is meant to inform before choosing the exposure or outcomes. A causal question about "New Zealanders" differs from one about "adults for whom weekly religious-service attendance is a meaningful option" or "working-age adults for whom volunteering is feasible." The population choice shapes which interventions are coherent, which outcomes matter, and where positivity may fail.

Eligibility rules define the source population, but sampling and participation can yield a study population with a different distribution of effect modifiers. If you intend to generalise beyond the source population (transportability), articulate the additional conditions required.

See Week 6 for how effect modification interacts with population composition.

Step 1: State a well-defined treatment

Specify the hypothetical intervention precisely enough that every member of the target population could, in principle, receive it. "Mindfulness" is too vague because people meditate with different apps, in groups, alone, once, or every day. A clearer intervention is: "start a guided mindfulness app at the beginning of semester and complete one 10-minute session per day for eight weeks."

Precision here underwrites the causal consistency assumption (Step 5). If the treatment is vaguely defined, different people effectively receive different interventions, and the potential outcome $Y(a)$ is not well-defined. It also makes the relevant time origin visible.

See Week 5 for the formal definition of causal consistency.

Step 2: Establish time zero

Define the point at which treatment assignment begins and follow-up starts. We cannot do this until the treatment is specified, because time zero is defined relative to treatment assignment or initiation.

See Week 5 for why ambiguous time zero distorts target trial emulation.

Step 3: State a well-defined outcome

Define the outcome so the causal contrast is meaningful and temporally anchored. "Sense of Purpose" is underspecified; "psychological distress one year post-intervention, measured with the Kessler-6" is interpretable and reproducible. Include timing, scale, and instrument.

See Week 4 for measurement from the vantage point of causal inference, and Week 10 for the problems with classical test theory.

Step 4: Evaluate exchangeability

Make the case that potential outcomes are independent of treatment conditional on covariates: $Y(a) \coprod A \mid X$. Use DAGs, subject-matter arguments, pre-treatment covariate balance checks, and overlap diagnostics. If exchangeability is doubtful, redesign (e.g., stronger measurement, alternative identification strategies) rather than rely solely on modelling.

See Week 5 for the formal definition of exchangeability.

Step 5: Ensure causal consistency

Consistency requires that, for individuals receiving a treatment version compatible with level $a$, the observed outcome equals $Y(a)$. It also presumes well-defined versions and no interference between units. When multiple versions exist, either refine the intervention so versions are irrelevant to $Y(a)$, or condition on version-defining covariates.

See Week 5 for examples of consistency violations.

Step 6: Check positivity (overlap)

Each treatment level must occur with non-zero probability at every covariate profile needed for exchangeability:

$$P(A = a \mid L = l) > 0$$

Diagnose limited overlap using propensity score distributions or extreme weights. Consider design-stage remedies (trimming, restriction, adaptive sampling) before estimation.

See Week 5 for the formal positivity assumption.

Step 7: Ensure measurement aligns with the scientific question

Verify that constructs are captured by instruments whose error structures do not distort the causal contrast of interest. Be explicit about forms of measurement error (classical, Berkson, differential, misclassification) and their structural implications for bias.

See Week 4 for measurement error in causal DAGs, and Week 10 for classical measurement theory from a causal perspective.

Step 8: Preserve representativeness

End-of-study analyses should reflect the target population's distribution of effect modifiers. Differential attrition, non-response, or measurement processes tied to treatment and outcomes can induce selection bias. Plan strategies such as inverse probability weighting for censoring, multiple imputation, and sensitivity analyses for missing-not-at-random data.

See Week 4 for selection bias structures.

Step 9: Document transparently

Make assumptions, disagreements, and judgement calls legible. Register or timestamp your analytic plan. Include identification arguments (DAGs), code, and data where possible. Report robustness and sensitivity analyses. Transparent reasoning is a scientific result in its own right.

Summary table

StepRequirementCore assumption
0Target populationGeneralisability
1Well-defined treatmentConsistency
2Time zero aligned to assignmentTemporal coherence
3Well-defined outcomeInterpretability
4ExchangeabilityConditional independence
5Causal consistencyNo interference, well-defined versions
6PositivityOverlap
7Measurement validityNo differential error
8RepresentativenessNo selection bias
9Transparent documentationReproducibility