Keyboard shortcuts

Press or to navigate between chapters

Press S or / to search in the book

Press ? to show this help

Press Esc to hide this help

Glossary and DAG Hand-outs

A reference page covering causal inference terminology and links to hand-out PDFs on directed acyclic graphs (DAGs), confounding, selection bias, and measurement error.


Causal inference glossary

Causal inference rests on mathematical foundations that enjoy broad agreement, but the terminology varies across disciplines. The same word sometimes carries different (even opposite) meanings in different literatures. Terms to watch include "selection", "fixed effects", "standardisation", "moderator", "structural equation model", and "identification".

Core concepts

TermDefinition
Average Treatment Effect (ATE)The expected difference in potential outcomes across the entire population: . Also called the marginal effect.
Conditional Average Treatment Effect (CATE)The ATE within a subgroup defined by covariates : .
Potential outcomesThe outcomes that would be observed under each possible treatment level. For individual : under treatment, under control. Also called counterfactual outcomes.
CounterfactualThe potential outcome corresponding to the treatment level not actually received. Unobservable for any given individual.
Causal consistency when . The observed outcome equals the potential outcome under the treatment actually received. Requires well-defined treatment and no interference.
ExchangeabilityPotential outcomes are independent of treatment assignment: . In observational studies, we require conditional exchangeability: .
PositivityEvery subgroup has a non-zero probability of receiving each treatment level: .

Graphical concepts

TermDefinition
DAG (directed acyclic graph)A graph with directed edges (arrows) and no cycles. Used to encode causal assumptions about which variables influence which.
ConfounderA common cause of both the exposure and the outcome. Creates a non-causal (backdoor) path that must be blocked for valid causal inference.
ColliderA variable caused by two or more other variables on a path. Conditioning on a collider opens a spurious association.
MediatorA variable on the causal pathway between exposure and outcome (). Conditioning on a mediator blocks the indirect effect.
Backdoor pathA non-causal path from exposure to outcome that passes through a common cause. Blocking all backdoor paths satisfies the backdoor criterion.
d-separationA graphical criterion for determining conditional independence. Two variables are d-separated given a set if every path between them is blocked by .

Estimation and sensitivity

TermDefinition
Propensity scoreThe probability of receiving treatment given covariates: . Used for weighting, matching, or stratification.
E-valueThe minimum strength of association an unmeasured confounder would need with both treatment and outcome (on the risk ratio scale) to explain away an observed effect.
RATE (Rank Average Treatment Effect)A metric for assessing treatment effect heterogeneity. Measures how well a prioritisation rule identifies individuals with larger effects.
QINI curveA cumulative gain curve showing the benefit of treating individuals in order of predicted treatment effect. Area under the QINI curve summarises heterogeneity.
Policy treeA decision tree that assigns treatment based on covariates to maximise a welfare criterion. Used for identifying high-response subgroups.
Doubly robust estimationAn estimation strategy that yields consistent causal estimates if either the outcome model or the propensity score model (but not necessarily both) is correctly specified.

DAG hand-outs

The following hand-outs cover DAG conventions, common structures, and specific forms of bias. All PDFs are available for download from the hand-outs folder (Dropbox).

Foundations

Hand-outTopic
1a. Local conventionsConventions for causal diagram construction and interpretation
1b. Directed graph terminologyCore terminology for directed acyclic graphs
S1. Graphical keyVisual reference guide for DAG symbols and notation
S2. GlossaryComprehensive glossary of causal inference terminology

Common applications

Hand-outTopic
2. Common causal questionsFrequently encountered causal questions and how to set them up
6. Effect modificationWhen and how treatment effects vary across subgroups
9. External validityGeneralising causal findings across populations and contexts

Time series and confounding

Hand-outTopic
3. Time series approachesHow longitudinal data help address confounding bias
4. Three-wave panel methodsUsing three-wave panel data for causal inference
5. Time series limitationsWhen time series approaches may not resolve confounding
S3. Time-resolved confoundingAdvanced approaches to time-varying confounding

Advanced topics

Hand-outTopic
7. Selection biasSelection bias in longitudinal studies
8. Measurement errorStructural approaches to representing and addressing measurement error
10. Experimental designHow experiments address confounding and selection bias

Supplementary materials

Hand-outTopic
S5. Timing examplesPractical examples of confounding and timing issues
S6. Detailed panel examplesWhat can go wrong in a three-wave panel
S7. Cross-sectional approachesWhen to report multiple DAGs in cross-sectional studies
S8. Bias correctionQuantitative approaches to bias correction
S9. Mediator biasConfounding bias in mediation analysis
S10. Misclassification biasExamples of misclassification bias and bias towards the null

Accessing hand-outs

All PDFs are in the hand-outs folder on Dropbox. File names match the numbering in the tables above (e.g., 1a-terminologylocalconventions.pdf, S2-glossary.pdf).