Reporting Guide
This guide covers how to report results from causal inference analyses. It follows the nine-step causal workflow and demonstrates best practices for communicating average treatment effects, heterogeneous effects, and sensitivity analyses. This resource directly supports the research report assessment (40%).
The nine-step causal inference workflow
Before reporting results, ensure each step has been addressed.
Steps 1–3: problem definition
- Well-defined treatment. Specify the exposure precisely, including the contrast (e.g., "weekly religious service attendance vs. less than weekly").
- Well-defined outcome. State the outcome measure, its scale, and when it was assessed.
- Target population. Define who the results apply to, including any weighting for population representativeness.
Steps 4–6: identification strategy
- Exchangeability. Describe how conditional independence was achieved (e.g., "rich baseline covariate control including 32 covariates").
- Consistency. Explain why the treatment is well-defined and uniform across individuals.
- Positivity. Report verification through transition tables showing exposure variation across covariate strata.
Steps 7–9: implementation
- Measurement validity. Note the psychometric properties of outcome scales.
- Attrition handling. Describe how panel dropout was addressed (e.g., inverse probability of censoring weights).
- Transparent reporting. Document all analytical decisions and assumptions.
Frame your causal question as: "How would outcomes change if we intervened to set everyone's exposure to level rather than , conditional on baseline characteristics?"
Reporting average treatment effects
Standard ATE table format
Include these elements for each outcome:
| Outcome | Estimate (SD units) | 95% CI | E-value | Interpretation |
|---|---|---|---|---|
| Outcome A | 0.12 | [0.08, 0.16] | 1.8 | Small positive effect |
| Outcome B | 0.15 | [0.11, 0.19] | 2.1 | Moderate positive effect |
Key reporting elements
- Effect sizes: report in standard deviation units for continuous outcomes.
- Confidence intervals: show uncertainty around each estimate.
- E-values: indicate robustness to unmeasured confounding.
- Sample size: total analysed after exclusions and weighting.
Example results text
"Weekly religious service attendance showed positive causal effects across all cooperation measures. The largest effects were observed for social outcomes: sense of belonging (, 95% CI: 0.14–0.22) and social support (, 95% CI: 0.11–0.19). All effects were robust to moderate unmeasured confounding (E-values > 1.6)."
Reporting heterogeneous treatment effects
RATE results summary
When heterogeneity is detected, report it systematically:
| Outcome | RATE-AUTOC | p-value | RATE-Qini | p-value | Evidence |
|---|---|---|---|---|---|
| Social support | 0.12 | 0.003 | 0.08 | 0.012 | Strong |
| Belonging | 0.15 | 0.001 | 0.11 | 0.004 | Strong |
| Charitable donations | 0.06 | 0.089 | 0.04 | 0.156 | Moderate |
| Volunteering | 0.03 | 0.234 | 0.02 | 0.445 | Weak |
Example heterogeneity text
"We found substantial heterogeneity in treatment effects for social outcomes (RATE-Qini > 0.08, ), but limited heterogeneity for behavioural outcomes. This suggests that while religious service benefits most people socially, individual responses vary considerably in magnitude."
Reporting policy tree results
Present policy trees with both standardised and original-scale interpretations.
Subgroup identification with data-scale effects
High-response subgroups for charitable donations:
-
Older adults with high agreeableness (age > 45, agreeableness > +1 SD)
- Standardised effect: (95% CI: 0.21–0.35)
- Data-scale effect: NZ$847 additional annual donations (95% CI: NZ$635–1,058)
- Sample proportion: 23%
-
Parents with medium conscientiousness (parent = yes, conscientiousness > 0)
- Standardised effect: (95% CI: 0.16–0.28)
- Data-scale effect: NZ$665 additional annual donations (95% CI: NZ$484–846)
- Sample proportion: 31%
-
All others
- Standardised effect: (95% CI: 0.04–0.12)
- Data-scale effect: NZ$242 additional annual donations (95% CI: NZ$121–363)
- Sample proportion: 46%
Example policy tree text
"Policy tree analysis identified two subgroups with enhanced treatment response for charitable donations. Older adults (45+) with high agreeableness showed the largest increase ( SD, equivalent to NZ$847 additional annual donations), representing 23% of the sample. In practical terms, targeted interventions toward these subgroups could generate 2.8–3.5 times more charitable giving than population-wide approaches."
Sensitivity analysis: E-values
Interpretation
E-values quantify robustness to unmeasured confounding. The E-value is the minimum strength of association, on the risk ratio scale, that an unmeasured confounder would need with both the treatment and the outcome to explain away the observed effect.
| E-value range | Interpretation |
|---|---|
| Robust to strong confounding | |
| 1.5–2.0 | Robust to moderate confounding |
| Vulnerable to weak confounding |
Example sensitivity text
"To assess robustness to unmeasured confounding, we calculated E-values for all estimates. The observed effect on sense of belonging (E-value = 2.4) would require an unmeasured confounder associated with both religious service and belonging by a risk ratio of 2.4 each to explain away the result."
Methods section template
A complete methods section following the nine-step workflow should include:
- Treatment definition: what the exposure is, how it is coded, and the contrast of interest.
- Outcome definition: measures used, timing of assessment, any transformations applied.
- Target population: sampling frame, weighting strategy, eligibility criteria.
- Causal identification: covariates conditioned on, justification for conditional exchangeability.
- Statistical analysis: estimation method, key tuning parameters (e.g., number of trees, minimum node size).
- Attrition handling: censoring weights, stages of dropout addressed.
- Heterogeneity assessment: RATE metrics, false discovery rate correction, policy tree depth.
- Sensitivity analysis: E-values for all primary estimates.
Reporting checklist
Do report
- Both standardised and data-scale effects
- Effect sizes with confidence intervals
- Sample sizes after exclusions and weighting
- E-values for sensitivity analysis
- Clear practical interpretation of effect magnitudes
- Subgroup sizes and effect magnifications
- Target trial framework and causal question
- Explicit treatment and outcome definitions
Do not report
- Model coefficients without interpretation
- p-values alone without effect sizes
- Only standardised effects for policy-relevant outcomes
- Technical details that obscure main findings
- Causal claims beyond your identification strategy
Figure presentation
ATE plots
- Use forest plots with confidence intervals.
- Order by effect magnitude or E-value.
- Include sample sizes.
Policy tree plots
- Show decision rules clearly.
- Include sample proportions in each node.
- Provide plain-language interpretation alongside the tree.