Data Exercise
Psychology starts with a question about how people think or behave.
We know that bilingual children tend to perform better on various cognitive tasks. Why might this be the case?
How can we know whether it is bilingualism that causes better performance on various cognitive tasks?
“we may define a cause to be an object followed by another, and where all the objects, similar to the first, are followed by objects similar to the second [definition 1]. Or, in other words, where, if the first object had not been, the second never would have existed [definition 2].” - David Hume, Enquiries Concerning Human Understanding, and Concerning the Principles of Morals, Section VII
Y_i^{a = 1}: The cognitive ability of child i if they were bilingual. This is the counterfactual outcome when A = 1.
Y_i^{a = 0}: The cognitive ability of child i if they were monolingual. This is the counterfactual outcome when A = 0.
\text{Causal Effect}_i = Y^{a = 1} - Y^{a = 0}
We say there is a causal effect for individual i if:
Y_i^{a=1} - Y_i^{a=0} \neq 0
Robert Frost writes,
Two roads diverged in a yellow wood,
And sorry I could not travel both
And be one traveler, long I stood
And looked down one as far as I could
To where it bent in the undergrowth;
Then took the other, as just as fair,
And having perhaps the better claim,
Because it was grassy and wanted wear;
Though as for that the passing there
Had worn them really about the same,
And both that morning equally lay
In leaves no step had trodden black.
Oh, I kept the first for another day!
Yet knowing how way leads on to way,
I doubted if I should ever come back.
I shall be telling this with a sigh
Somewhere ages and ages hence:
Two roads diverged in a wood, and I—
I took the one less traveled by,
And that has made all the difference.
Robert Frost, The Road Not Taken
Recall the answers you proposed to bilingual causal question
How does this work?
\begin{align} E(\delta) = E(Y^{a=1} - Y^{a=0})\\ ~ = E(Y^{a=1}) - E(Y^{a=0}) \\ ~ = ATE \end{align}
The values of exposure under comparisons correspond to well-defined interventions that, in turn, correspond to the versions of treatment in the data.
\begin{equation} Y^{obs} = AY^{a=1} + (1-A)Y^{a=0} \end{equation}
For individuals with exposure level A = 1:
\begin{equation} \begin{split} (Y^{obs}|A = 1) &= 1 \times A \times Y^{a=1} + (1-1) \times Y^{a=0}\\ & = 1 \times Y^{a=1} + 0 \times Y^{a=0} \\ & = Y^{a=1} \end{split} \end{equation}
For individuals with exposure level A = 0:
\begin{equation} \begin{split} (Y^{obs}|A = 0) &= 0 \times A \times Y^{a=1} + (1-0) \times Y^{a=0}\\ & = 0 \times Y^{a=1} + 1 \times Y^{a=0} \\ & = Y^{a=0} \end{split} \end{equation}
Which implies:
\begin{equation} \begin{split} Y_i &= Y_i^{a=1}~~~\text{if}~ A_i = 1\\ Y_i &= Y_i^{a=0}~~~ \text{if}~ A_i = 0 \end{split} \end{equation}
The probability of receiving every value of the exposure within all strata of co-variates is greater than zero
\begin{equation} 0 < \Pr(A=a|L)<1, ~ \forall a \in A, ~ \forall a \in L \end{equation}
The conditional probability of receiving every value of an exposure level, though not decided by the investigators, depends only on the measured covariates
def: \coprod means “independent of”, a|b translates to “a conditional on b”
\begin{equation} Y^{a=1},Y^{a=0}\coprod A|L \end{equation}
or equivalently
\begin{equation} A \coprod Y^{a=1},Y^{a=0}|L \end{equation}
Where L is the set of co-variates sufficient to ensure the independence of the counterfactual outcomes and the exposure.
Where L is observed:
\begin{aligned} ATE = E[Y^{a=1}|L = l] - E[Y^{a=0}|L = l] ~ \text{for any value}~l \end{aligned}
Psychology starts with a question. “Does A cause Y?”
Directed Acyclic Graphs (DAGs) help visualize sources of bias.
There are five main sources of bias:
Important Note 1: In observational studies, it’s impossible to guarantee complete control for confounding. Always conduct sensitivity analyses. Techniques for sensitivity analyses will be discussed next week.
Important Note 2: Methods for computing causal effects for group comparisons will be covered in the following week’s lecture.