POLSCI 4SS3
Winter 2024
Decide if you will sign up for final project by April 4
Instructor traveling April 3-7
Data science, computer science, statistics
Computational/quantitative social science
Econometrics
Evidence-informed policy
Public administration
Business, marketing
Data strategy
|
||
---|---|---|
Inquiry | Observational | Experimental |
Descriptive | Sample survey | List experiment |
Causal | Quasi-experiment | Survey/field experiment |
Data strategy
|
||
---|---|---|
Inquiry | Observational | Experimental |
Descriptive | Sample survey | List experiment |
Causal | Quasi-experiment | Survey/field experiment |
Instead of \(Z\) causing \(Y\), \(Y\) causes \(Z\)
Simultaneity: \(Z\) causes \(Y\) and vice versa
Instead of \(Z\) causing \(Y\), \(Y\) causes \(Z\)
Simultaneity: \(Z\) causes \(Y\) and vice versa
Example
Students who are likely to participate enroll in Political Science courses more often
Example
We believe that more education increases income
But having smart parents increases both education and income
Individuals sort into condition \(Z\) in a manner that predicts outcome \(Y\)
Treatment and control are not comparable
Individuals sort into condition \(Z\) in a manner that predicts outcome \(Y\)
Treatment and control are not comparable
Example
Random assignment avoids this in expectation
Hard to overcome with observational causal data strategies
Need to pretend that we can analyze data as if it was an experiment
Answer strategies that produce data as-if they were drawn from an experiment
Natural experiment: Random assignment outside of the researcher control
Example: Choosing municipalities at random for auditing
Quasi-experiment: Conditions are assigned in a manner that is sufficiently orthogonal to potential outcomes
Score (running variable)
Cutoff (threshold)
Treatment (at least two conditions)
Local randomization
Continuity-based
Potential outcomes are not random because they depend on the score (and other things)
However, around the cutoff, treatment assignment is as good as random
Example: Barely winning an election
So we can pretend we have an experiment within a bandwidth around the cutoff
Treatment assignment is deterministic at the cutoff
Example: Financial aid if income below a threshold
But usually too few or no units at the cutoff
Task: Approximate the gap at the cutoff as best as possible
This becomes a line drawing problem
At least two groups or conditions (treatment,control)
At least two time periods (pre- and post-treatment)
Once treated, units stay on
We accept that selection bias is unavoidable
But comparing before-after changes between groups allows us to calculate treatment effect
Timing
|
||
---|---|---|
Group | Before | After |
Treatment | A | B |
Control | C | D |
\[ \widehat{ATE} = [\text{Mean}(B) - \text{Mean}(A)] - [\text{Mean}(D) - \text{Mean}(C)] \]
Timing
|
||
---|---|---|
Group | Before | After |
Treatment | A | B |
Control | C | D |
\[ \widehat{ATE} = \underbrace{[\text{Mean}(B) - \text{Mean}(A)]}_\text{Difference} - \underbrace{[\text{Mean}(D) - \text{Mean}(C)]}_\text{Difference} \]
Timing
|
||
---|---|---|
Group | Before | After |
Treatment | A | B |
Control | C | D |
\[ \widehat{ATE} = \underbrace{\underbrace{[\text{Mean}(B) - \text{Mean}(A)]}_\text{Difference} - \underbrace{[\text{Mean}(D) - \text{Mean}(C)]}_\text{Difference}}_\text{Difference in differences} \]