Causal Inference - The Basics
\[ \def\BB#1{{\mathbb{#1}}} \def\BF#1{{\mathbf{#1}}} \]
yedlu, Spring 2025
Regression
We begin with cross-sectional analysis (i.e. data on certain observables in one given period).
The Linear Model
Suppose we want to study the effect of education on wages (very cliche but classic). Let’s denote education level as \(x\) and annual wages as \(y\). Then, we need to think about the following questions before moving forward:
We aim to provide simple yet intuitive answers in this section. Let’s start with an extremely simple linear function (or model) that we expected to be true for the whole population:
If we were to assume this to be true for the whole population, our goal of the empirical work is then to use a sample (for most cases we can only access information on a subset of the population) to estimate these aforementioned values. (It is impossible to observe the parameters in real life, we can only try to estimate them.)