I wrote this for my another purpose, and thought that I might as well make it public.
For a treatment-control contrast, let’s examine the regression
model
with and being a 0/1 indicator variable
for treatment (1) vs control (0). Assume that the proportion of
treated units is . Now, since the OLS estimate is consistent (randomization), the limit of can be calculated to be
Expressing the treatment effect in standardized form (Glass ),
we can write , and then we have
If we also assume that the treatment and control arms are equally large (, which gives us the largest
possible given the treatment effect), we get and
We can draw this for different values of :

We see that even for treatment effect sizes that are quite respectable, the amount of explained variance is quite limited. For a effect, .