No, you aren't the only one. It has become even worse now that Twitter will not render without JavaScript enabled. Unfortunately, I still do not know what Berkson's Paradox is because I will not enable JavaScript for Twitter.
The latter appears when analyzing subgroups gives a different result than analyzing the pooled data.
The former is about correlations that appear in samples which are not representative of the general population, due to the way that those samples are selected.
> The latter appears when analyzing subgroups gives a different result than analyzing the pooled data.
> The former is about correlations that appear in samples which are not representative of the general population, due to the way that those samples are selected.
You just said the same thing twice. Think about it.
For one you used terms like "subgroups" and "pooled data" and for the other "samples" and "general population". Those are the same things.
Then you used "[the effect] appears in" and in the other "correlations". Well, Simpsons paradox can also manifest itself in correlations. So you just said the same thing twice.
Simpson's paradox: analyzing trends per subgroup can give a different result than pooled data.
Berkson's paradox: analyzing a single subgroup selected with a function aggregating two traits (additively?) will indicate an anticorrelation between the traits.
Simpson's paradox says you can't judge group trends from subgroup trends. Berkson's paradox says given a group selected in a specific way, it will have a certain property in itself. They're just different statements.
Berkson's paradox is a special case of Simpson's for the two subgroups selected and non-selected.
The difference is that Berkson's paradox involves selecting the subgroup a posteriori and in a particular way, Simpson's paradox assumes a selection a priori.
I like how you put paradox in quotes. I also annoys me when people call these things paradoxes. They're more properly called counter-intuitive phenomena. I wonder if there's a single-word name for that.
So I actually checked and... turns out you're right.
According to Wikipedia, "paradox" can either mean "logically self-contradictory statement" or a "statement that runs contrary to one's expectation". I always thought that it meant the former only.
These two concepts should really really have separate words.
I think Berkson's paradox is more specific than just correlations arising from non-random sample selection. Correlations that are not representative of the general population could still be useful, if it's a meaningful correlation within some subgroup of interest. The problem is when the features you are correlating relate too closely to the features that were used for sample selection - then you can end up with a trivial result.
I've always learned of Simpson's paradox as relating more to different sample sizes when partitioning data, which can happen entirely arbitrarily - for example a baseball player getting injured part way through the season.
The fact that one player's at bats get partitioned differently than another's is not caused by the on field performance, so there's no "double dipping" going on like I would imagine with Berkson's. Conversely I'm having trouble fitting a Berkson's example into the framework of Simpson's paradox, since there's no reason the poorly-selected subpopulation can't theoretically be exactly half of the general population. And if all of the samples are of equal size Simpson's paradox doesn't exist anymore (because with equal bin sizes the mean of means is equivalent to the overall mean).
Interesting, I only ever heard of Simpson's paradox in the context of comparing overall averages versus subgroup averages.
I guess this paradox could then be thought of as a special case of Simpson's paradox? Since the out group will exclude people with both traits there should also be a negative correlation there, which disappears in the overall population. But in Berkson's case it seems they're implying the subgroup correlation is spurious whereas with Simpson's it could go either way.
> Since the out group will exclude people with both traits there should also be a negative correlation there
Not necessarily. Imagine the traits are distributed uniformly and independently in [-1 1]. There is no correlation:
******
******
******
******
******
******
If you select people with at least one positive trait you will find negative correlation in the group + but the correlation will still be zero in the group -.
In the first one you have a partition in subgroups A and B (or more than two) which show similar correlations, different from the correlation seen in A+B.
In the second one you have only a subgroup A (the implicit complement notA is not observed) where the correlation is not the same as in the (unobserved) full population A+notA. Nothing is said about the correlation in notA. It could be at either side of the correlation in the full population, while in Simpson’s paradox both subgroups are in the same side.
Edit: and I also mention "due to the way that those samples are selected" for Berkson's paradox where the selection is based on the variables of interest while in Simpson's paradox the subgroups are "external" (but influence the correlation between those variables).