It's funny: although I have published peer-reviewed papers on the evaluation of ML models and could probably discuss the nuances of the process with you if you woke me at 3 AM, I always find it difficult to remember which of [Type I, Type II] is the false positive and which is the false negative. If I have this problem, I suppose almost everyone does.
I wish people would stop using [Type I, Type II] when clearly superior terminology is available. It feels very similar to amateur software engineers using non-descriptive variable names.
This is exactly my experience. I have never been able to keep straight which type the error I was talking about was...
Imagine calling (for instance) "type I" the groups which are abelian and "type II" those which are not... and then not being a professional mathematician. So, is xy = yx here? Mmmhhh, type I says yes... or was it no?
Specificity, because "how specific is the test? Does it pick out the true positives without getting a whole heap of false positives as well?" You might ask someone to be "more specific" about something so they aren't including irrelevant things in their discussion.
Sensitivity, because "how sensitive is the test? Does it detect the needle in the haystack you need to find? How many does it miss?" In common English you might complain that your car brakes are too sensitive: they are too quick to register the pressure from your foot.
Don't forget these are actually strict mathematical concepts. Hope this helps clarify :)
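Concretely, since they are strict mathematical concepts, here is a minimal sketch in Python (the function and variable names are mine, not from any particular library):

    def sensitivity(tp, fn):
        # Of all actual positives, what fraction did the test catch?
        # Also known as recall, or the true positive rate.
        return tp / (tp + fn)

    def specificity(tn, fp):
        # Of all actual negatives, what fraction did the test correctly
        # leave alone? (The true negative rate.)
        return tn / (tn + fp)

    # Example: a test that catches 90 of 100 real positives and wrongly
    # flags 50 of 900 real negatives:
    print(sensitivity(tp=90, fn=10))   # 0.9
    print(specificity(tn=850, fp=50))  # ~0.944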
I think the issue people have with these names is that their English meanings (as you described them) make sense when the positive class is much less prevalent than the negative class. If the two are equally probable (or worse, if it's the other way around), those everyday meanings quickly stop matching the situation.
"Abelian" is at least a fresh new concept to hang your own associations off of, with no previous interference, and without interference from the similarly-named "Adelian" groups or something equally stupid.
The problem isn't just that the term is something you've never heard before, but that "I" and "II" are not a very good concept to hang them off of. This is relevant to software engineering naming too: in general, you should not use naming schemes that imply properties that don't actually exist in your values. I and II have all sorts of properties that don't apply to the terms in question; most notably, they have an order. But which is "first", false positives or false negatives? They don't have a natural order, so using numbers to name them just gets in the way.
(Especially when there are perfectly serviceable words.)
Math jargon isn't perfect by any means. But it does at least avoid naming things by sheer numbers most of the time, unless there isn't really a choice because it needs a few hundred names right now.
(Similarly, pop quiz: in Kahneman's classification scheme, is System 1 the fast or the slow system? Odds are, even if you get that right, it's because the book title "Thinking, Fast and Slow" stuck with you and the two happen to be listed in order. It probably isn't because you remember them by number.)
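To put the naming point in code, here's a hypothetical sketch (both enums are made up for illustration; neither is from any real codebase):

    from enum import Enum, auto

    # Naming by number: the reader has to memorize which is which.
    class ErrorKindOpaque(Enum):
        TYPE_I = auto()
        TYPE_II = auto()

    # Naming by property: the call site explains itself.
    class ErrorKind(Enum):
        FALSE_POSITIVE = auto()  # flagged something that wasn't there
        FALSE_NEGATIVE = auto()  # missed something that was there

    # Compare reading these two lines six months from now:
    #   if err is ErrorKindOpaque.TYPE_II: ...
    #   if err is ErrorKind.FALSE_NEGATIVE: ...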
> Imagine mathematicians calling commutative groups abelian? How do you remember if xy=yx there?
Actually, even if you ignore jerf's response, this is different in an important way from the "Type I" / "Type II" terminology.
Group in which the group operation is commutative: "Abelian group".
Group with no guarantees except the group axioms: "group".
The special one is marked and the non-special one is unmarked. In contrast, the designations "Type I" and "Type II" are parallel; it's not at all obvious which one is the default and which one deviates from the default.
Maybe I can help you with that ... the mnemonic I developed is:
Type I error, with probability often denoted by α (alpha), which is the first (I) letter of the alphabet. AL-PH-A stands for ALlegedly PHalse Alarm. Or just fALse-Positive-HA!
(Yes, it's stupid and weird, but it has been helping me remember it for nearly three decades now.)
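For reference, the standard definitions the mnemonic encodes (with H0 the null hypothesis; these are textbook facts, nothing specific to this thread):

    P(Type I error)  = P(reject H0 | H0 true)          = α  (false positive)
    P(Type II error) = P(fail to reject H0 | H0 false) = β  (false negative)
    Power            = 1 − β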
As another commenter noted, the story of the Boy Who Cried Wolf is an easy mnemonic: the villagers committed a series of Type I errors (false alarms), and then a Type II error (a miss).
They are certainly better than Type I and Type II, but they are still potentially ambiguous (at least to a non-native speaker). What makes a "false positive" false? Is it called a false positive because it is actually a negative, or is it called a false positive because it is a positive for which you made the error of calling it negative?
That much is fine -- it's ambiguous if you don't know the general idea in the first place -- it's true of most things. The problem with type 1/2 is that it's so utterly devoid of memory hooks that even if you recognize it, and know the idea, you can't confidently identify which is which.
> or is it called a false positive because it is a positive for which you made the error of calling it negative?
Not to pick on the non-nativeness of the problem here, but that's not really a way you can use "false". I'd be a lot more comfortable calling an underlying positive that tested negative an "unidentified positive" or a "misdiagnosed positive" [this one really is ambiguous in exactly the way you suggest] or anything else that suggested that the positive was there and an error occurred in noticing it, as opposed to suggesting that the positive wasn't there in the first place.
So, it's a fair complaint for non-native speakers, but you just can't choose all your terminology to meet their needs. :/