This isn't a new problem. The 80-bit internal precision of the 8087 FPU has always been a mismatch for the 64-bit IEEE double representation, even before SSE registers (which have no 80-bit mode) complicated things. x87 code has always been able to produce different results for the same source code, depending on when/whether/which intermediate results get spilled to memory. The Motorola 68k FPU had the same issue with internal precision higher than external, IIRC.
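A minimal sketch of the effect (adapted from the well-known x87-vs-SSE overflow example; the exact output depends on your compiler, flags, and optimization level -- try gcc -m32 -mfpmath=387 -O2 versus -mfpmath=sse -msse2):

    #include <stdio.h>

    int main(void) {
        volatile double v = 1e308;   /* volatile blocks constant folding */
        double x = (v * v) / v;
        /* The 80-bit format also has a wider exponent range, so v*v can
           stay finite in an x87 register and x comes out as 1e308.  If
           the intermediate goes through SSE, or is spilled and rounded
           to a 64-bit double, v*v overflows to +inf and so does x. */
        printf("%g\n", x);
        return 0;
    }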
This isn't a bug. Both code paths produce results of the highest representable precision of the hardware in question. It's just that there are multiple hardware units capable of giving you the answer, and Apple's toolchain picks a different one than whatever Tom is using elsewhere.
And as has been pointed out in FooBarWidget's comment -- any code that relies on bit-precise results from floating-point computation is almost certainly concealing precision bugs anyway. That's not the right way to architect floating-point code.
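For instance, a tolerance-based comparison sidesteps the whole class of last-bit differences (a sketch -- nearly_equal and its tolerance are illustrative choices, not a standard API):

    #include <float.h>
    #include <math.h>
    #include <stdbool.h>
    #include <stdio.h>

    /* Treat two doubles as equal if they agree to within a relative
       tolerance, with an absolute floor near zero, instead of
       comparing bit patterns with ==.  Pick a tolerance that matches
       the real accuracy of the computation. */
    static bool nearly_equal(double x, double y) {
        const double rel_tol = 1e-12;   /* assumed tolerance */
        double diff = fabs(x - y);
        return diff <= rel_tol * fmax(fabs(x), fabs(y)) || diff < DBL_MIN;
    }

    int main(void) {
        double a = 0.1 + 0.2;
        printf("%d\n", a == 0.3);             /* 0: bit-exact compare fails */
        printf("%d\n", nearly_equal(a, 0.3)); /* 1: tolerant compare passes */
        return 0;
    }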
For people unfamiliar with this, "spilled to memory" means rounded to 64 bits (from 80).
x87 does floating-point math at 80 bits, in registers. But double variables are stored in memory as 64 bits, so the results get rounded when they're stored.
The problem is when that rounding happens. And that can vary depending on whether a register is needed for something else, the optimization level, the order of the code, etc.
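A small sketch of how the spill decision becomes visible (assumes a 32-bit x87 build; what it prints depends on compiler and flags):

    #include <stdio.h>

    int main(void) {
        volatile double big = 1e16;     /* volatile defeats constant folding */
        volatile double small = 2.9999;
        double sum = big + small;
        /* A double's ULP at 1e16 is 2.0, so rounding the sum to 64 bits
           gives exactly 1e16 + 2, while an 80-bit register would still
           hold roughly 1e16 + 2.9999.  Which one 'sum' is here depends
           on whether the compiler spilled it; gcc on x87 historically
           needed -ffloat-store or -fexcess-precision=standard to force
           the rounding that the C standard requires at assignment. */
        printf("%.4f\n", sum - big);    /* 2.0000 if rounded, ~2.9999 if not */
        return 0;
    }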