0.4 mA quiescent current is huge for very low-power designs though. If your system must spend the majority of its time consuming very little power, that level of constant current draw is a non-starter.
TPS7A20 is 6.5uA, $0.10/ku. TPS7A05 is 1uA, $0.19/ku. It's not that hard to do better than 400uA!
And if you don't mind coughing up when it counts... TPS7A02 is 25nA, $0.45/ku. Triple the price, but more than four orders of magnitude better leakage.
If money is no object, there are some interesting chips out there. The LTC3335 chip[0] is a buck-boost that claims to have a 680nA Iq. Not as low as the TPS7A02, but it looks like it could make up for that in efficiency if your design does anything other than sleep.
Maybe 300mA order of magnitude is more common than 3A, but it's not inconceivable. Lots of designs wake up, burst some RF data at a few hundred mA transceiver load for a few tens of microseconds, then go back to sleep for another few seconds. There are some long-interval scientific instruments that need high-speed ADCs or DACs on battery power in remote locations, which might get into higher current than a few hundred mA. Some deep space crafts can sleep at very low power and run their transmitters at comparable currents. It's very situational.
Older cell phone standards used to burst to about 2.3A upon initial connection to a tower before the tower told the handset to dial it back. I don't know about modern cell phone standards.