based on a few initial tests GPT-4.5 is abysmal. I find the prose more sterile than previous models and far from having the spark of DeepSeek, and it utterly choked on / mangled some python code (~200 LoC and 120 LoC tests) that o3-mini-high and grok-3 do very well on.