Is this right? The current best TTS from OpenAI uses gpt-4o-audio-preview which ...

pzo · 2025-03-20T18:34:27 1742495667

you can compare TTS pricing here: https://artificialanalysis.ai/text-to-speech

Previous offering from OpenAI was $15 for TTS and $30 for TTS HD so not 5x reduction. This one is slighly cheaper but definitely more capable (if you need control vibe)

fixprix · 2025-03-20T18:50:40 1742496640

That's a really cool page thanks. Does it have stats for other languages?

In my experience the OpenAI TTS APIs were really bad, messing up all the time in foreign languages. Practically unusable for my use case. You'd have to use the gpt-4o-audio-preview to get anything close to passable, but it was expensive. Which is why I'm using Google TTS which is very fast, high quality, and provides first class support for almost every language.

I look forward to comparing it with this model, the price being the same is unfortunate as there's less incentive to switch. The transcribe price is cheaper than Google it looks like so that's worth considering.

pzo · 2025-03-20T19:14:41 1742498081

Interesting for me Open TTS for Polish was better than Google TTS (but they have few options) - which one did you used? WaveNet?

Sadly haven't seen quality evaluation for TTS for foreign languages

fixprix · 2025-03-20T20:36:17 1742502977

Depends on what's available for the language, but yea Wavenet and Neural2. With OpenAI TTS I'd often get weird bugs where the first API call comes back all garbled, but the second API call comes back fine. Wasting money. On top of that more expensive and higher latency. I'm interested to try out this new one.