Does anyone benchmark these models for text-to-speech using traditional word-err... | Hacker News

Hacker Timesnew | past | comments | ask | show | jobs | submit

ks2048 7 months ago | parent | context | favorite | on: Trying out Gemini 3 Pro with audio transcription a...

Does anyone benchmark these models for text-to-speech using traditional word-error-rates? It seems audio-input Gemini is a lot cheaper than Google Speech-to-text.

simonw 7 months ago [–]

Here's one: https://voicewriter.io/speech-recognition-leaderboard

"Real-World Speech-to-text API Leaderboard" - it includes scores for Gemini 2.5 Pro and Flash.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact