Omnilingual ASR: Advancing automatic speech recognition for 1600 languages

(ai.meta.com)

27 points | by jean- 3 hours ago ago

6 comments

This seems like a massive improvement for openly available local ASR. Even the 300M model outperforms whisper-large-v3 according to the paper's benchmarks.

samat 38 minutes ago

How hard is it to make TTS out of this? A few independent journalists from Belarus asked for TTS in their language, but I am no expert, was thinking about re-using Mozilla's work. What's the easiest way to get working TTS for a language?

[-]

kulahan 15 minutes ago

From TFA, it says that it’s extremely easy to add new languages with just a few examples. I didn’t see specifics on how “few” it really is, though.

tschellenbach an hour ago

any insights on latency?

meetpateltech 3 hours ago

HF Demo: https://huggingface.co/spaces/facebook/omniasr-transcription...

GitHub: https://github.com/facebookresearch/omnilingual-asr

[-]

dang an hour ago

Thanks! I've added those links to the toptext as well.