6 comments

  • stuffoverflow 23 minutes ago

    This seems like a massive improvement for openly available local ASR. Even the 300M model outperforms whisper-large-v3 according to the paper's benchmarks.

  • samat 38 minutes ago

    How hard would it be to build TTS out of this? A few independent journalists from Belarus asked me for TTS in their language. I am no expert, so I was thinking about reusing Mozilla's work. What's the easiest way to get working TTS for a language?

    • kulahan 15 minutes ago

      TFA says it’s extremely easy to add new languages with just a few examples. I didn’t see specifics on how “few” it really is, though.

  • tschellenbach an hour ago

    Any insights on latency?

  • meetpateltech 3 hours ago

    • dang an hour ago

      Thanks! I've added those links to the toptext as well.