AI inference costs dropped up to 10x on Nvidia's Blackwell

(venturebeat.com)

6 points | by CrankyBear 13 hours ago ago

3 comments

  • simianwords 12 hours ago

    > Sully.ai cut healthcare AI inference costs by 90% (a 10x reduction) while improving response times 65% by switching from proprietary models to open-source models running on Baseten's Blackwell-powered platform, according to Nvidia. The company returned over 30 million minutes to physicians by automating medical coding and note-taking tasks that previously required manual data entry.

    Are the margins that low that it would make sense to give up on quality of output and use open source models?

    • mountainriver 11 hours ago

      Interviewed at Sully, absolutely insane company, I’m sure they aren’t making a rational decision here

  • 12 hours ago
    [deleted]