Show HN: Voice-Pro – AI Voice Cloning Magic: Transform Any Voice in 15 Seconds

(github.com)

48 points | by abuskorea 3 hours ago

25 comments

shannifin 36 minutes ago

I don't have much real use for celebrity voices (other than fun experimentation), but I'd love to be able to clone my own voice and character voices for the purposes of creating audiobooks / audioplays without having to pay monthly fees with monthly usage limits. So I'm excited by this sort of project!

P.S. Are there any tools for synthetic voice creation? Maybe melding two or more voices together, or just exploring latent space? Would be fun for character creation to create completely new voices.
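For what it's worth, one common way to think about "melding" voices is to blend the speaker-embedding vectors that many TTS models condition on. The sketch below is only an illustration under that assumption: the function names and the toy 4-dimensional vectors are hypothetical, and nothing here is taken from Voice-Pro or any specific library.

```python
import math

def blend_voices(a, b, t):
    """Linearly interpolate two speaker embeddings (hypothetical helper).

    t=0.0 returns voice a, t=1.0 returns voice b, values in between mix them.
    """
    if len(a) != len(b):
        raise ValueError("embeddings must have the same length")
    return [(1.0 - t) * x + t * y for x, y in zip(a, b)]

def normalize(v):
    """Rescale a vector to unit length, since some models expect normalized embeddings."""
    norm = math.sqrt(sum(x * x for x in v))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in v]

# Toy embeddings; real ones typically have hundreds of dimensions
# and would come from a model's speaker encoder.
voice_a = [1.0, 0.0, 0.0, 0.0]
voice_b = [0.0, 1.0, 0.0, 0.0]

# A "new" voice halfway between the two.
halfway = normalize(blend_voices(voice_a, voice_b, 0.5))
```

Sweeping `t` from 0 to 1 would be a crude way to explore the space between two voices; whether the result sounds natural depends entirely on the model.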

dyauspitr 33 minutes ago

I’ve used Tortoise TTS before and trained it on my voice and a mix of voices. It’s not perfect but still impressive.

joshdavham 15 minutes ago

Looks cool! Also, is there a reason you went with a Web-UI instead of making a native desktop app?

safeimp an hour ago

Project looks interesting. Are there short-term plans to support macOS?

If not, any recommendations for alternative projects?

muglug 2 hours ago

These tools make it very easy to scam vulnerable people, and have pretty limited use otherwise.

chefandy an hour ago

To be fair, they’ve got pretty serious potential for letting tech companies get paid for a seasoned voice actor’s unique delivery, tone, inflection, etc., rather than the voice actor themselves.

Larrikin an hour ago

I'm absolutely using celebrity voices for my Home Assistant voice. Amazon has spent the last couple years removing the voices for Alexa that people had paid for.

chefandy an hour ago

Gen AI space to everyone else: “Your computer scientists were so preoccupied with whether or not they should, they didn’t stop to think if they could just do it anyway”

ranger_danger an hour ago

How many victims will it take for lawmakers to do something about this?

tiborsaas 30 minutes ago

It's already illegal to scam somebody. While it's always positive to protect people more, what can be done here? Any alternative I can imagine is massively oppressive of the current state of the software industry.

You can regulate large companies, you can regulate published software sold for profit, but it's impossible to regulate free and open source tools.

You essentially have to regulate access to computing power if you want to prevent bad actors from doing bad things using these sorts of tools.

russell_h 30 minutes ago

Serious question: what do you think lawmakers should do?

tsujamin 2 hours ago

Bulldozing grandma is just the cost of technological progress /s

uh_uh an hour ago

This tech is going to be ubiquitous; it's just too easy to distribute. Grandma had better start adapting now.

chefandy an hour ago

You can’t adapt around brain age making it more difficult to distinguish truth from lies.

thejazzman an hour ago

Because people make it so, not because the natural order of the world gets us there.

For some reason, "because we can" validates that we should. Any jackass has the power of a research team of PhDs. It's kinda weird.

chefandy an hour ago

Indeed. Humans ascended to dominance because we can cooperate. This every-man-for-themself idea is an aberration, not the natural order as so many claim. It’s rather astounding to think otherwise considering the logistics of how we’re communicating right now.

uh_uh an hour ago

Cooperation works if the potential damage caused by a rogue actor is sufficiently low. Otherwise, it's too easy to sabotage things. This is why we don't want random rogue states to have nukes. AI will give so much leverage to rogue actors that it will significantly shift the game theory in favour of not cooperating.

uh_uh an hour ago

Demanding responsible behaviour from everybody is not going to work. Some people don't care about negative externalities that much, and it's enough if only a few of them decide not to play ball. So either grandma needs to adapt, which will upset some people, or distributing the tech should be regulated/prosecuted, which will upset another group of people.

jncfhnb 2 hours ago

Is there speech-to-speech? I have been hoping for a model I can use to do voice acting with inflection.

amrrs 2 hours ago

Do you mean Inflection's Pi?

yawnxyz an hour ago

> When Windows Defender mistakenly recognizes a [virus] as a Trojan, this is often called a 'False Positive'. To solve this problem, you can go through the following steps:

kfarr an hour ago

Yeah, I also noticed the install instructions are to run this batch file that gets administrator access and starts downloading things…

gruez an hour ago

It's not any worse than all the projects on GitHub with "easy" install instructions of "curl ... | sudo sh". Heck, even an innocent "sudo make install" command can easily contain a malicious payload.

chefandy an hour ago

Yeah, it’s not great but it’s definitely not unusual. And Windows reputation-based execution blocking does have false positives. I work for a company that has some very, very popular products and some that only see a few dozen downloads per week, and despite being signed, it still takes a while for new versions to build enough rep to not trigger the block.

ilrwbwrkhv an hour ago

There are a bunch of YC startups that are building new models and stuff in the space. I fear they are going to get decimated really soon as the quality of local llamas keeps improving.