Freakishly Good AI Voice Cloning is Now Open & Free...

6 месяцев назад

143,722 Просмотров

Комментарии:

@MattVidPro - 04.01.2024 01:42

IMPORTANT NOTE PLS READ The model on google colab is only using half performance due to restrictions of the free google colab. A Local install on a more powerful GPU should produce better results!!!!

Ответить

@Liminal-Mystic - 05.01.2024 04:23

It doesn't seem better than what I have tried so far, but it's pretty good. If it's really free to use, I will give it a try.

Ответить

@CaptTerrific - 05.01.2024 03:24

Guess I can cancel my ElevenLabs API subscription :D

Ответить

@fablegarmeth6897 - 05.01.2024 03:14

can anyone point me twords a local install tatorial?

Ответить

@mystrreos - 05.01.2024 02:52

Are fans of this channel called MVP's?

Ответить

@EARN_750_PERDAY_FROM_HOME - 05.01.2024 00:30

"Trust because you are willing to accept the risk, not because it's safe or certain." _Motivation

Ответить

@rubabSEOagency - 05.01.2024 00:08

Grow your video for my subtitle services in very cheap price so contact me

Ответить

@Sarkkoth - 05.01.2024 00:07

It's very impressive. Unfortunately in the fine details at the bottom it says it's not available for commercial use.

Ответить

@denisblack9897 - 04.01.2024 23:43

Dude, demos are very exaggerated. Wake up

Ответить

@rajnishmishra1666 - 04.01.2024 23:16

It. It now working on Google colab voice not changing can any one help

Ответить

@MoopEPoom0 - 04.01.2024 22:32

How do I do this? Do I have to have any computer programming knowledge to do it? Or is there a website. This is really cool!

Ответить

@rupertrobinson9028 - 04.01.2024 22:11

A moment of silence for everyone that isn't subscribed to MattVidPro.....always keeping us up2date!!!

Ответить

@aiartrelaxation - 04.01.2024 19:58

Yes its interesting, but that means WE CAN'T TRUST NO SPOKEN WORDS! !!!

Ответить

@HikingWithCooper - 04.01.2024 19:35

Morgan Freeman is going to be narrating a lot of short films now.

Ответить

@wadetompkins6560 - 04.01.2024 19:31

Canadian sounded very robotic.

Ответить

@paultoensing3126 - 04.01.2024 19:28

I have hours of recording of deceased relatives. Hopefully a longer training set or data set will yield much more fidelity with a lot more nuance, because it’s only greater sampling that you really get to sense the idiosyncrasies of a person. When will that be available?

Ответить

@robertmaxwellcole - 04.01.2024 18:49

Great post. Agreed. Ai models would greatly benefit from open sourcing more.

Ответить

@wartem - 04.01.2024 18:31

Does it work using AMD GPU and Windows?

Ответить

@fabioalexandrino2244 - 04.01.2024 18:27

It's so bad... This is not in a good level

Ответить

@itsfadixx - 04.01.2024 18:16

ok hear me out
they should use this as a way to voice dva sfm's 🔥

Ответить

@bloodust7356 - 04.01.2024 17:56

The french DVa had a little accent but that was pretty good. And usually french is bad so that's nice to hear a good generation

Ответить

@devm585 - 04.01.2024 17:43

it's just about time and the best free things come

Ответить

@poly06033 - 04.01.2024 17:35

Coqui ❤

Ответить

@MrAmack2u - 04.01.2024 17:10

the demo was a catfish lol

Ответить

@tommyapocalypse6096 - 04.01.2024 16:51

How would this work for musicians who are not singers, but have their own original music that they need vocals for? Would this work like something along the lines of Emvoice..?

Ответить

@ILIKECRYO - 04.01.2024 16:49

Important to note, that it is "Source Available" and not "Open Source" (As per their License, one can not use it commercially).

Ответить

@Marquis-Sade - 04.01.2024 16:18

How can you say this is freakishly good if you admit that the Demo is way better than the product itself?

Ответить

@marco114 - 04.01.2024 16:15

This is crazzzzyyy good.

Ответить

@scottmiller2591 - 04.01.2024 15:51

"Read me a bedtime story, Elon. Do all the voices."

Ответить

@Felipe-zl1rj - 04.01.2024 15:43

I've tested the colab one, and it was quiet terrible. Nothing like my voice at all. Hope the full version is more usable.

Ответить

@ashwinsveta - 04.01.2024 15:32

Omg 😅this is so cool!! Thanks for sharing this with us ❤

Ответить

@seraphin01 - 04.01.2024 15:17

Would love to see this ran locally on gpu without the limitations though.. I can't say it's the best free open source voice cloning.. Not by a long shot, but it's pretty good and it can do text2voice unlike Okada though..
I'm gonna check when I have the time how to install this locally, I could use it along Okada for different purposes

Ответить

@bigglyguy8429 - 04.01.2024 14:48

Yay, a pile of code and other vomit on github, which no normal people can use, yay! Awesome. Next?

Ответить

@wormemc - 04.01.2024 14:39

I'm sure the cloning of political people is why Obama didn't work well

Ответить

@spadaacca - 04.01.2024 14:30

I wonder if you run this by Adobe Podcast afterwards, it'll sound more natural!

Ответить

@maxfxgr - 04.01.2024 14:07

the 3,2,1 was chromie from Warcraft :D , great video Matt !

Ответить

@regio-report - 04.01.2024 13:53

Hi Matt! Again a very interesting video. But how can I use the this "jupyter notebook", because I need german language. Cheers

Ответить

@beardordie5308 - 04.01.2024 13:38

Non commercial license 😞

Ответить

@grahamatkinson9851 - 04.01.2024 13:00

One of the big trends in AI at the moment are examples that are cherry-picked, and cannot easily be replicated. Maybe a new, more believable, standard could be for AI companies to ask an outside agency to produce the examples of their products.

Ответить

@konstantinlozev2272 - 04.01.2024 12:56

It did not do very well with your voice...

Ответить

@setop123 - 04.01.2024 12:56

Really interesting, I compared it locally to CoquiTTS and it seems that it's way better for some voices, and way worse for others. On average it's on par with it but in a very complementary way. I suppose the way it works is they start from a reference voice and modify it to reduce the difference gap between it and the input so, if their reference voice is already pretty close it works wonder, but the opposite is also true.

Ответить

@konstantinlozev2272 - 04.01.2024 12:49

Anyone getting vibes of Terminator ability to clone voices of his victims?

Ответить

@pierre-samuelgreau-hamard6379 - 04.01.2024 12:49

As a french speaker, I can say that the french speaking is a little robotic, like someone unnaturally exaggerating articulation to avoid any sound blending between words.

Ответить

@harryireland5363 - 04.01.2024 12:18

its no better than any other free open source voice cloner ie total crap!

Ответить

@metanulski - 04.01.2024 12:17

German is ok, but has a american/strange accent

Ответить

@blacknubiantv - 04.01.2024 12:11

The license is Creative Commons Attribution-NonCommercial 4.0, so the use case for this code is limited.

Ответить

@64jcl - 04.01.2024 11:59

When they use the idiot Musk to demonstrate their products its simply a no-no for me. Why are people sucking up to that douchebag still? Also the thing doesn't sound anything like you so total fail.

Ответить

@FlowerMoon5 - 04.01.2024 11:56

Van use it for audiobookd

Ответить

@ThatGuy-yc9yc - 04.01.2024 11:49

South African here, and yeah its pretty good! Not perfect, but almost there, first time I heard a SA accent with AI LOL

Ответить

@fabiano919 - 04.01.2024 11:41

Hey, Matt. In my experience as a seasoned prompt engineer, the crux of the issue lies in the misalignment of NPUs during tensor randomizations in the preliminary tests. It's akin to attempting vector factoring with different tools – without a standardized NPU, the subtle nuances crucial for optimal voice cloning get lost in the mix. Achieving consistency in NPU specifications is paramount to unlock the full potential of this technology, and it's evident why the results in the video may have fallen short of expectations.
I really enjoyed your excitement, though. As always, I'm giving you a thumbs up for your hard work.

Ответить

Сейчас смотрят