Comments:
I think the noise truncation is really useful! It might deserve a video of its own, or a mention in the first data curation video! I am making my own Melina dataset to test with, and the video I use as a reference has a lot of silence after applying UVR to it.
That's why a split dataset is better :)
[W CUDAGuardImpl.h:124] Warning: CUDA warning: out of memory (function destroyEvent)
I got this error. Is running a 1-hour unsplit set too much?
Do I need to split it?
What if you mix both separated and unseparated? Also, how many epochs was this example trained on?
But shouldn't the dataset consist of a vocal file in MONO???
If I choose that a module should have no tone and I train it in the new version of RVC, I can still choose which tone algorithm to use. This means that it still uses RMVPE, i.e. the new version, and the quality is not particularly good either. Hope it gets fixed. Try choosing false in both the old and the new version.
Hi Jarod, thank you so much for your videos!
I still have a question though. I have 12-14 minutes of audio of pure voice. I truncated silence and removed noise, reverb, echo, and sibilance. What should I do next? So you are telling us that simply dividing the audio file into 10-second chunks is not desirable, right? And I should clip the audio into meaningful bits with complete sentences, for which you, by the way, use whisperx?
If so, is whisperx good for, let's say, non-English languages? For example, languages of Central Asia or other less common languages?
Hi, RVC is no longer allowed on free Colab notebooks. What's the alternative?
Dude, couldn't you get a better quality result if the silences in the single-piece file were left in?
I mean, wouldn't you have gotten a better result if you hadn't truncated?
How would you know?
What do you have to say about the Google Colab crash? Many users cannot use Colab anymore, as Google is cracking down on deepfake code and banning IPs.
How do I uninstall the AI voice-changer program? On the apps page where programs are uninstalled, there is no entry named AI voice-changer. Or can we just delete the extracted files right away, since it runs from the command line?
And the ideal duration? How long? :/
I've tested it, and the split version is way, way better. The non-split one trained faster, but the result is worse.
So let me tell you the cause of the problem that I have
Do you have an Instagram where I can connect with you?
The sound files I recorded are 44.1 kHz, but there is no such sample rate in RVC training. There are only 40k and 48k sample rates. Which one should we choose in this case? After UVR, I cleaned the silence in Audacity as you explained in the video, then I set the sample rate to 48000 in the settings and saved it. I did the training in two different ways with the same dataset (by selecting the 40k and 48k sample rates in RVC). In TensorBoard, the 48k run showed lower loss than the 40k run.
How do I install RVC on Windows? Please explain.
Hello! Thanks for the video. Could you say where to get well-prepared voice audio for training, please?
Can I do all this with a mobile phone? Someone please answer
Bro, how do you expect someone to understand what you're talking about when you talk as if the people watching are professionals? Please explain in simple terms
Great video as always. I was wondering if you know any way to separate voices, for example if there are two or more people talking. Would love to see a video on that!
I have always wondered whether I should also normalize the sound. Take the audio of video game characters, for example: they have dialogue in which they scream, get excited, and use a great variety of voice tones. That is very different from your dataset, which sounds very flat. Besides trimming silences with Audacity, is it worth using the Normalize Audio option to avoid spikes caused by shouting or loud dialogue? Or should they stay natural? Should I apply some other transformation?
Please 🥺🙏 I have 500 tracks on my PC and I need to convert them all at once 😭
RTX 3080 12GB, any good for AI?
I think the separated version is much better, and I will continue using it as well. As for the whisperer tool you mentioned, is it the audio splitter you made (the one that removes silence and separates files that are over 10 sec)?
My model will still sound like a fish