-
Notifications
You must be signed in to change notification settings - Fork 16
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
how to reproduce the pitch, intonation and other nuances of a voice? I would like to find a tutorial or a guide (Intonation and Pitch in RVC or mastery of RVC Configuration)? #9
Comments
Sorry to break you up, but I have no knowledge either about those things. Since those variables are math stuff that I don"t understand yet and are full of matrix data. Iirc, Index files are like external/additional voice data, the more dataset you have the more bigger index file size. If you want to learn more, you could try asking the original RVC creator or you could try asking here |
Since you are the creator, can I ask you a direct question please?
|
First thing first, I"m not the original RVC creator, here is the original rvc project If you want to train a voice, you would need the voice dataset, it could be getting the voice inside the game or recording the audio from a cutscenes. "How long of the data do I need?" At least 10 minutes or more. It kinda depends, sometimes more dataset have its advantage than lower dataset. You just need to test it for yourself. If your voice has some kind of background sound you could use UVR (search on github) before training. .pth files are generated using pytorch library which contains model data. .pth that has been generated by rvc software, can only be used on rvc software. The generated .pth files from rvc software are not compatible with other software, since the data inside needs a specific handler. |
RVC ConfigurationI know how to reproduce a voice using a certain number of data, clean them and then run RVC. But I know nothing about the index file, or anythign related to: n_fft, hop_length, win_length, and n_mels, IVF, Flat ??
I would like to learn to not only reproduce a voice (vocal timbre) of someone, but also how to reproduce the pitch, intonation and other nuances of a voice.
Is that possible with RVC?
If so, where can I learn that please?
The text was updated successfully, but these errors were encountered: