Replies: 2 comments 1 reply
-
How can you train the acoustic model on generated pitch when there is no corresponding mel-spectrogram? We only have mel-spectrograms for the ground-truth data.
-
I was imagining using the f0 and duration predictions to replace the ground truth in the acoustic training dataset (which already contains the mel-spectrograms): train the acoustic network to generate the mel-spectrogram from the output of the variance model, rather than from the ground-truth f0 and duration data. This could be a branch in the acoustic binarizer: take the raw training data, pass the relevant fields through a trained variance model to predict ph_dur and f0_seq, and write those predictions into the training data in place of the ground-truth ph_dur and f0_seq.
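To make the idea concrete, here is a minimal sketch of that binarizer branch. Everything here is hypothetical: `VarianceModel`, its `predict` method, and `binarize_item` are stand-ins for illustration and are not the actual binarizer or model API.

```python
class VarianceModel:
    """Stand-in for a trained duration + pitch (f0) predictor."""

    def predict(self, ph_seq, note_seq):
        # A real model would run inference here; we return toy values.
        ph_dur = [0.25 for _ in ph_seq]   # predicted seconds per phoneme
        f0_seq = [440.0 for _ in ph_seq]  # predicted f0 in Hz (toy)
        return ph_dur, f0_seq


def binarize_item(item, variance_model=None):
    """Prepare one training item for the acoustic model.

    When `variance_model` is given, its predicted ph_dur/f0_seq replace
    the ground-truth fields, so the acoustic model learns to map
    variance-model output -> mel-spectrogram.
    """
    out = dict(item)  # keep the mel-spectrogram and other fields as-is
    if variance_model is not None:
        ph_dur, f0_seq = variance_model.predict(item["ph_seq"], item["note_seq"])
        out["ph_dur"] = ph_dur
        out["f0_seq"] = f0_seq
    return out


raw = {
    "ph_seq": ["a", "i", "u"],
    "note_seq": ["C4", "D4", "E4"],
    "ph_dur": [0.30, 0.20, 0.40],      # ground-truth durations
    "f0_seq": [261.6, 293.7, 329.6],   # ground-truth f0
    "mel": "mel-spectrogram tensor goes here",
}

item = binarize_item(raw, variance_model=VarianceModel())
```

With no `variance_model` argument the item passes through unchanged, so the same code path can still produce a fully ground-truth dataset.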
-
Once a variance model has finished training, would it be possible to train the acoustic model on its output rather than on the ground truth?
I would expect better results at inference time than with the detached approach, where both models are trained on the ground truth.