Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Mel stats and Vocoder #32

Open
winddori2002 opened this issue Apr 26, 2022 · 2 comments
Open

Mel stats and Vocoder #32

winddori2002 opened this issue Apr 26, 2022 · 2 comments

Comments

@winddori2002
Copy link

Hi,
I try to reproduce your paper and I encounter a problem with mel stats and vocoder.
When I use your pre-trained vocoder and mel stats, I can notice the speech synthesis performance is quite good.
However, when I run the preprocess code and get new mel stats, the speech synthesis performance degrades on the same pre-trained vocoder.
Thus, the questions are as below:

1.) I wonder if I get new mel stats, it is necessary to train the vocoder again.
2.) I wonder if you use mel stats from the preprocess code for vocoder input normalization.

Thank you

@Wendison
Copy link
Owner

Hi, based on my experience, using the same mel stats for vocoder and VC model leads to better voice quality, so for your questions:

  1. I think that training a vocoder using the new mel stats could generate the speech with higher quality, or you can use my provided mel stats (from PWG vocoder trained by VCTK) to normalize mels for training the VC model.
  2. the mels stats for vocoder input normalization is not from preprocess code, it is from the PWG repo for preprocessing mels.

@winddori2002
Copy link
Author

Thank you for answering!
I understand and solve it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants