What do z_dim and c_dim stand for？ #36

Hu-chengyang · 2022-09-07T12:01:30Z

Dear PHD:
Could you tell me what do z_dim:64 and c_dim:256 in config/model/default stand for？And what n_embeddings: 512 in config/model/default stand for?Thank you very much.

Wendison · 2022-09-11T09:45:56Z

Hi, all these three variables are related with content encoder, z_dim denotes the dimension of acoustic units (z) in VQ codebook, c_dim denotes the dimension of continuous vectors after LSTM (g-net in the paper) that takes z as inputs, n_embeddings is the number of acoustic units in VQ codebook.

Hu-chengyang · 2022-09-11T14:30:43Z

Thank you!

Hu-chengyang · 2022-09-13T12:58:08Z

In model_encoder.py/class Encoder(nn.Module)/def forwad(self, mels):
z = self.conv(mels.float()) # (bz, 80, 128) -> (bz, 512, 128/2)

what does 128 mean？What variable does it represent?
Thank you very much.

Wendison · 2022-09-20T11:44:43Z

128 is the number of frames of mel-spectrograms used for training, it denotes 1.28s of waveform.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

What do z_dim and c_dim stand for？ #36

What do z_dim and c_dim stand for？ #36

Hu-chengyang commented Sep 7, 2022

Wendison commented Sep 11, 2022

Hu-chengyang commented Sep 11, 2022

Hu-chengyang commented Sep 13, 2022

Wendison commented Sep 20, 2022

What do z_dim and c_dim stand for？ #36

What do z_dim and c_dim stand for？ #36

Comments

Hu-chengyang commented Sep 7, 2022

Wendison commented Sep 11, 2022

Hu-chengyang commented Sep 11, 2022

Hu-chengyang commented Sep 13, 2022

Wendison commented Sep 20, 2022