Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to use this repository? #5

Closed
Mr47121836 opened this issue Jun 29, 2024 · 2 comments
Closed

How to use this repository? #5

Mr47121836 opened this issue Jun 29, 2024 · 2 comments

Comments

@Mr47121836
Copy link

I have reviewed the paper, which asserts that there is no audio or information from unknown speakers during the training process. However, in your code, your validation set includes information from unknown speakers. Doesn't this imply that the model had already assimilated information from unknown speakers during training, and the reference speech used for inference corresponds to the speech encountered during training?

image
image
For speaker p261,the model already meet this speaker?

please help me!

@hcy71o
Copy link
Owner

hcy71o commented Aug 14, 2024

First of all, I am not the author of the paper. The purpose of this repo is to test the model, and the authors’ implementation and dataset configuration may differ. Additionally, in the current code, there is no bias in tuning the model because only a single test sample is generated (link), and there is no validation loss calculation during the training process.

@hcy71o hcy71o closed this as completed Aug 14, 2024
@Mr47121836
Copy link
Author

Mr47121836 commented Aug 14, 2024 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants