Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

.lab and .marker files corresponding words not match for Hindi #19

Open
shrishtis7 opened this issue Jul 21, 2023 · 0 comments
Open

.lab and .marker files corresponding words not match for Hindi #19

shrishtis7 opened this issue Jul 21, 2023 · 0 comments

Comments

@shrishtis7
Copy link

I have defined the Hindi letters (in Unicode characters) in the sympol.py file and also the phone set. While updating the marker file, I am receiving a warning " Correspondance issue between words in the .lab sentence and those in .markers file". I suppose because of this warning the following steps in the feature extraction process are giving errors. Some of the characters called dependent vowels in the Hindi language are not correctly recognised in the lab file. I have attached my symbols.py file. Please help me with this, as I require this to work in the Hindi language. I am confused about what I am missing. What changes are needed so that I can move past this error and move on to the training step?
symbols.txt
Error_pre_process
extract_features.txt

This is the sample format of lab and marker files I have received respectively.
lab file [rec-13]: उन्नीस,सौ,अड़तीस,में,नाज़ी,जर्मनी,ने,ऑस्ट्रिया,पर,कब्ज़ा,कर,लिया,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,
marker file[rec-13]:
marker.txt.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant