Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Continue training on model #25

Closed
vlangloisx3 opened this issue Sep 25, 2020 · 8 comments
Closed

Continue training on model #25

vlangloisx3 opened this issue Sep 25, 2020 · 8 comments
Assignees
Labels

Comments

@vlangloisx3
Copy link

I've been having some issues with the --continueParams option. The code that I am using to continue training a model is below:

uuparser --outdir ewt_output --datadir ud-treebanks-v2.6 --include "en_ewt" --continue --continueParams=params.pickle

My assumption is that the params file is in the same folder as the outdir, but I get the error: No such file or directory: 'params.pickle'. I've also tried inserting the actual path, but get the error: UnicodeDecodeError: 'utf-8' codec can't decode byte 0x80 in position 0: invalid start byte.

Since neither of these seem to work, where does the Params file need to be in order to load it?

Thanks!
Val

@LoicGrobol
Copy link
Collaborator

Hi ! Can you paste the command line you used when using the actual path and the traceback for the UnicodeDecodeError?

@vlangloisx3
Copy link
Author

Sure! Here is the command line I used when using the path:
uuparser --outdir ewt_output --datadir ud-treebanks-v2.6 --include "en_ewt" --continue --continueParams=/Users/vlanglois/Desktop/project/ewt_output/en_ewt/params.pickle

and here is the traceback:
Traceback (most recent call last): File "/Library/anaconda3/bin/uuparser", line 8, in <module> sys.exit(main()) File "/Library/anaconda3/lib/python3.7/site-packages/uuparser/parser.py", line 279, in main run(experiment,options) File "/Library/anaconda3/lib/python3.7/site-packages/uuparser/parser.py", line 36, in run stored_vocab, stored_options = pickle.load(paramsfp) File "/Library/anaconda3/lib/python3.7/codecs.py", line 322, in decode (result, consumed) = self._buffer_decode(data, self.errors, final) UnicodeDecodeError: 'utf-8' codec can't decode byte 0x80 in position 0: invalid start byte

@LoicGrobol
Copy link
Collaborator

LoicGrobol commented Oct 9, 2020

(Sorry for the latency, I am really low-bandwidth on uuparser right now but I'll try to fix this :-)

Can you try installing from the master branch (using pip install git+https://github.com/UppsalaNLP/uuparser)? There were some changes since the last release, which I suspect won't make a difference for this but it wouldn't hurt to be sure and at least we'd have up-to-date line numbers this way

Actually, I think I found the issue, can you try installing from the fixup branch using pip install git+https://github.com/UppsalaNLP/uuparser@fix-25 and see if it fixes the issue?

@LoicGrobol LoicGrobol self-assigned this Oct 9, 2020
@LoicGrobol LoicGrobol added the bug label Oct 9, 2020
@vlangloisx3
Copy link
Author

I'm still getting the same error even after installing from the fixup branch. It could be something wrong on my end though.

(Thanks for taking the time to look into it!)

@LoicGrobol
Copy link
Collaborator

Oh, if you installed with the exact command I posted in an env where uuparser is already installed, it might not have been updated because I forgot the -U flag, could you retry with pip install -U git+https://github.com/UppsalaNLP/uuparser@fix-25? Sorry for the inconvenience

@vlangloisx3
Copy link
Author

Yes that solved the issue! Thank you so much!

@LoicGrobol
Copy link
Collaborator

Great, I'll merge the fix and push a release before the end of the week, then. Thank you for catching this one :-)

@LoicGrobol
Copy link
Collaborator

Closed by #26

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants