Porting to PyTorch #3

nithya4 · 2017-12-11T18:25:49Z

I tried porting the code to PyTorch, specifically the Tensorflow version. The latter works perfectly.
But with PyTorch when I use optim.LBFGS, I run into exploding gradients/no updates on the target.
The error is in the update:
151
152 #update scale of initial Hessian approximation
--> 153 H_diag = ys / y.dot(y) # (y*y)
154
155 # compute the approximate (L-BFGS) inverse Hessian

ZeroDivisionError: float division by zero

I am running this on CPU only for the time being - macOS 10.13.1 64 bit.
Any insights on why this may be happening?

The text was updated successfully, but these errors were encountered:

sidak · 2018-05-11T12:38:21Z

Hi @nithya4 ! Were you able to resolve this issue? Thanks! :)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Porting to PyTorch #3

Porting to PyTorch #3

nithya4 commented Dec 11, 2017 •

edited

Loading

sidak commented May 11, 2018

Porting to PyTorch #3

Porting to PyTorch #3

Comments

nithya4 commented Dec 11, 2017 • edited Loading

sidak commented May 11, 2018

nithya4 commented Dec 11, 2017 •

edited

Loading