-
Notifications
You must be signed in to change notification settings - Fork 26
Fix VertexAICustomTrainingJob failing to cancel #205
Conversation
Thanks for opening a PR @jeremy-thomas-roc! If this error occurs when canceling via the Prefect UI, then my hunch is that the bug is in the |
So in going through the Vertex logs, this is what I see. Note the change in timestamps on the left. The exception doesn't occur until the job is canceled via the Vertex UI. Prior to that, it logs nothing. I have attached the exception here as well, it seems that the prefect cancelation is there, but again, not until the job is manually canceled in the Vertex UI. Did I miss where the |
Agents are responsible for calling the We need more info on why these Vertex jobs are hanging. I'd expect Vertex to teardown the job if they succeeded or failed. In the meantime, we can expose a |
@desertaxle great, this looks like it will work perfectly. I will work on implementing this instead of the explicit Actually, this already exists in the block. |
Uses the existing status code check at the end of
run
to callkill
, which has seemingly been implemented, but never called.I'm not exactly sure how I would go about writing a test for this, happy to do it if provided some direction for the best way to do this.
Closes
PrefectHQ/prefect#13056
Screenshots
No update to the docs
Checklist
pre-commit
checks.pre-commit install && pre-commit run --all
locally for formatting and linting.mkdocs serve
view documentation locally.