saving intermediate output from "grow" #181

Open
zoey-rw opened this issue May 27, 2024 · 1 comment

Comments

zoey-rw commented May 27, 2024

I'm wondering what the best approach would be to save output from the "grow" workflow in case it does not complete for a large dataset. I'm splitting my manifest file by rows into smaller chunks, which produces many results files, but the run still sometimes stalls at 98-99% complete. That might just be due to one problematic sample (right?)

Would you recommend modifying the internal "_growth" function to save results right before the "return" call? Ideally, the growth results could then be combined to still be used as inputs for visualizations.

Thanks!

Collaborator

cdiener commented May 29, 2024

Sorry to hear you're still having problems with hanging samples. Did you try version 0.35.0 by any chance? That at least should be more stable in the workflows. It kind of does what you are asking already in the sense that if a sample actually fails it will return the solution for all the others. But there is no caching or restart mechanism.

You could also manage it yourself, because grow in principle works fine with a single sample as well. That could be integrated into Nextflow (this is planned) or Snakemake to get what you are asking for.
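
To make the "manage it yourself" idea concrete, here is a minimal sketch of a per-sample loop with on-disk caching. It is not part of MICOM: `run_with_cache`, `run_one`, and the cache layout are hypothetical names made up for illustration, and the sketch only handles samples that raise an error, not samples that hang (those would need a separate timeout).

```python
import pickle
from pathlib import Path


def run_with_cache(sample_ids, run_one, cache_dir):
    """Run `run_one(sample_id)` for each sample and pickle each result.

    Samples that already have a cache file are loaded instead of being
    recomputed, so an interrupted run can be restarted without losing
    finished work. Returns (results, failed): results by sample id, and
    error messages for the samples that raised an exception.
    """
    cache = Path(cache_dir)
    cache.mkdir(parents=True, exist_ok=True)
    results, failed = {}, {}
    for sid in sample_ids:
        path = cache / f"{sid}.pickle"
        if path.exists():
            # Finished in an earlier run: reuse the saved result.
            with path.open("rb") as fh:
                results[sid] = pickle.load(fh)
            continue
        try:
            res = run_one(sid)
        except Exception as err:
            # Record the failure and continue with the other samples.
            failed[sid] = repr(err)
            continue
        # Write to a temporary file first so a crash does not leave a
        # half-written cache entry behind.
        tmp = path.with_suffix(".tmp")
        with tmp.open("wb") as fh:
            pickle.dump(res, fh)
        tmp.replace(path)
        results[sid] = res
    return results, failed


# Hypothetical usage with MICOM. This assumes the manifest is a pandas
# DataFrame with a `sample_id` column and that `grow` accepts a one-row
# manifest, as described above. The tradeoff value is only a placeholder.
#
# from micom.workflows import grow
# run_one = lambda sid: grow(manifest[manifest.sample_id == sid],
#                            model_folder, medium, tradeoff=0.5, threads=1)
# results, failed = run_with_cache(manifest.sample_id, run_one, "grow_cache")
```

The per-sample results would still need to be combined into a single object before they can be passed to the visualizations. How best to do that depends on the MICOM version, so it is left out here.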
