Vs 741 fix indefinite freeze in split intervals task when using exome data #8113

@koncheto-broad koncheto-broad commented Nov 29, 2022

This PR fixes two bugs.

First, the SplitIntervals task would enter WeightedSplitIntervals and hang. I added an extra boolean argument to extract so you can specify that you really don't want to use a weighted bed. Relatedly, the code branch that runs the original GATK SplitIntervals tool was incorrect: passing weight-bed-file to it as an argument caused a failure. The fix uses a slightly hacky method: it defines a string in WDL that is empty or non-empty depending on whether we use weighted beds, interpolates that string into the bash command, and then checks there whether it is empty to transmit that state. There is likely a cleaner way to do this, and I will probably rewrite this part more cleanly in the next revision.
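
A minimal sketch of the empty-string handoff described above, not the code in this PR. The function name `build_split_command` and the exact `--weight-bed-file` spelling are assumptions for illustration; in the real task the string would arrive through WDL interpolation rather than as a function argument.

```shell
# Hypothetical helper: choose which tool to run based on whether the
# interpolated string is empty. In the WDL task, the argument would be a
# value such as ~{weight_bed_arg} (an assumed name) that WDL renders as
# either "" or a full "--weight-bed-file <path>" string.
build_split_command() {
  weight_bed_arg="$1"
  if [ -z "${weight_bed_arg}" ]; then
    # Unweighted branch: the plain SplitIntervals tool must never be
    # handed the weight bed argument.
    echo "SplitIntervals"
  else
    # Weighted branch: pass the argument through unchanged.
    echo "WeightedSplitIntervals ${weight_bed_arg}"
  fi
}
```

Calling `build_split_command ""` selects the unweighted tool, while `build_split_command "--weight-bed-file weights.bed"` selects the weighted one and forwards the argument. A WDL `Boolean` rendered with `true`/`false` placeholders would probably be cleaner than testing for emptiness.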

Second, after SplitIntervals passed, we hit an error during ExtractTask. The way it expanded intervals to handle large deletions could sometimes subtract past the start of a chromosome. That logic needed to be patched in a few separate places to handle the mitochondrial DNA interval, which starts much closer to the beginning of its contig than the 10k base pair buffer allows. This PR has those changes too.
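
The clamping idea can be sketched as follows. This is an illustration only, not the patched Java code; `padded_start` is a hypothetical helper, and it assumes 1-based interval coordinates.

```shell
# Hypothetical helper: move an interval start left by a padding amount
# without running past the first base of the contig.
# Assumes 1-based coordinates, so the smallest valid start is 1.
padded_start() {
  start="$1"
  padding="$2"
  padded=$(( start - padding ))
  if [ "${padded}" -lt 1 ]; then
    # Subtracting the full buffer would go past the contig start,
    # so clamp to the first position instead.
    padded=1
  fi
  echo "${padded}"
}
```

With a 10000 bp buffer, `padded_start 50000 10000` yields 40000, while an interval starting at position 500 is clamped to 1 instead of going negative.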

Successful run here: https://app.terra.bio/#workspaces/gvs-dev/GVS%20Exome%20Test/job_history/a006a959-9300-42cf-84a7-38c70a35ee21

Successful run after incorporating PR changes: https://app.terra.bio/#workspaces/gvs-dev/GVS%20Exome%20Test/job_history/e2ee3abd-288e-4f1d-b5be-f78cf5400ce9

Successful run after last PR refactoring that allowed me to revert almost all changes to GvsUtils.SplitIntervals: https://app.terra.bio/#workspaces/gvs-dev/GVS%20Exome%20Test/job_history/94fed63a-98ca-466e-8d4c-ac97f24adf37

codecov bot commented Nov 29, 2022

Codecov Report

❗ No coverage uploaded for pull request base (ah_var_store@aa97a09).
The diff coverage is n/a.

Additional details and impacted files
@@               Coverage Diff                @@
##             ah_var_store     #8113   +/-   ##
================================================
  Coverage                ?   86.170%           
  Complexity              ?     35132           
================================================
  Files                   ?      2173           
  Lines                   ?    165045           
  Branches                ?     17794           
================================================
  Hits                    ?    142220           
  Misses                  ?     16479           
  Partials                ?      6346           

gatk-bot commented Nov 29, 2022

Github actions tests reported job failures from actions build 3577955591
Failures in the following jobs:

Test Type JDK Job ID Logs
unit 11 3577955591.13 logs
cloud 11 3577955591.11 logs
integration 11 3577955591.12 logs

@gatk-bot

Github actions tests reported job failures from actions build 3577966142
Failures in the following jobs:

Test Type JDK Job ID Logs
integration 11 3577966142.12 logs

…it against unset variable issues

Also adding branch back to dockstore so I can run an extract one last time just to verify the changes
@gatk-bot

Github actions tests reported job failures from actions build 3578304201
Failures in the following jobs:

Test Type JDK Job ID Logs
unit 11 3578304201.13 logs

gatk-bot commented Nov 29, 2022

Github actions tests reported job failures from actions build 3578374711
Failures in the following jobs:

Test Type JDK Job ID Logs
cloud 11 3578374711.11 logs
integration 11 3578374711.12 logs

koncheto-broad and others added 2 commits November 30, 2022 09:34
…ExtractCohortEngine.java


Sure, this is more concise

Co-authored-by: Miguel Covarrubias <[email protected]>
…earer, but it's fewer lines and the comment should make clear what it's doing

gatk-bot commented Nov 30, 2022

Github actions tests reported job failures from actions build 3584250201
Failures in the following jobs:

Test Type JDK Job ID Logs
unit 11 3584250201.13 logs
integration 11 3584250201.12 logs

Two outdated review threads on scripts/variantstore/wdl/GvsUtils.wdl (resolved)

gatk-bot commented Nov 30, 2022

Github actions tests reported job failures from actions build 3586714857
Failures in the following jobs:

Test Type JDK Job ID Logs
cloud 11 3586714857.11 logs
unit 11 3586714857.13 logs

gatk-bot commented Nov 30, 2022

Github actions tests reported job failures from actions build 3587103510
Failures in the following jobs:

Test Type JDK Job ID Logs
cloud 11 3587103510.11 logs
unit 11 3587103510.13 logs


@mcovarr mcovarr left a comment

Looks good!

command <<<
# Updating to use standard shell boilerplate
PS4='\D{+%F %T} \w $ '
set -o errexit -o nounset -o pipefail -o xtrace
set -e

this line can go now as it's the terse form of the set -o errexit above
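
A small sketch for checking that the two spellings are equivalent. The helper name `errexit_state` is hypothetical; it relies only on the standard `$-` parameter, which contains `e` whenever errexit is active.

```shell
# Hypothetical helper: run an option-setting command in a subshell and
# report whether errexit ended up enabled there.
errexit_state() {
  (
    set +e            # start from a known state with errexit off
    "$@"              # apply the option change under test
    case "$-" in
      *e*) echo on ;;
      *)   echo off ;;
    esac
  )
}
```

Both `errexit_state set -o errexit` and `errexit_state set -e` print `on`, so the second `set -e` in the command block adds nothing once the long form is present.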

@koncheto-broad koncheto-broad merged commit a69661b into ah_var_store Dec 1, 2022
@koncheto-broad koncheto-broad deleted the VS-741-fix-indefinite-freeze-in-split-intervals-task-when-using-exome-data branch December 1, 2022 14:02