support for multiple PET/VETs #6969

Merged
merged 4 commits into from
Nov 19, 2020

Contributor

@kcibul kcibul commented Nov 19, 2020

No description provided.

Contributor

@meganshand meganshand left a comment

Couple of small questions, but looks good 👍

preemptible_tries = preemptible_tries,
docker = docker_final
input:
done = CreateImportTsvs.done[0],
Contributor

Does Cromwell wait until the full scatter of CreateImportTsvs is complete? Or will this run as soon as the first shard is done?

Contributor Author

I didn't change this -- so I assumed it was doing the right thing before (it does seem to wait), but I wonder if "done" should instead be an array of all the inputs. @ahaessly how did you arrive at this solution?

Contributor

@ahaessly ahaessly Nov 19, 2020

@kcibul I think that when I used done, it was just a true/false that was set at the end of a task. I don't think I used it after a scatter, but I think passing the array would be safest. I guess we would change the input param to be an Array[String]? (If it's an Array[File], we should set localization_optional to true in parameter_meta so it doesn't try to localize them all.)

Contributor

Yeah I agree, we should pass in the whole Array.
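
A minimal WDL sketch of the change agreed on above, assuming a trimmed-down workflow: the workflow name `ImportSketch`, the `sample_names` input, the task bodies, and the `ubuntu:20.04` image are placeholders, not the PR's code. It passes the gathered `CreateImportTsvs.done` array to `LoadData` instead of indexing `done[0]`, so the dependency on every shard is explicit in the call.

```wdl
version 1.0

# Hypothetical, trimmed-down workflow; task bodies are placeholders.
workflow ImportSketch {
  input {
    Array[String] sample_names
  }

  scatter (name in sample_names) {
    call CreateImportTsvs { input: sample_name = name }
  }

  # Outside the scatter, CreateImportTsvs.done is an Array[String] with one
  # element per shard. Passing the whole array (not done[0]) ties LoadData
  # to the output of every shard.
  call LoadData { input: done = CreateImportTsvs.done }
}

task CreateImportTsvs {
  input {
    String sample_name
  }
  command <<<
    echo "processing ~{sample_name}"
  >>>
  output {
    # Simple completion flag, set once the task has finished.
    String done = "true"
  }
  runtime {
    docker: "ubuntu:20.04"  # placeholder image
  }
}

task LoadData {
  input {
    Array[String] done
  }
  command <<<
    echo "all ~{length(done)} shards finished"
  >>>
  runtime {
    docker: "ubuntu:20.04"  # placeholder image
  }
}
```

If the flags were `File` values instead of `String`, the `localization_optional` hint in `parameter_meta` mentioned above is what would keep Cromwell from localizing each one.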

@@ -178,10 +207,6 @@ task LoadData {

command <<<
set -x
set +e
# make sure dataset exists
bq ls --project_id ~{project_id} ~{dataset_name} > /dev/null
Contributor

Does this still fail with a nice error message if there's no dataset? (Nice enough that I'll realize I forgot to make one)

Contributor Author

I removed this because later on, when you go to make the table, it fails with a clear error message if the dataset doesn't exist
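
If an explicit pre-flight check were still wanted, it could look roughly like the sketch below. This is not part of the PR, which dropped the check; the task name `CheckDataset`, the `docker` input, and the error text are hypothetical, and the `bq ls` invocation is copied from the removed lines.

```wdl
version 1.0

# Hypothetical pre-flight task, shown only to illustrate an explicit check.
task CheckDataset {
  input {
    String project_id
    String dataset_name
    String docker  # assumed: any image that ships the bq CLI
  }
  command <<<
    set -e
    # Same bq call as the removed lines; bail out with an explicit message on failure.
    if ! bq ls --project_id ~{project_id} ~{dataset_name} > /dev/null; then
      echo "ERROR: could not list dataset ~{dataset_name} in project ~{project_id}; was it created?" >&2
      exit 1
    fi
  >>>
  output {
    Boolean ok = true
  }
  runtime {
    docker: docker
  }
}
```

The trade-off discussed above still applies: the later table-creation step already fails with a clear message, so a check like this mainly moves the failure earlier.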

- memory: "4 GB"
- disks: "local-disk 10 HDD"
+ memory: "10 GB"
+ disks: "local-disk 1000 HDD"
Contributor

Why does loading the data need so much disk? Isn't it just importing what's in a separate google bucket?

Contributor Author

no idea -- I changed it to match Terra, but you're right. I'm going to change it back :D

@gatk-bot

Travis reported job failures from build 32174
Failures in the following jobs:

| Test Type | JDK | Job ID | Logs |
| --- | --- | --- | --- |
| conda | openjdk8 | 32174.5 | logs |

@gatk-bot

Travis reported job failures from build 32176
Failures in the following jobs:

| Test Type | JDK | Job ID | Logs |
| --- | --- | --- | --- |
| conda | openjdk8 | 32176.5 | logs |

@kcibul kcibul merged commit 7399f7d into ah_var_store Nov 19, 2020
@kcibul kcibul deleted the kc_import_genomes_wdl branch November 19, 2020 20:39
@gatk-bot

Travis reported job failures from build 32201
Failures in the following jobs:

| Test Type | JDK | Job ID | Logs |
| --- | --- | --- | --- |
| conda | openjdk8 | 32201.5 | logs |

kcibul added a commit that referenced this pull request Jan 29, 2021
* support for multiple PET/VETs
* fixed to use latest GATK JAR, eliminate odd java NoSuchMethod error
* PR feedback
kcibul added a commit that referenced this pull request Feb 1, 2021
* support for multiple PET/VETs
* fixed to use latest GATK JAR, eliminate odd java NoSuchMethod error
* PR feedback
Marianie-Simeon pushed a commit that referenced this pull request Feb 16, 2021
* support for multiple PET/VETs
* fixed to use latest GATK JAR, eliminate odd java NoSuchMethod error
* PR feedback
kcibul added a commit that referenced this pull request Mar 9, 2021
* support for multiple PET/VETs
* fixed to use latest GATK JAR, eliminate odd java NoSuchMethod error
* PR feedback
mmorgantaylor pushed a commit that referenced this pull request Apr 6, 2021
* support for multiple PET/VETs
* fixed to use latest GATK JAR, eliminate odd java NoSuchMethod error
* PR feedback
This was referenced Mar 17, 2023
4 participants