Document AoU SOP (up to the VAT) [VS-63] #7807

rsasch · 2022-04-25T15:59:39Z

No description provided.

mcovarr · 2022-04-25T17:38:39Z

scripts/variantstore/AOU_DELIVERABLES.md

-        4. the "cohort_extract_table_prefix" input from `GvsExtractCallset` step
-        5. the "filter_set_name" input from `GvsCreateFilterSet` step
+## Prerequisites
+- If this is the first time running the GVS pipeline in a particular Google billing project, use your GCP account team to create a support ticket for the BigQuery team that includes "enabling cluster metadata pruning support for the BQ Read API." This enables a pre-GA feature that dramatically reduces the amount of data scanned reducing both cost and runtime.


"enabling cluster metadata pruning support for the BQ Read API" means autopoking?

Actually, this is a separate issue (otherwise referred to as "whitelisting"). I need to confirm that this has been resolved.

What is 'pre-GA'?

It's a term that Google uses; I believe GA means "General Availability".

codecov · 2022-04-25T20:12:35Z

Codecov Report

❗ No coverage uploaded for pull request base (ah_var_store@614a0f7).
The diff coverage is n/a.

@@               Coverage Diff                @@
##             ah_var_store     #7807   +/-   ##
================================================
  Coverage                ?   86.295%           
  Complexity              ?     35191           
================================================
  Files                   ?      2170           
  Lines                   ?    164837           
  Branches                ?     17775           
================================================
  Hits                    ?    142246           
  Misses                  ?     16265           
  Partials                ?      6326

gbggrant · 2022-04-26T15:41:53Z

What is 'pre-GA'?

mcovarr · 2022-04-25T18:20:45Z

scripts/variantstore/AOU_DELIVERABLES.md

+4. GvsCreateAltAllele
+5. GvsCreateFilterSet (see [naming conventions doc](https://docs.google.com/document/d/1pNtuv7uDoiOFPbwe4zx5sAGH7MyxwKqXkyrpNmBxeow) for guidance on what to name the filter set, which you will need to keep track of for the `GvsExtractCallset` WDL).
+6. GvsPrepareRangesCallset needs to be run twice, once with `control_samples` set to "true" (see [naming conventions doc](https://docs.google.com/document/d/1pNtuv7uDoiOFPbwe4zx5sAGH7MyxwKqXkyrpNmBxeow) for guidance on what to use for `extract_table_prefix`  or cohort prefix, which you will need to keep track of for the `GvsExtractCallset` WDL).
+7. GvsExtractCallset needs to be run twice, once with `control_samples` set to "true", and with the `filter_set_name` and `extract_table_prefix` from step 5 & 6.  Include a valid (and secure) "output_gcs_dir" parameter, which is where the VCF and interval list files  will go.


Suggested change

-7. GvsExtractCallset needs to be run twice, once with `control_samples` set to "true", and with the `filter_set_name` and `extract_table_prefix` from step 5 & 6.  Include a valid (and secure) "output_gcs_dir" parameter, which is where the VCF and interval list files  will go.
+7. GvsExtractCallset needs to be run twice, once with `control_samples` set to "true", and with the `filter_set_name` and `extract_table_prefix` from step 5 & 6. Include a valid (and secure) "output_gcs_dir" parameter, which is where the VCF and interval list files will go.
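For anyone reading step 7 cold, here is a rough sketch of what the inputs for the two `GvsExtractCallset` runs might look like. It is not taken from the WDL. It assumes the workflow is named `GvsExtractCallset`, that inputs use Cromwell-style `Workflow.input` keys, that `control_samples` is a Boolean, and that both runs can share the same prefix and output directory (the doc does not say either way). The helper names and all sample values are hypothetical.

```python
import json

# Assumption: the WDL workflow name matches the step name used in the SOP.
WORKFLOW = "GvsExtractCallset"


def extract_inputs(filter_set_name, extract_table_prefix, output_gcs_dir, control_samples):
    """Return a Cromwell-style inputs mapping for one GvsExtractCallset run."""
    return {
        f"{WORKFLOW}.filter_set_name": filter_set_name,            # from step 5
        f"{WORKFLOW}.extract_table_prefix": extract_table_prefix,  # from step 6
        f"{WORKFLOW}.output_gcs_dir": output_gcs_dir,              # VCFs and interval lists land here
        f"{WORKFLOW}.control_samples": control_samples,
    }


def both_extract_runs(filter_set_name, extract_table_prefix, output_gcs_dir):
    """Return inputs for the two runs: control_samples false, then true."""
    return [
        extract_inputs(filter_set_name, extract_table_prefix, output_gcs_dir, flag)
        for flag in (False, True)
    ]


if __name__ == "__main__":
    # Placeholder values only; use the names chosen per the naming conventions doc.
    for run in both_extract_runs("my_filter_set", "my_cohort", "gs://my-secure-bucket/extract"):
        print(json.dumps(run, indent=2))
```

If the control run needs its own prefix or output directory, call `extract_inputs` directly for each run instead of `both_extract_runs`.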

rsasch added 2 commits April 25, 2022 11:58

everything up to the VAT

a96d042

nit

fb6be8f

mcovarr reviewed Apr 25, 2022

PR review

fd13c5e

rsasch force-pushed the rsa_aou_sop branch from 1510cd3 to fd13c5e Compare April 25, 2022 20:01

more PR comments

4235d85

add VAT

393b9e0

RoriCremer reviewed Apr 25, 2022

gbggrant approved these changes Apr 26, 2022

Update scripts/variantstore/AOU_DELIVERABLES.md

59be35b

Co-authored-by: George Grant <[email protected]>

mcovarr approved these changes Apr 26, 2022

rsasch merged commit ba7a26c into ah_var_store Apr 26, 2022

rsasch deleted the rsa_aou_sop branch April 26, 2022 18:43

This was referenced Mar 17, 2023

lb merge gvs branch #8248

Closed

testing something, please ignore #8251

Closed
