implement GVS ID assignment #7355

ahaessly · 2021-07-19T15:36:45Z

Assign IDs to samples.
Update the sample info table.
Update the import WDL to stop generating the sample_info.tsv files.

gatk-bot · 2021-07-30T15:13:56Z

Travis reported job failures from build 35222
Failures in the following jobs:

Test Type	JDK	Job ID	Logs
integration	openjdk11	35222.12	logs

kcibul

just a few comments, the id assignment part looks good!

kcibul · 2021-08-02T14:11:05Z

scripts/variantstore/wdl/GvsAssignIds.wdl

+workflow GvsAssignIds {
+
+  input {
+    Array[String] external_sample_names


Would this be more usable as a File that we then do a read_lines on? Or have both options?

ahaessly · Aug 4, 2021

We currently get this from the data model. Is there a reason someone would use a file instead?

kcibul · 2021-08-02T14:12:29Z

scripts/variantstore/wdl/GvsAssignIds.wdl

+      echo "project_id = ~{project_id}" > ~/.bigqueryrc
+
+      # create the lock table
+      bq --project_id=~{project_id} mk ~{dataset_name}.metadata_lock


just checking, this script exits with an error if the metadata_lock table already exists?

also -- maybe we should name this differently now that this isn't going into the "metadata" table? sample_info_lock or sample_id_assignment_lock?

also also… I'm surprised that this doesn't have to specify the schema when the table is created?

ahaessly · Aug 4, 2021

Will explicitly check whether the table exists and exit with an error if it does.
Renaming to sample_id_assignment_lock.
Adding the schema when creating the table.
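
The explicit existence check described in the reply above could look something like the following sketch. It is not the code from this PR: the function name `create_lock_table`, its positional arguments, and the one-column placeholder schema are assumptions for illustration, and the real task would use WDL placeholders such as `~{project_id}` inside a command block. The sketch relies only on `bq show` exiting non-zero when the table is not found and on `bq mk --table` accepting an inline schema.

```shell
#!/usr/bin/env bash
# Sketch only: function name, arguments, and schema are illustrative assumptions,
# not the implementation merged in this PR.

# Create the lock table, refusing to proceed if it already exists.
# Usage: create_lock_table PROJECT_ID DATASET_NAME
create_lock_table() {
  local project_id="$1"
  local dataset_name="$2"
  local table="${dataset_name}.sample_id_assignment_lock"

  # `bq show` succeeds only when the table can be found.
  if bq --project_id="${project_id}" show "${table}" > /dev/null 2>&1; then
    echo "ERROR: lock table ${table} already exists" >&2
    return 1
  fi

  # Placeholder schema: the PR conversation does not show the real columns.
  bq --project_id="${project_id}" mk --table "${table}" "dummy:STRING"
}
```

A caller would run `create_lock_table my-project my_dataset` and stop on a non-zero return. Note that `bq show` can also fail for reasons other than a missing table (for example, missing credentials); in that case the following `bq mk` would be expected to fail as well, so the function still returns an error.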

kcibul · 2021-08-02T14:20:48Z

scripts/variantstore/wdl/GvsImportGenomes.wdl

-  call GetMaxTableId {
-    input:
-      sample_map = sample_map
+  if (defined(sample_map)) {


What do we need the legacy mode for?

ahaessly · Aug 4, 2021

Will leaving this in be helpful for anyone? I wasn't sure whether people had tests that use a sample map file. That said, this path won't update the sample_info table anymore, so maybe I should just remove it. If anyone else has an opinion, let me know.

ahaessly force-pushed the ah_generate_id branch from 97bd366 to b0fe206 Compare July 20, 2021 22:26

ahaessly marked this pull request as ready for review July 22, 2021 15:03

ahaessly requested a review from kcibul July 30, 2021 20:46

ahaessly force-pushed the ah_generate_id branch from 5c38ec0 to 5295fa4 Compare July 30, 2021 20:47

kcibul approved these changes Aug 2, 2021

ahaessly added 9 commits August 9, 2021 11:51

implement GVS ID assignment

09db4e9

add force option to ignore dup samples and add new samples

e6f95ff

simplify assign ids

d0e494f

update create import tsvs to use bq for ids and not to create sample_…

285bda6

…info

document ingest wdl

7978000

fix table_id, fix disk_size

08f5403

allow sample_map to be passed in for backward compatibility

d478da4

fix table id to start at 1

74de48d

update from PR

6a05727

ahaessly force-pushed the ah_generate_id branch from 3144800 to 6a05727 Compare August 9, 2021 15:55

ahaessly merged commit 15bbb08 into ah_var_store Aug 11, 2021

ahaessly deleted the ah_generate_id branch August 11, 2021 17:43

This was referenced Mar 17, 2023

lb merge gvs branch #8248

Closed

testing something, please ignore #8251

Closed
