Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

check for duplicate ids #7273

Merged
merged 5 commits into from
May 27, 2021
Merged

check for duplicate ids #7273

merged 5 commits into from
May 27, 2021

Conversation

ahaessly
Copy link
Contributor

@ahaessly ahaessly commented May 24, 2021

query the partition table to see if samples have already been loaded

Copy link

@rsasch rsasch left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM but should probably get another look from someone with more BQ / bash expertise

Copy link
Contributor

@kcibul kcibul left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm


if [ ~{has_service_account_file} = 'true' ]; then
gcloud auth activate-service-account --key-file='~{service_account_json}'
gcloud config set project ~{project_id}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

should we also set the environment variable here? I don't know if it's needed but just to be consistent everywhere

            export GOOGLE_APPLICATION_CREDENTIALS=~{service_account_json}

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

we only need to set the environment variable when the gatk tool needs to access bigquery. Also, I believe (but have not verified) that when we set it, we have to localize all input files that are not in the aou project. Meaning that if we try to pass a gs path to the GATK that is in a workspace bucket, it won't be able to read the file.

scripts/variantstore/wdl/GvsImportGenomes.wdl Outdated Show resolved Hide resolved
fi

# true if there is data in results
if [ -s duplicates ]; then
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nice -- I learned something new!

@ahaessly ahaessly merged commit 2f43ae2 into ah_var_store May 27, 2021
@ahaessly ahaessly deleted the ah_check_for_dupes branch May 27, 2021 12:44
This was referenced Mar 17, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants