-
Notifications
You must be signed in to change notification settings - Fork 587
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
check for duplicate ids #7273
check for duplicate ids #7273
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM but should probably get another look from someone with more BQ / bash expertise
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
lgtm
|
||
if [ ~{has_service_account_file} = 'true' ]; then | ||
gcloud auth activate-service-account --key-file='~{service_account_json}' | ||
gcloud config set project ~{project_id} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
should we also set the environment variable here? I don't know if it's needed but just to be consistent everywhere
export GOOGLE_APPLICATION_CREDENTIALS=~{service_account_json}
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
we only need to set the environment variable when the gatk tool needs to access bigquery. Also, I believe (but have not verified) that when we set it, we have to localize all input files that are not in the aou project. Meaning that if we try to pass a gs path to the GATK that is in a workspace bucket, it won't be able to read the file.
fi | ||
|
||
# true if there is data in results | ||
if [ -s duplicates ]; then |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nice -- I learned something new!
query the partition table to see if samples have already been loaded