VAT Readme updates #8090

Merged: 5 commits merged into ah_var_store from rc-vds-vat-readme-update on Dec 5, 2022
Conversation

RoriCremer
Contributor

@RoriCremer RoriCremer commented Nov 10, 2022

Update the VAT pipeline README to cover the VDS inputs.

This will need further editing once George's VDS VAT pipeline changes are complete.

scripts/variantstore/variant_annotations_table/README.md (three outdated review threads, resolved)
The third input is the ancestry file from the ancestry pipeline which will be used to calculate AC, AN and AF for all subpopulations. It needs to be copied into a GCP bucket that this pipeline will have access to. This input has been labelled as the `ancestry_file`.

Most of the other inputs are specific to where the VAT will live: the `project_id`, the `dataset_name`, and the `table_suffix` (which names the VAT itself as vat_`table_suffix`), as well as a GCP bucket location, the `output_path`, for the intermediary files and the VAT export in TSV form.

All optional inputs are provided with default values.
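
As a rough illustration of the inputs described above, the following sketch assembles an inputs JSON for the workflow. It is not taken from the pipeline itself: the `GvsCreateVATfromVDS.` key prefix assumes these are top-level workflow inputs following the usual WDL inputs-JSON convention, the helper names `build_inputs` and `vat_table_name` are hypothetical, and every value shown is a placeholder. The two VDS-derived inputs are left out because their names are not given here.

```python
import json

# Assumption: inputs are keyed as "<workflow name>.<input name>".
WORKFLOW = "GvsCreateVATfromVDS"


def vat_table_name(table_suffix):
    """Return the VAT table name, which is vat_<table_suffix>."""
    return f"vat_{table_suffix}"


def build_inputs(ancestry_file, project_id, dataset_name, table_suffix, output_path):
    """Build a dict of the required inputs named in the README text.

    Hypothetical helper: it only checks that the two bucket-based
    inputs look like GCP bucket paths before assembling the dict.
    """
    for name, value in (("ancestry_file", ancestry_file), ("output_path", output_path)):
        if not value.startswith("gs://"):
            raise ValueError(f"{name} must be a gs:// path, got {value!r}")
    return {
        f"{WORKFLOW}.ancestry_file": ancestry_file,
        f"{WORKFLOW}.project_id": project_id,
        f"{WORKFLOW}.dataset_name": dataset_name,
        f"{WORKFLOW}.table_suffix": table_suffix,
        f"{WORKFLOW}.output_path": output_path,
    }


if __name__ == "__main__":
    # Placeholder values only; substitute your own project, dataset, and bucket.
    inputs = build_inputs(
        ancestry_file="gs://example-bucket/ancestry/ancestry.tsv",
        project_id="example-project",
        dataset_name="example_dataset",
        table_suffix="demo",
        output_path="gs://example-bucket/vat-output/",
    )
    print(json.dumps(inputs, indent=2))
    print("VAT table:", vat_table_name("demo"))
```

Checking the `gs://` prefix up front is only a convenience: it catches a local path for `ancestry_file` or `output_path` before the workflow is submitted.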

### Preparing to run GvsCreateVATfromVDS:
Three inputs need to be created from the VDS.
The third input can be created using the ancestry pipeline, which is in another workspace.
Collaborator

where?

Contributor Author

I dunno

### Preparing to run GvsCreateVATfromVDS:
Three inputs need to be created from the VDS.
The third input can be created using the ancestry pipeline, which is in another workspace.
The first and second inputs can be created using this Python script: `scripts/variantstore/wdl/extract/hail_create_vat_inputs.py`
Collaborator

must this be a cluster or could this be done in a WDL after copying down the inputs?

Contributor Author

good question---might be worth asking George actually. I'm not sure which he did

@RoriCremer RoriCremer marked this pull request as ready for review December 5, 2022 21:39
@RoriCremer RoriCremer merged commit 290fd23 into ah_var_store Dec 5, 2022
@RoriCremer RoriCremer deleted the rc-vds-vat-readme-update branch December 5, 2022 21:52
@gatk-bot

gatk-bot commented Dec 5, 2022

GitHub Actions tests reported job failures from actions build 3624257957.
Failures in the following jobs:

| Test Type | JDK | Job ID | Logs |
| --- | --- | --- | --- |
| cloud | 11 | 3624257957.11 | logs |
| integration | 11 | 3624257957.12 | logs |

This was referenced Mar 17, 2023
3 participants