-
Notifications
You must be signed in to change notification settings - Fork 587
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Ingest wip to be added to other var db code #6582
Merged
RoriCremer
merged 10 commits into
broadinstitute:ah_var_store
from
RoriCremer:rc-merge-attempt
May 8, 2020
Merged
Ingest wip to be added to other var db code #6582
RoriCremer
merged 10 commits into
broadinstitute:ah_var_store
from
RoriCremer:rc-merge-attempt
May 8, 2020
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
add pet creation to walker BRUH - cleanup the cword code update vet schema add headers enum and method to pet creation add xsv creation to loop update vet fix extra delim bug and show non-ref placeholder dont rebase me bruh Also fixed `[.....]` strings in the vet and converted a lot of strings to constants plus a couple of TODOs Fix GT String to output index of alleles instead of allele charaters fix enum bug add GQ band to drop as an optional param include end base code used to push PET and VET to BQ i dont know, it finally fuckign works....seriously. also schema jsons testing wip cleanup update schema and pray update schema cuz im bad added ability to pass in what GQ state to interpret as "missing" as well as skeleton for creating the metdata tsv (sample list, what interval list was use, the interval list's md5, and what GQ is signified by "missing". Also put some util files in a folder under BlahVariantWalkerUtils for now (should get rid of them eventually but nice to have for now) spacing + adding optional to GQ arguments (whoops) rename intervalListPath to intervalListBlob pass thru sample name swap out errors thrown add stringified interval list to metadata table fixup on interval blog rename use L for interval list This doesn't have the level of validation that we need, but works for now Ideally the interval list is required to be a picard interval list file wip testing to be dropped? interval wip add testing materials wip which type should this be passed in asg md5 interval list concated string wip split xsvs by contig add vet stripping intial changes review vet trim with Laura fix overzealous missing bug laura seal of approval on schema add back call_DP until we are sure evoq doesn't need it write chr dirs on the fly create dirs for chrs w pet and vet dirs inside add sample id to metadata semantics for sample id move position and sample into main method for easier access add sample id mapping and use it add enum for chr index use longs for the chr addition add array schema and enum add bool for array update array options with sample id and chr adjustment wip to remove chr separation new sample directory structure let the rename begin fixup on walker for arrays more renames better comments fixup on ingest walker
RoriCremer
changed the title
Pr to merge my forked branch into Andrea's gatk branch!
Ingest wip to be added to other var db code
May 4, 2020
ahaessly
pushed a commit
that referenced
this pull request
Aug 13, 2020
* initial vet creation class * initial variant walker for jg * initial pet creation class add pet creation to walker update vet schema add headers enum and method to pet creation add xsv creation to loop update vet fix extra delim bug and show non-ref placeholder Also fixed `[.....]` strings in the vet and converted a lot of strings to constants plus a couple of TODOs Fix GT String to output index of alleles instead of allele characters fix enum bug add GQ band to drop as an optional param include end base code used to push PET and VET to BQ added ability to pass in what GQ state to interpret as "missing" as well as skeleton for creating the metdata tsv (sample list, what interval list was use, the interval list's md5, and what GQ is signified by "missing". Also put some util files in a folder under BlahVariantWalkerUtils for now (should get rid of them eventually but nice to have for now) spacing + adding optional to GQ arguments (whoops) rename intervalListPath to intervalListBlob pass thru sample name swap out errors thrown use L for interval list This doesn't have the level of validation that we need, but works for now Ideally the interval list is required to be a picard interval list file md5 interval list concated string wip split xsvs by contig add vet stripping intial changes review vet trim with Laura fix overzealous missing bug laura seal of approval on schema add back call_DP until we are sure evoq doesn't need it write chr dirs on the fly create dirs for chrs w pet and vet dirs inside add sample id to metadata semantics for sample id move position and sample into main method for easier access add sample id mapping and use it add enum for chr index use longs for the chr addition add array schema and enum add bool for array update array options with sample id and chr adjustment wip to remove chr separation new sample directory structure let the rename begin fixup on walker for arrays more renames better comments fixup on ingest walker * unrelated conflict--maybe merge mistake? * cleanup * swap method for indexed alleles * add genome specific VET creation * move mode enum into common code * wip on adding enums as params
ahaessly
pushed a commit
that referenced
this pull request
Sep 11, 2020
* initial vet creation class * initial variant walker for jg * initial pet creation class add pet creation to walker update vet schema add headers enum and method to pet creation add xsv creation to loop update vet fix extra delim bug and show non-ref placeholder Also fixed `[.....]` strings in the vet and converted a lot of strings to constants plus a couple of TODOs Fix GT String to output index of alleles instead of allele characters fix enum bug add GQ band to drop as an optional param include end base code used to push PET and VET to BQ added ability to pass in what GQ state to interpret as "missing" as well as skeleton for creating the metdata tsv (sample list, what interval list was use, the interval list's md5, and what GQ is signified by "missing". Also put some util files in a folder under BlahVariantWalkerUtils for now (should get rid of them eventually but nice to have for now) spacing + adding optional to GQ arguments (whoops) rename intervalListPath to intervalListBlob pass thru sample name swap out errors thrown use L for interval list This doesn't have the level of validation that we need, but works for now Ideally the interval list is required to be a picard interval list file md5 interval list concated string wip split xsvs by contig add vet stripping intial changes review vet trim with Laura fix overzealous missing bug laura seal of approval on schema add back call_DP until we are sure evoq doesn't need it write chr dirs on the fly create dirs for chrs w pet and vet dirs inside add sample id to metadata semantics for sample id move position and sample into main method for easier access add sample id mapping and use it add enum for chr index use longs for the chr addition add array schema and enum add bool for array update array options with sample id and chr adjustment wip to remove chr separation new sample directory structure let the rename begin fixup on walker for arrays more renames better comments fixup on ingest walker * unrelated conflict--maybe merge mistake? * cleanup * swap method for indexed alleles * add genome specific VET creation * move mode enum into common code * wip on adding enums as params
meganshand
pushed a commit
that referenced
this pull request
Oct 6, 2020
* initial vet creation class * initial variant walker for jg * initial pet creation class add pet creation to walker update vet schema add headers enum and method to pet creation add xsv creation to loop update vet fix extra delim bug and show non-ref placeholder Also fixed `[.....]` strings in the vet and converted a lot of strings to constants plus a couple of TODOs Fix GT String to output index of alleles instead of allele characters fix enum bug add GQ band to drop as an optional param include end base code used to push PET and VET to BQ added ability to pass in what GQ state to interpret as "missing" as well as skeleton for creating the metdata tsv (sample list, what interval list was use, the interval list's md5, and what GQ is signified by "missing". Also put some util files in a folder under BlahVariantWalkerUtils for now (should get rid of them eventually but nice to have for now) spacing + adding optional to GQ arguments (whoops) rename intervalListPath to intervalListBlob pass thru sample name swap out errors thrown use L for interval list This doesn't have the level of validation that we need, but works for now Ideally the interval list is required to be a picard interval list file md5 interval list concated string wip split xsvs by contig add vet stripping intial changes review vet trim with Laura fix overzealous missing bug laura seal of approval on schema add back call_DP until we are sure evoq doesn't need it write chr dirs on the fly create dirs for chrs w pet and vet dirs inside add sample id to metadata semantics for sample id move position and sample into main method for easier access add sample id mapping and use it add enum for chr index use longs for the chr addition add array schema and enum add bool for array update array options with sample id and chr adjustment wip to remove chr separation new sample directory structure let the rename begin fixup on walker for arrays more renames better comments fixup on ingest walker * unrelated conflict--maybe merge mistake? * cleanup * swap method for indexed alleles * add genome specific VET creation * move mode enum into common code * wip on adding enums as params
kcibul
pushed a commit
that referenced
this pull request
Jan 29, 2021
* initial vet creation class * initial variant walker for jg * initial pet creation class add pet creation to walker update vet schema add headers enum and method to pet creation add xsv creation to loop update vet fix extra delim bug and show non-ref placeholder Also fixed `[.....]` strings in the vet and converted a lot of strings to constants plus a couple of TODOs Fix GT String to output index of alleles instead of allele characters fix enum bug add GQ band to drop as an optional param include end base code used to push PET and VET to BQ added ability to pass in what GQ state to interpret as "missing" as well as skeleton for creating the metdata tsv (sample list, what interval list was use, the interval list's md5, and what GQ is signified by "missing". Also put some util files in a folder under BlahVariantWalkerUtils for now (should get rid of them eventually but nice to have for now) spacing + adding optional to GQ arguments (whoops) rename intervalListPath to intervalListBlob pass thru sample name swap out errors thrown use L for interval list This doesn't have the level of validation that we need, but works for now Ideally the interval list is required to be a picard interval list file md5 interval list concated string wip split xsvs by contig add vet stripping intial changes review vet trim with Laura fix overzealous missing bug laura seal of approval on schema add back call_DP until we are sure evoq doesn't need it write chr dirs on the fly create dirs for chrs w pet and vet dirs inside add sample id to metadata semantics for sample id move position and sample into main method for easier access add sample id mapping and use it add enum for chr index use longs for the chr addition add array schema and enum add bool for array update array options with sample id and chr adjustment wip to remove chr separation new sample directory structure let the rename begin fixup on walker for arrays more renames better comments fixup on ingest walker * unrelated conflict--maybe merge mistake? * cleanup * swap method for indexed alleles * add genome specific VET creation * move mode enum into common code * wip on adding enums as params
kcibul
pushed a commit
that referenced
this pull request
Jan 29, 2021
* initial vet creation class * initial variant walker for jg * initial pet creation class add pet creation to walker update vet schema add headers enum and method to pet creation add xsv creation to loop update vet fix extra delim bug and show non-ref placeholder Also fixed `[.....]` strings in the vet and converted a lot of strings to constants plus a couple of TODOs Fix GT String to output index of alleles instead of allele characters fix enum bug add GQ band to drop as an optional param include end base code used to push PET and VET to BQ added ability to pass in what GQ state to interpret as "missing" as well as skeleton for creating the metdata tsv (sample list, what interval list was use, the interval list's md5, and what GQ is signified by "missing". Also put some util files in a folder under BlahVariantWalkerUtils for now (should get rid of them eventually but nice to have for now) spacing + adding optional to GQ arguments (whoops) rename intervalListPath to intervalListBlob pass thru sample name swap out errors thrown use L for interval list This doesn't have the level of validation that we need, but works for now Ideally the interval list is required to be a picard interval list file md5 interval list concated string wip split xsvs by contig add vet stripping intial changes review vet trim with Laura fix overzealous missing bug laura seal of approval on schema add back call_DP until we are sure evoq doesn't need it write chr dirs on the fly create dirs for chrs w pet and vet dirs inside add sample id to metadata semantics for sample id move position and sample into main method for easier access add sample id mapping and use it add enum for chr index use longs for the chr addition add array schema and enum add bool for array update array options with sample id and chr adjustment wip to remove chr separation new sample directory structure let the rename begin fixup on walker for arrays more renames better comments fixup on ingest walker * unrelated conflict--maybe merge mistake? * cleanup * swap method for indexed alleles * add genome specific VET creation * move mode enum into common code * wip on adding enums as params
kcibul
pushed a commit
that referenced
this pull request
Feb 1, 2021
* initial vet creation class * initial variant walker for jg * initial pet creation class add pet creation to walker update vet schema add headers enum and method to pet creation add xsv creation to loop update vet fix extra delim bug and show non-ref placeholder Also fixed `[.....]` strings in the vet and converted a lot of strings to constants plus a couple of TODOs Fix GT String to output index of alleles instead of allele characters fix enum bug add GQ band to drop as an optional param include end base code used to push PET and VET to BQ added ability to pass in what GQ state to interpret as "missing" as well as skeleton for creating the metdata tsv (sample list, what interval list was use, the interval list's md5, and what GQ is signified by "missing". Also put some util files in a folder under BlahVariantWalkerUtils for now (should get rid of them eventually but nice to have for now) spacing + adding optional to GQ arguments (whoops) rename intervalListPath to intervalListBlob pass thru sample name swap out errors thrown use L for interval list This doesn't have the level of validation that we need, but works for now Ideally the interval list is required to be a picard interval list file md5 interval list concated string wip split xsvs by contig add vet stripping intial changes review vet trim with Laura fix overzealous missing bug laura seal of approval on schema add back call_DP until we are sure evoq doesn't need it write chr dirs on the fly create dirs for chrs w pet and vet dirs inside add sample id to metadata semantics for sample id move position and sample into main method for easier access add sample id mapping and use it add enum for chr index use longs for the chr addition add array schema and enum add bool for array update array options with sample id and chr adjustment wip to remove chr separation new sample directory structure let the rename begin fixup on walker for arrays more renames better comments fixup on ingest walker * unrelated conflict--maybe merge mistake? * cleanup * swap method for indexed alleles * add genome specific VET creation * move mode enum into common code * wip on adding enums as params
kcibul
pushed a commit
that referenced
this pull request
Feb 1, 2021
* initial vet creation class * initial variant walker for jg * initial pet creation class add pet creation to walker update vet schema add headers enum and method to pet creation add xsv creation to loop update vet fix extra delim bug and show non-ref placeholder Also fixed `[.....]` strings in the vet and converted a lot of strings to constants plus a couple of TODOs Fix GT String to output index of alleles instead of allele characters fix enum bug add GQ band to drop as an optional param include end base code used to push PET and VET to BQ added ability to pass in what GQ state to interpret as "missing" as well as skeleton for creating the metdata tsv (sample list, what interval list was use, the interval list's md5, and what GQ is signified by "missing". Also put some util files in a folder under BlahVariantWalkerUtils for now (should get rid of them eventually but nice to have for now) spacing + adding optional to GQ arguments (whoops) rename intervalListPath to intervalListBlob pass thru sample name swap out errors thrown use L for interval list This doesn't have the level of validation that we need, but works for now Ideally the interval list is required to be a picard interval list file md5 interval list concated string wip split xsvs by contig add vet stripping intial changes review vet trim with Laura fix overzealous missing bug laura seal of approval on schema add back call_DP until we are sure evoq doesn't need it write chr dirs on the fly create dirs for chrs w pet and vet dirs inside add sample id to metadata semantics for sample id move position and sample into main method for easier access add sample id mapping and use it add enum for chr index use longs for the chr addition add array schema and enum add bool for array update array options with sample id and chr adjustment wip to remove chr separation new sample directory structure let the rename begin fixup on walker for arrays more renames better comments fixup on ingest walker * unrelated conflict--maybe merge mistake? * cleanup * swap method for indexed alleles * add genome specific VET creation * move mode enum into common code * wip on adding enums as params
Marianie-Simeon
pushed a commit
that referenced
this pull request
Feb 16, 2021
* initial vet creation class * initial variant walker for jg * initial pet creation class add pet creation to walker update vet schema add headers enum and method to pet creation add xsv creation to loop update vet fix extra delim bug and show non-ref placeholder Also fixed `[.....]` strings in the vet and converted a lot of strings to constants plus a couple of TODOs Fix GT String to output index of alleles instead of allele characters fix enum bug add GQ band to drop as an optional param include end base code used to push PET and VET to BQ added ability to pass in what GQ state to interpret as "missing" as well as skeleton for creating the metdata tsv (sample list, what interval list was use, the interval list's md5, and what GQ is signified by "missing". Also put some util files in a folder under BlahVariantWalkerUtils for now (should get rid of them eventually but nice to have for now) spacing + adding optional to GQ arguments (whoops) rename intervalListPath to intervalListBlob pass thru sample name swap out errors thrown use L for interval list This doesn't have the level of validation that we need, but works for now Ideally the interval list is required to be a picard interval list file md5 interval list concated string wip split xsvs by contig add vet stripping intial changes review vet trim with Laura fix overzealous missing bug laura seal of approval on schema add back call_DP until we are sure evoq doesn't need it write chr dirs on the fly create dirs for chrs w pet and vet dirs inside add sample id to metadata semantics for sample id move position and sample into main method for easier access add sample id mapping and use it add enum for chr index use longs for the chr addition add array schema and enum add bool for array update array options with sample id and chr adjustment wip to remove chr separation new sample directory structure let the rename begin fixup on walker for arrays more renames better comments fixup on ingest walker * unrelated conflict--maybe merge mistake? * cleanup * swap method for indexed alleles * add genome specific VET creation * move mode enum into common code * wip on adding enums as params
Marianie-Simeon
pushed a commit
that referenced
this pull request
Feb 16, 2021
* initial vet creation class * initial variant walker for jg * initial pet creation class add pet creation to walker update vet schema add headers enum and method to pet creation add xsv creation to loop update vet fix extra delim bug and show non-ref placeholder Also fixed `[.....]` strings in the vet and converted a lot of strings to constants plus a couple of TODOs Fix GT String to output index of alleles instead of allele characters fix enum bug add GQ band to drop as an optional param include end base code used to push PET and VET to BQ added ability to pass in what GQ state to interpret as "missing" as well as skeleton for creating the metdata tsv (sample list, what interval list was use, the interval list's md5, and what GQ is signified by "missing". Also put some util files in a folder under BlahVariantWalkerUtils for now (should get rid of them eventually but nice to have for now) spacing + adding optional to GQ arguments (whoops) rename intervalListPath to intervalListBlob pass thru sample name swap out errors thrown use L for interval list This doesn't have the level of validation that we need, but works for now Ideally the interval list is required to be a picard interval list file md5 interval list concated string wip split xsvs by contig add vet stripping intial changes review vet trim with Laura fix overzealous missing bug laura seal of approval on schema add back call_DP until we are sure evoq doesn't need it write chr dirs on the fly create dirs for chrs w pet and vet dirs inside add sample id to metadata semantics for sample id move position and sample into main method for easier access add sample id mapping and use it add enum for chr index use longs for the chr addition add array schema and enum add bool for array update array options with sample id and chr adjustment wip to remove chr separation new sample directory structure let the rename begin fixup on walker for arrays more renames better comments fixup on ingest walker * unrelated conflict--maybe merge mistake? * cleanup * swap method for indexed alleles * add genome specific VET creation * move mode enum into common code * wip on adding enums as params
kcibul
pushed a commit
that referenced
this pull request
Mar 9, 2021
* initial vet creation class * initial variant walker for jg * initial pet creation class add pet creation to walker update vet schema add headers enum and method to pet creation add xsv creation to loop update vet fix extra delim bug and show non-ref placeholder Also fixed `[.....]` strings in the vet and converted a lot of strings to constants plus a couple of TODOs Fix GT String to output index of alleles instead of allele characters fix enum bug add GQ band to drop as an optional param include end base code used to push PET and VET to BQ added ability to pass in what GQ state to interpret as "missing" as well as skeleton for creating the metdata tsv (sample list, what interval list was use, the interval list's md5, and what GQ is signified by "missing". Also put some util files in a folder under BlahVariantWalkerUtils for now (should get rid of them eventually but nice to have for now) spacing + adding optional to GQ arguments (whoops) rename intervalListPath to intervalListBlob pass thru sample name swap out errors thrown use L for interval list This doesn't have the level of validation that we need, but works for now Ideally the interval list is required to be a picard interval list file md5 interval list concated string wip split xsvs by contig add vet stripping intial changes review vet trim with Laura fix overzealous missing bug laura seal of approval on schema add back call_DP until we are sure evoq doesn't need it write chr dirs on the fly create dirs for chrs w pet and vet dirs inside add sample id to metadata semantics for sample id move position and sample into main method for easier access add sample id mapping and use it add enum for chr index use longs for the chr addition add array schema and enum add bool for array update array options with sample id and chr adjustment wip to remove chr separation new sample directory structure let the rename begin fixup on walker for arrays more renames better comments fixup on ingest walker * unrelated conflict--maybe merge mistake? * cleanup * swap method for indexed alleles * add genome specific VET creation * move mode enum into common code * wip on adding enums as params
kcibul
pushed a commit
that referenced
this pull request
Mar 9, 2021
* initial vet creation class * initial variant walker for jg * initial pet creation class add pet creation to walker update vet schema add headers enum and method to pet creation add xsv creation to loop update vet fix extra delim bug and show non-ref placeholder Also fixed `[.....]` strings in the vet and converted a lot of strings to constants plus a couple of TODOs Fix GT String to output index of alleles instead of allele characters fix enum bug add GQ band to drop as an optional param include end base code used to push PET and VET to BQ added ability to pass in what GQ state to interpret as "missing" as well as skeleton for creating the metdata tsv (sample list, what interval list was use, the interval list's md5, and what GQ is signified by "missing". Also put some util files in a folder under BlahVariantWalkerUtils for now (should get rid of them eventually but nice to have for now) spacing + adding optional to GQ arguments (whoops) rename intervalListPath to intervalListBlob pass thru sample name swap out errors thrown use L for interval list This doesn't have the level of validation that we need, but works for now Ideally the interval list is required to be a picard interval list file md5 interval list concated string wip split xsvs by contig add vet stripping intial changes review vet trim with Laura fix overzealous missing bug laura seal of approval on schema add back call_DP until we are sure evoq doesn't need it write chr dirs on the fly create dirs for chrs w pet and vet dirs inside add sample id to metadata semantics for sample id move position and sample into main method for easier access add sample id mapping and use it add enum for chr index use longs for the chr addition add array schema and enum add bool for array update array options with sample id and chr adjustment wip to remove chr separation new sample directory structure let the rename begin fixup on walker for arrays more renames better comments fixup on ingest walker * unrelated conflict--maybe merge mistake? * cleanup * swap method for indexed alleles * add genome specific VET creation * move mode enum into common code * wip on adding enums as params
mmorgantaylor
pushed a commit
that referenced
this pull request
Apr 6, 2021
* initial vet creation class * initial variant walker for jg * initial pet creation class add pet creation to walker update vet schema add headers enum and method to pet creation add xsv creation to loop update vet fix extra delim bug and show non-ref placeholder Also fixed `[.....]` strings in the vet and converted a lot of strings to constants plus a couple of TODOs Fix GT String to output index of alleles instead of allele characters fix enum bug add GQ band to drop as an optional param include end base code used to push PET and VET to BQ added ability to pass in what GQ state to interpret as "missing" as well as skeleton for creating the metdata tsv (sample list, what interval list was use, the interval list's md5, and what GQ is signified by "missing". Also put some util files in a folder under BlahVariantWalkerUtils for now (should get rid of them eventually but nice to have for now) spacing + adding optional to GQ arguments (whoops) rename intervalListPath to intervalListBlob pass thru sample name swap out errors thrown use L for interval list This doesn't have the level of validation that we need, but works for now Ideally the interval list is required to be a picard interval list file md5 interval list concated string wip split xsvs by contig add vet stripping intial changes review vet trim with Laura fix overzealous missing bug laura seal of approval on schema add back call_DP until we are sure evoq doesn't need it write chr dirs on the fly create dirs for chrs w pet and vet dirs inside add sample id to metadata semantics for sample id move position and sample into main method for easier access add sample id mapping and use it add enum for chr index use longs for the chr addition add array schema and enum add bool for array update array options with sample id and chr adjustment wip to remove chr separation new sample directory structure let the rename begin fixup on walker for arrays more renames better comments fixup on ingest walker * unrelated conflict--maybe merge mistake? * cleanup * swap method for indexed alleles * add genome specific VET creation * move mode enum into common code * wip on adding enums as params
mmorgantaylor
pushed a commit
that referenced
this pull request
Apr 6, 2021
* initial vet creation class * initial variant walker for jg * initial pet creation class add pet creation to walker update vet schema add headers enum and method to pet creation add xsv creation to loop update vet fix extra delim bug and show non-ref placeholder Also fixed `[.....]` strings in the vet and converted a lot of strings to constants plus a couple of TODOs Fix GT String to output index of alleles instead of allele characters fix enum bug add GQ band to drop as an optional param include end base code used to push PET and VET to BQ added ability to pass in what GQ state to interpret as "missing" as well as skeleton for creating the metdata tsv (sample list, what interval list was use, the interval list's md5, and what GQ is signified by "missing". Also put some util files in a folder under BlahVariantWalkerUtils for now (should get rid of them eventually but nice to have for now) spacing + adding optional to GQ arguments (whoops) rename intervalListPath to intervalListBlob pass thru sample name swap out errors thrown use L for interval list This doesn't have the level of validation that we need, but works for now Ideally the interval list is required to be a picard interval list file md5 interval list concated string wip split xsvs by contig add vet stripping intial changes review vet trim with Laura fix overzealous missing bug laura seal of approval on schema add back call_DP until we are sure evoq doesn't need it write chr dirs on the fly create dirs for chrs w pet and vet dirs inside add sample id to metadata semantics for sample id move position and sample into main method for easier access add sample id mapping and use it add enum for chr index use longs for the chr addition add array schema and enum add bool for array update array options with sample id and chr adjustment wip to remove chr separation new sample directory structure let the rename begin fixup on walker for arrays more renames better comments fixup on ingest walker * unrelated conflict--maybe merge mistake? * cleanup * swap method for indexed alleles * add genome specific VET creation * move mode enum into common code * wip on adding enums as params
mmorgantaylor
pushed a commit
that referenced
this pull request
Apr 6, 2021
* initial vet creation class * initial variant walker for jg * initial pet creation class add pet creation to walker update vet schema add headers enum and method to pet creation add xsv creation to loop update vet fix extra delim bug and show non-ref placeholder Also fixed `[.....]` strings in the vet and converted a lot of strings to constants plus a couple of TODOs Fix GT String to output index of alleles instead of allele characters fix enum bug add GQ band to drop as an optional param include end base code used to push PET and VET to BQ added ability to pass in what GQ state to interpret as "missing" as well as skeleton for creating the metdata tsv (sample list, what interval list was use, the interval list's md5, and what GQ is signified by "missing". Also put some util files in a folder under BlahVariantWalkerUtils for now (should get rid of them eventually but nice to have for now) spacing + adding optional to GQ arguments (whoops) rename intervalListPath to intervalListBlob pass thru sample name swap out errors thrown use L for interval list This doesn't have the level of validation that we need, but works for now Ideally the interval list is required to be a picard interval list file md5 interval list concated string wip split xsvs by contig add vet stripping intial changes review vet trim with Laura fix overzealous missing bug laura seal of approval on schema add back call_DP until we are sure evoq doesn't need it write chr dirs on the fly create dirs for chrs w pet and vet dirs inside add sample id to metadata semantics for sample id move position and sample into main method for easier access add sample id mapping and use it add enum for chr index use longs for the chr addition add array schema and enum add bool for array update array options with sample id and chr adjustment wip to remove chr separation new sample directory structure let the rename begin fixup on walker for arrays more renames better comments fixup on ingest walker * unrelated conflict--maybe merge mistake? * cleanup * swap method for indexed alleles * add genome specific VET creation * move mode enum into common code * wip on adding enums as params
mmorgantaylor
pushed a commit
that referenced
this pull request
Apr 6, 2021
* initial vet creation class * initial variant walker for jg * initial pet creation class add pet creation to walker update vet schema add headers enum and method to pet creation add xsv creation to loop update vet fix extra delim bug and show non-ref placeholder Also fixed `[.....]` strings in the vet and converted a lot of strings to constants plus a couple of TODOs Fix GT String to output index of alleles instead of allele characters fix enum bug add GQ band to drop as an optional param include end base code used to push PET and VET to BQ added ability to pass in what GQ state to interpret as "missing" as well as skeleton for creating the metdata tsv (sample list, what interval list was use, the interval list's md5, and what GQ is signified by "missing". Also put some util files in a folder under BlahVariantWalkerUtils for now (should get rid of them eventually but nice to have for now) spacing + adding optional to GQ arguments (whoops) rename intervalListPath to intervalListBlob pass thru sample name swap out errors thrown use L for interval list This doesn't have the level of validation that we need, but works for now Ideally the interval list is required to be a picard interval list file md5 interval list concated string wip split xsvs by contig add vet stripping intial changes review vet trim with Laura fix overzealous missing bug laura seal of approval on schema add back call_DP until we are sure evoq doesn't need it write chr dirs on the fly create dirs for chrs w pet and vet dirs inside add sample id to metadata semantics for sample id move position and sample into main method for easier access add sample id mapping and use it add enum for chr index use longs for the chr addition add array schema and enum add bool for array update array options with sample id and chr adjustment wip to remove chr separation new sample directory structure let the rename begin fixup on walker for arrays more renames better comments fixup on ingest walker * unrelated conflict--maybe merge mistake? * cleanup * swap method for indexed alleles * add genome specific VET creation * move mode enum into common code * wip on adding enums as params
Closed
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Pr to merge my forked branch into Andrea's gatk branch!