Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

VET Ingest Validation / Allow Ingest of non-VQSR'ed data #7870

Merged
merged 3 commits into from
Jun 2, 2022

Conversation

gbggrant
Copy link
Collaborator

  • Added parameter to CreateVariantIngestFiles to allow the user to skip VQSR-specific fields.

@codecov
Copy link

codecov bot commented May 24, 2022

Codecov Report

❗ No coverage uploaded for pull request base (ah_var_store@91c33df). Click here to learn what that means.
The diff coverage is n/a.

@@               Coverage Diff                @@
##             ah_var_store     #7870   +/-   ##
================================================
  Coverage                ?   86.291%           
  Complexity              ?     35191           
================================================
  Files                   ?      2170           
  Lines                   ?    164888           
  Branches                ?     17786           
================================================
  Hits                    ?    142284           
  Misses                  ?     16280           
  Partials                ?      6324           

@gbggrant
Copy link
Collaborator Author

Successfully ran the QuickStartIntegration here: https://job-manager.dsde-prod.broadinstitute.org/jobs/e13334c6-e6a0-4f17-aad5-62479cc4f878

@gbggrant
Copy link
Collaborator Author

gbggrant commented May 26, 2022

Ran the QuickStartIntegration here telling it to skip VQSR specific fields - https://job-manager.dsde-prod.broadinstitute.org/jobs/bed26dc5-c653-4c56-be82-a668d65a9560 - but it passed comparison.

row.add(sampleId);
} else {
}
else if (!skipLoadingVqsrFields || !fieldEnum.isVqsrSpecificField()) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I found this a little hard to follow mentally… what about?

} else if ( !(skipLoadingVqsrFields && fieldEnum.isVqsrSpecificField() ) ) {

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks - that is clearer. I've updated it.

Copy link
Contributor

@kcibul kcibul left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

love this implementation -- so elegant (assuming it works!)

- Added validation for presence of GT and GQ
- Added parameter to CreateVariantIngestFiles to allow the user to skip VQSR-specific fields.
- Push 'skip_loading_vqsr_fields' up to be a top-level input
@gbggrant gbggrant force-pushed the gg_VS-443_VETIngestValidation branch from b3ca40d to 83a83ad Compare June 1, 2022 14:00
@@ -240,6 +240,11 @@ task GetBQTablesMaxLastModifiedTimestamp {
}

task BuildGATKJarAndCreateDataset {
# Since this might be called repeatedly on the same branch (and the latest commit might have been updated), so we never call cache.
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This addresses VS-455

@gbggrant gbggrant marked this pull request as ready for review June 1, 2022 14:06
@gbggrant gbggrant requested a review from mcovarr June 1, 2022 14:06
@gbggrant
Copy link
Collaborator Author

gbggrant commented Jun 1, 2022

@gbggrant gbggrant changed the title Start on VET Ingest Validation VET Ingest Validation / Allow Ingest of non-VQSR'ed data Jun 2, 2022
Copy link

@rsasch rsasch left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM 👍🏻

@@ -91,9 +88,11 @@ public List<String> createRow(final long location, final VariantContext variant,
for ( final VetFieldEnum fieldEnum : VetFieldEnum.values() ) {
if (fieldEnum.equals(VetFieldEnum.location)) {
row.add(String.valueOf(location));
} else if (fieldEnum.equals(VetFieldEnum.sample_id)) {
}
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The if/else reformatting here and below is actually contrary to what my IntelliJ would do; is there a reason for this?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah, that's me. I followed a different style guide in a former life and don't like the way it looks like that. But I'll stick with what IJ does.

// KC: we are seeing a TON of these!
// if (!out.endsWith("|0.00")) {
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

commented out code

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah, @kcibul commented out the logger statement, I just expanded the commenting out to take out all the functional code.

Comment on lines -67 to -71
List<String> row = createRow(
location,
variant,
sampleId
);
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

TOL perhaps rather than deleting this and inlining the createRow on line 80 this could just be moved into the TSV case?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Not sure I understand. Do you not like the inlining and want the code block to remain explicit in the TSV case?

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Right, I was suggesting to keep this block explicit but only in the case TSV: since that's the only spot it's used.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

👍

@gbggrant gbggrant merged commit 4d30135 into ah_var_store Jun 2, 2022
@gbggrant gbggrant deleted the gg_VS-443_VETIngestValidation branch June 2, 2022 20:28
This was referenced Mar 17, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants