-
Notifications
You must be signed in to change notification settings - Fork 603
Added a useful debugging output mode that prints a summary of every assembly region #6441
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Conversation
@davidbenjamin Care to have a look at this one? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Back to @jamesemery
...r/tools/walkers/haplotypecaller/HaplotypeCallerReadThreadingAssemblerArgumentCollection.java
Outdated
Show resolved
Hide resolved
...itute/hellbender/tools/walkers/haplotypecaller/ReadThreadingAssemblerArgumentCollection.java
Outdated
Show resolved
Hide resolved
...itute/hellbender/tools/walkers/haplotypecaller/ReadThreadingAssemblerArgumentCollection.java
Outdated
Show resolved
Hide resolved
...institute/hellbender/tools/walkers/haplotypecaller/readthreading/ReadThreadingAssembler.java
Outdated
Show resolved
Hide resolved
...broadinstitute/hellbender/tools/walkers/variantutils/SummarizeActiveRegionOutAgainstVCF.java
Outdated
Show resolved
Hide resolved
List<VariantContext> overlappingVariants = featureContext.getValues(overlappingVariantInput); | ||
|
||
outputTableWriter.getNewLineBuilder() | ||
.setColumn("active_region", feature.get("active_region")) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is there any way to write this more explicitly as appending the count of overlapping variants to the input feature?
.../broadinstitute/hellbender/tools/walkers/haplotypecaller/HaplotypeCallerIntegrationTest.java
Outdated
Show resolved
Hide resolved
...hellbender/tools/walkers/variantutils/SummarizeActiveRegionOutAgainstVCFIntegrationTest.java
Show resolved
Hide resolved
final File inputAssemblyRegion = new File(TEST_FILES_DIR, "testAssemblyRegionOutput.table"); | ||
final File inputVCF = new File(TEST_FILES_DIR, "expected.testVCFMode.gatk4.vcf"); | ||
|
||
final String[] args = { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
An ArgumentsBuilder
would be cleaner
if ( debugActiveRegionOutputStream != null ) { | ||
debugActiveRegionOutputStream.getNewLineBuilder() | ||
.setRow(new String[]{ | ||
assemblyRegion.getSpan().toString(), |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What would you think about a boolean option to use getExtendedSpan
, which would give you the intervals (possibly overlapping) over which the GATK assembled? I could imagine that being useful, too.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I see two options here, both are potentially useful. I ask your opinion between the two. FIrst, I could add a second argument flag that dictates what level of padding gets included (which could possibly be expanded in the future). Second (and probably my preference) would be to add these as another column in the output of the file (say: unpadded_region) so the information would always be accessable.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Let's go with the second option.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
it is now an extra column in every case. This will allow people to use either line they please if they use their bash magic to extract it, which is ultimately the goal here.
31cdd0d
to
35fe0ec
Compare
@davidbenjamin back to you |
👍 |
…data for the future Created a dummy tool to add a new annotation to my table file so I can produce a pretty plot added a hack to make the reading work
35fe0ec
to
1221ebb
Compare
This is intended for debugging purposes.