Skip to content

Added a useful debugging output mode that prints a summary of every assembly region #6441

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 8 commits into
base: master
Choose a base branch
from

Conversation

jamesemery
Copy link
Collaborator

This is intended for debugging purposes.

@jamesemery jamesemery requested a review from lbergelson February 4, 2020 19:35
@gatk-bot
Copy link

gatk-bot commented Feb 4, 2020

Travis reported job failures from build 28970
Failures in the following jobs:

Test Type JDK Job ID Logs
unit openjdk11 28970.13 logs
unit openjdk8 28970.3 logs

@gatk-bot
Copy link

gatk-bot commented Feb 4, 2020

Travis reported job failures from build 28972
Failures in the following jobs:

Test Type JDK Job ID Logs
unit openjdk11 28972.13 logs
unit openjdk8 28972.3 logs

@droazen
Copy link
Contributor

droazen commented Feb 5, 2020

@davidbenjamin Care to have a look at this one?

Copy link
Contributor

@davidbenjamin davidbenjamin left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Back to @jamesemery

List<VariantContext> overlappingVariants = featureContext.getValues(overlappingVariantInput);

outputTableWriter.getNewLineBuilder()
.setColumn("active_region", feature.get("active_region"))
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is there any way to write this more explicitly as appending the count of overlapping variants to the input feature?

final File inputAssemblyRegion = new File(TEST_FILES_DIR, "testAssemblyRegionOutput.table");
final File inputVCF = new File(TEST_FILES_DIR, "expected.testVCFMode.gatk4.vcf");

final String[] args = {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

An ArgumentsBuilder would be cleaner

if ( debugActiveRegionOutputStream != null ) {
debugActiveRegionOutputStream.getNewLineBuilder()
.setRow(new String[]{
assemblyRegion.getSpan().toString(),
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What would you think about a boolean option to use getExtendedSpan, which would give you the intervals (possibly overlapping) over which the GATK assembled? I could imagine that being useful, too.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I see two options here, both are potentially useful. I ask your opinion between the two. FIrst, I could add a second argument flag that dictates what level of padding gets included (which could possibly be expanded in the future). Second (and probably my preference) would be to add these as another column in the output of the file (say: unpadded_region) so the information would always be accessable.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Let's go with the second option.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

it is now an extra column in every case. This will allow people to use either line they please if they use their bash magic to extract it, which is ultimately the goal here.

@jamesemery
Copy link
Collaborator Author

@davidbenjamin back to you

@davidbenjamin
Copy link
Contributor

👍

@gatk-bot
Copy link

gatk-bot commented Feb 7, 2020

Travis reported job failures from build 29031
Failures in the following jobs:

Test Type JDK Job ID Logs
unit openjdk11 29031.13 logs
unit openjdk8 29031.3 logs

@jamesemery jamesemery force-pushed the je_addAssemblyRegionOutFileToGraph branch from 35fe0ec to 1221ebb Compare February 28, 2020 20:16
@gatk-bot
Copy link

gatk-bot commented Feb 28, 2020

Travis reported job failures from build 29307
Failures in the following jobs:

Test Type JDK Job ID Logs
unit openjdk11 29307.14 logs
variantcalling openjdk8 29307.4 logs
unit openjdk8 29307.3 logs

@gatk-bot
Copy link

gatk-bot commented Feb 28, 2020

Travis reported job failures from build 29311
Failures in the following jobs:

Test Type JDK Job ID Logs
unit openjdk11 29311.14 logs
unit openjdk8 29311.3 logs

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants