Is metaviral mode's Length threshold flexible? #1437

Open
@handibles

Description

Is your feature request related to a problem? Please describe. For generic questions use Q&A section in the Discussions forum above.

Thanks for all the work, devs.

I'm comparing metaviral SPAdes with standard SPAdes + VirSorter2. We usually apply a 10 kbp minimum sequence length post hoc, but I'm wondering how raising the Length threshold mentioned in the paper would affect the graph assembly. From the paper:

We classify such a path as long if its length exceeds a threshold Length (the default value 1000 nucleotides) and high coverage if its coverage exceeds a threshold Coverage (the default value 5×). Long high-coverage paths represent putative sequences of linear viruses.
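To make the quoted rule concrete, here is a minimal sketch of the classification as I read it. The function and constant names are hypothetical and are not taken from the SPAdes source; only the default values (1000 nucleotides, 5×) come from the paper, and reading "exceeds" as a strict inequality is my assumption.

```python
# Hypothetical names; the defaults are the values quoted from the paper.
LENGTH_THRESHOLD = 1000    # nucleotides
COVERAGE_THRESHOLD = 5.0   # fold coverage


def is_putative_linear_virus(path_length, path_coverage,
                             length_threshold=LENGTH_THRESHOLD,
                             coverage_threshold=COVERAGE_THRESHOLD):
    """Return True if a path is both long and high coverage.

    "Exceeds" is read here as a strict inequality (assumption).
    """
    is_long = path_length > length_threshold
    is_high_coverage = path_coverage > coverage_threshold
    return is_long and is_high_coverage
```

Under this reading, raising the length threshold to 10,000 would simply stop shorter paths from being reported as putative linear viruses; whether it also changes anything upstream in the graph is exactly what I'm unsure about.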

I've found similar-looking thresholds in the hybrid_aligner definition in config.info, but those might apply to hybrid assembly specifically:

long_read_threshold 1000

Describe the solution you'd like

  • Should the Length parameter be considered flexible? Can it be set from the CLI?
  • Would modifying the value above change the programme's behaviour as intended, or do those parameters only apply to long/short-read hybrid assembly? If the latter, is there another location where the threshold can be increased from 1,000 to (e.g.) 10,000?

Describe alternatives you've considered

Currently, holding off on brutalising the lovely code.
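For reference, the post-hoc length filter mentioned above amounts to something like the sketch below. It is a simplified stand-in rather than our actual pipeline: the helper names are hypothetical, it uses only the standard library, and it assumes a plain FASTA file of assembled contigs. Because it runs after assembly, it cannot influence how paths are selected in the graph, which is why I'm asking about the internal threshold.

```python
def read_fasta(lines):
    """Yield (header, sequence) pairs from an iterable of FASTA lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new record starts; emit the previous one, if any.
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def filter_by_length(records, min_length=10_000):
    """Keep records whose sequence has at least min_length nucleotides."""
    return [(h, s) for h, s in records if len(s) >= min_length]
```

Usage would be along the lines of `filter_by_length(read_fasta(open("contigs.fasta")))`, where the file name is a placeholder for whichever assembly output is being filtered.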

Additional context

Thanks for the help.
