[New pipeline] RepeatMaskMyGenome #33

@Juke34

See #17 for the general picture.

Maybe this can be merged with the DeNovoRepeatLib pipeline (see #32).

The purpose of RepeatMaskMyGenome is to repeat-mask a genome using a repeat library, either built de novo or taken from an existing database (e.g. Dfam or RepBase).
Having this pipeline would make it easier to move to an annotation tool other than MAKER if needed.

This pipeline consists of 3 main steps:

  • Split the genome into chunks (overlapping or not? We can look at the MAKER code to see how they do it.)
  • Mask the chunks
  • Merge the annotations of the different chunks (if the chunks overlap, we need a sound way to resolve hits found in the overlapping regions)
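
The split and merge steps above could be sketched roughly as follows. This is only an illustration under assumptions, not how MAKER does it: the function names (`split_into_chunks`, `to_genome_coords`, `merge_hits`) are hypothetical, coordinates are 0-based half-open, and overlapping hits are resolved by simply fusing them into one interval, which discards any repeat family information.

```python
from typing import List, Tuple

Interval = Tuple[int, int]  # 0-based, half-open [start, end)


def split_into_chunks(seq_len: int, chunk_size: int, overlap: int = 0) -> List[Interval]:
    """Return chunk coordinates covering a sequence of length seq_len.

    Consecutive chunks share `overlap` bases; the last chunk may be shorter.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be >= 0 and smaller than chunk_size")
    step = chunk_size - overlap
    chunks: List[Interval] = []
    start = 0
    while start < seq_len:
        end = min(start + chunk_size, seq_len)
        chunks.append((start, end))
        if end == seq_len:
            break  # the sequence is fully covered
        start += step
    return chunks


def to_genome_coords(chunk_start: int, hits: List[Interval]) -> List[Interval]:
    """Shift chunk-local hit coordinates back to genome coordinates."""
    return [(chunk_start + s, chunk_start + e) for s, e in hits]


def merge_hits(hits: List[Interval]) -> List[Interval]:
    """Fuse overlapping or directly adjacent intervals (assumed merge policy)."""
    merged: List[Interval] = []
    for s, e in sorted(hits):
        if merged and s <= merged[-1][1]:
            merged[-1] = (merged[-1][0], max(merged[-1][1], e))
        else:
            merged.append((s, e))
    return merged
```

With overlapping chunks, a repeat spanning a chunk boundary is typically reported twice (once per chunk); after shifting both hits to genome coordinates, `merge_hits` collapses them into one interval. A real implementation would probably need to merge per repeat family rather than purely by position.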

Input:

  • Path to a lib (a FASTA file if de novo, a name if from Dfam or RepBase)
  • Genome
  • Parameter for the split (chunk) size

Output: single gff file + stats
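
As a rough idea of what this interface might look like, here is a minimal sketch. Everything in it is an assumption made for illustration: the option names (`--genome`, `--lib`, `--chunk-size`, `--out-gff`), the default chunk size, the `repeat_region` feature type, and the contents of the stats. Intervals are taken as 0-based half-open and converted to the 1-based inclusive coordinates GFF3 uses.

```python
import argparse
from typing import Dict, List, Tuple

Interval = Tuple[int, int]  # 0-based, half-open [start, end)


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical command-line interface mirroring the inputs listed above."""
    parser = argparse.ArgumentParser(prog="repeatmaskmygenome")
    parser.add_argument("--genome", required=True, help="genome FASTA file")
    parser.add_argument("--lib", required=True,
                        help="repeat library: FASTA path (de novo) or database name")
    parser.add_argument("--chunk-size", type=int, default=1_000_000,
                        help="chunk length in bp (default is an arbitrary choice)")
    parser.add_argument("--out-gff", default="repeats.gff3",
                        help="path of the single merged GFF output")
    return parser


def gff3_line(seqid: str, start: int, end: int,
              source: str = "RepeatMaskMyGenome",
              feature_type: str = "repeat_region") -> str:
    """Format one interval as a 9-column GFF3 line (1-based, inclusive)."""
    columns = [seqid, source, feature_type, str(start + 1), str(end),
               ".", ".", ".", "."]  # score, strand, phase, attributes left empty
    return "\t".join(columns)


def masking_stats(genome_len: int, merged_hits: List[Interval]) -> Dict[str, float]:
    """Summarise masked bases; expects non-overlapping (already merged) intervals."""
    masked = sum(end - start for start, end in merged_hits)
    pct = round(100 * masked / genome_len, 2) if genome_len else 0.0
    return {"genome_bp": genome_len, "masked_bp": masked, "masked_pct": pct}
```

The stats here are limited to the masked fraction; per-family counts could be added once the merge strategy keeps family information.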
