Skip to content

modkit extract takes very long #119

Open
@pkerbs

Description

@pkerbs

Hello,
im using modkit extract like this:

modkit extract \
  --reference <fastafile> \
  --include-bed <bedfile> \
  --threads 16 \
  --log-filepath <logfile> \
  <input> \
  <output>

However, it runs for a very long time (7 hours and more for ~4GB bam files).
I see that modkit uses more cores sometimes, but just for a short period of time and then it runs only on one core for most of the time.
The BED file that I am using contains the Illumina 850k array positions, so ~850k lines.
The BAM files are sorted, indexed and were generated using modkit adjust-mods --convert h m on the original dorado call BAM files.
What could be here the issue? Shouldn't it extract the modcalls much faster?
Thank you for your response.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or requestextractmodkit extract

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions