Description
I am trying to convert an existing VBZ-compressed .pod5 file to a ZSTD-compressed .pod5 file with the pod5 convert CLI.
For example, I tried:
pod5 convert to_pod5 --compression zstd input.pod5 output.zstd.pod5
or
pod5 convert --compression zstd input.pod5 output.zstd.pod5
But I always get the following error:
usage: pod5 convert [-h] {fast5,from_fast5,to_fast5} ...
pod5 convert: error: argument {fast5,from_fast5,to_fast5}: invalid choice: 'to_pod5' (choose from 'fast5', 'from_fast5', 'to_fast5')
Environment
- pod5 version: 0.3.34
- Python version: 3.12
- OS: Linux
- Source: Installed via pip from Tsinghua mirror
- Input file: VBZ-compressed pod5
- Goal: ZSTD-compressed pod5 for downstream Dorado basecalling
Question
- Is there currently any CLI way to convert an existing VBZ-compressed .pod5 to ZSTD-compressed .pod5?
- If not, is this planned for a future release?
- Would the recommended approach be using the Python API with Reader/Writer to re-compress?
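For reference, this is roughly the Reader/Writer route I have in mind. It is only a sketch: the helper names copy_reads and recompress are my own, and I am assuming that pod5.Reader, Reader.reads(), ReadRecord.to_read(), pod5.Writer and Writer.add_read() behave as I understand them. It does not select ZSTD anywhere, because I do not know whether the Writer exposes such an option; the output uses whatever signal compression the Writer defaults to.

```python
def copy_reads(reader, writer):
    """Copy every read from an open reader into an open writer.

    Assumes reader.reads() yields records that expose to_read(), and that
    writer.add_read() accepts the returned object. Returns the number of
    reads copied.
    """
    count = 0
    for record in reader.reads():
        # to_read() is assumed to give a mutable, decompressed copy of the read
        writer.add_read(record.to_read())
        count += 1
    return count


def recompress(src, dst):
    """Rewrite src into a new pod5 file dst (hypothetical helper).

    The output compression is the Writer's default. Whether a ZSTD option
    can be passed to the Writer is exactly what I am unsure about.
    """
    import pod5  # imported lazily so copy_reads can be used without pod5

    with pod5.Reader(src) as reader, pod5.Writer(dst) as writer:
        return copy_reads(reader, writer)
```

I would call it as recompress("input.pod5", "output.pod5"). Decompressing and re-adding every read is probably slow for large files, so a supported CLI path would still be preferable.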
Additional context
- I still have the original .pod5 file, but no longer have the original .fast5 files.
- I want to avoid re-basecalling from scratch if possible.
Thank you for your guidance!