Description
Expected Behavior
I would like to run the metaeuk easy-predict to taxtocontig workflow against UniRef90 to annotate contigs from a metagenome, and to output a BLAST outfmt 6 table as input for Blobtoolkit (to decontaminate bacteria from a microeukaryote assembly).
It is not clear from the MetaEuk documentation how to produce a BLAST-like table without using mmseqs convertalis. The convertalis module will not work with the files produced by taxtocontig either: it reports that they are the wrong database type (it needs an alignment DB). I therefore tried adding -a so that the results could be converted to alignments with the mmseqs convertalis module.
Current Behavior
With the -a setting, the metaeuk easy-predict and taxtocontig workflow should produce an alignment file, but it does not finish and complains that the number of columns in the input is incorrect.
**-a BOOL Add backtrace string (convert to alignments with mmseqs convertalis module) [0]**
At the end of easy-predict, the script errors with:
**_there should be 20 columns in the input file. This doesn't seem to be the case._**
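One way to narrow this down might be to count how many tab-separated columns the offending input actually has. The following is only a minimal sketch: the error message does not say which file is meant, so the path passed to the helper is a placeholder, the helper name `column_histogram` is hypothetical (it is not part of MetaEuk or MMseqs2), and it assumes the file is plain tab-separated text.

```shell
# Hypothetical helper, not part of MetaEuk or MMseqs2.
# Prints one line per distinct column count: "<number of columns><TAB><number of lines>".
column_histogram() {
    awk -F'\t' '{ counts[NF]++ } END { for (n in counts) print n "\t" counts[n] }' "$1" | sort -n
}

# Usage (placeholder path; point it at the file the error refers to):
# column_histogram path/to/suspect_file.tsv
```

If every line reports the same count but that count is not 20, the whole file is in a different layout; if the counts are mixed, only some records are malformed.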
Steps to Reproduce (for bugs)
Please make sure to execute the reproduction steps with newly recreated and empty tmp folders.
Sensitivity was set low and --max-seqs was reduced so that this test runs quickly. Sensitivity is usually kept at 4.0.
metaeuk createdb prokarya_scaffolds.fasta prokarya_scaffoldsDB
metaeuk easy-predict prokarya_scaffoldsDB /home/hh.nku.edu/cooperjo/databases/MetaEuk_db/UniRef90 RESULTSprok tmpFOLDER -s 1 --metaeuk-eval 0.01 --max-seqs 25 -a
MetaEuk Output (for bugs)
Please make sure to also post the complete output of MetaEuk. You can use gist.github.com for large output.
https://gist.github.com/JoshuaTCooper/5f4f1280e767472ac524e836776a9495
Context
Providing context helps us come up with a solution and improve our documentation for the future.
I wish the MetaEuk documentation were more specific rather than just referring to MMseqs2. I have spent 3 days trying to interpret the MMseqs2 guide and could not figure out how to use the -a boolean to output an alignment DB (I think?). It seems the mmseqs taxonomy workflow is specific and produces the correct(?) files for convertalis.
My goal was to create a BLAST outfmt 6 table to determine the taxonomy of my metagenome contigs, for use within Blobtoolkit.
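For reference, the default BLAST tabular layout (outfmt 6) has 12 tab-separated columns. A minimal sketch of a check for that layout follows; the helper name `is_outfmt6` and the file name in the usage line are hypothetical, and the check only looks at the number of fields, not their contents.

```shell
# The 12 default columns of BLAST tabular output (-outfmt 6), for reference.
OUTFMT6_COLUMNS="qseqid sseqid pident length mismatch gapopen qstart qend sstart send evalue bitscore"

# Hypothetical helper, not part of MetaEuk or MMseqs2.
# Returns success only if every line has exactly 12 tab-separated fields.
is_outfmt6() {
    awk -F'\t' 'NF != 12 { bad = 1 } END { exit bad }' "$1"
}

# Usage (placeholder file name):
# is_outfmt6 results.m8 && echo "looks like a 12-column table"
```

A table that passes this check has the expected shape, although tools downstream may still require specific or additional columns.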
Alternatively, what steps would I run using mmseqs taxonomy to reproduce the settings of the metaeuk easy-predict to taxtocontig workflow and create the files for mmseqs convertalis? I also tried the step-by-step workflow, starting with predicting exons, and got the same error message (FYI).
If I have already run the metaeuk easy-predict and taxtocontig workflow and still have my temp folders, is there another way to extract that information and get a BLAST table without re-running the full program?
Your Environment
Include as many relevant details about the environment you experienced the bug in.
- Git commit used (The string after "MetaEuk Version:" when you execute MetaEuk without any parameters):
- Which MetaEuk version was used (Statically-compiled, self-compiled, Homebrew, etc.):
- For self-compiled and Homebrew: Compiler and Cmake versions used and their invocation:
- Server specifications (especially CPU support for AVX2/SSE and amount of system memory):
- Operating system and version:
metaeuk Version: 6.a5d39d9
bioconda installed
CPU support for AVX2
256 GB RAM, 16 core server
Thanks in advance!
Josh