You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+8-7Lines changed: 8 additions & 7 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -31,11 +31,11 @@ A **gene call** is an optimal set of exons predicted based on similarity to a sp
31
31
## Running MetaEuk
32
32
### Main Modules:
33
33
34
+
easy-predict Predict proteins from contigs (fasta/db) based on similarities to targets (fasta/db) and return a fasta
34
35
predictexons Call optimal exon sets based on protein similarity
35
36
reduceredundancy Cluster metaeuk calls which share an exon and select representative
36
37
unitesetstofasta Create a fasta output from optimal exon sets
37
38
groupstoacc Create a TSV output from representative to calls
38
-
easy-predict Predict proteins from contigs (fasta/db) based on similarities to targets (fasta/db) and return a fasta
39
39
40
40
41
41
### Important parameters:
@@ -47,6 +47,13 @@ A **gene call** is an optimal set of exons predicted based on similarity to a sp
47
47
--slice-search if refernceDB is a profile database, should be added
48
48
49
49
50
+
### easy-predict workflow:
51
+
52
+
This workflow combines the following MetaEuk modules into a single step: predictexons, reduceredundancy and unitesetstofasta (each of which is detailed below). Its input are contigs (either as a Fasta file or a previously created database) and targets (either as a Fasta file of protein sequences or a previously created database of proteins or protein profiles). It will run the modules and output the predictions in Fasta foramt.
This module will extract all putative protein fragments from each contig and strand, query them against the reference targets and use dynamic programming to retain for each **T** the optimal compatible exon set from each **C** & **S** (thus creating **TCS** calls).
@@ -102,12 +109,6 @@ can help mapping from each representative prediction after the redundancy reduct
This workflow combines the following MetaEuk modules into a single step: predictexons, reduceredundancy and unitesetstofasta. Its input are contigs (either as a Fasta file or a previously created database) and targets (either as a Fasta file of protein sequences or a previously created database of proteins or protein profiles). It will run the modules and output the predictions in Fasta foramt.
Compiling MetaEuk from source has the advantage that it will be optimized to the specific system, which should improve its performance. To compile MetaEuk `git`, `g++` (4.6 or higher) and `cmake` (3.0 or higher) are required. Afterwards, the MetaEuk binary will be located in the `build/bin` directory.
0 commit comments