Skip to content

Commit c852884

Browse files
Jon PalmerJon Palmer
Jon Palmer
authored and
Jon Palmer
committed
update to docs
1 parent b4d320b commit c852884

File tree

3 files changed

+65
-12
lines changed

3 files changed

+65
-12
lines changed

docs/clean.rst

+16-1
Original file line numberDiff line numberDiff line change
@@ -3,5 +3,20 @@
33

44
Cleaning your Assembly
55
================================
6-
When working with haploid assemblies, sometimes you want to remove some repetitive contigs that are contained in other scaffolds of the assembly. If the repeats are indeed unique, then we want to keep them in the assembly. Funannotate can help "clean" up repetitive contigs in your assembly
6+
When working with haploid assemblies, sometimes you want to remove some repetitive contigs that are contained in other scaffolds of the assembly. If the repeats are indeed unique, then we want to keep them in the assembly. Funannotate can help "clean" up repetitive contigs in your assembly. This is done using a "leave one out" methodology using Mummer (nucmer), where the the shortest contigs/scaffolds are aligned to the rest of the assembly to determine if it is repetitive. The script loops through the contigs starting with the shortest and workings its way to the N50 of the assembly, dropping contigs/scaffolds that are greater than the percent coverage of overlap (:code:`--cov`) and the percent identity of overlap (:code:`--pident`).
7+
8+
.. code-block:: none
79
10+
$ funannotate clean
11+
Usage: funannotate clean <arguments>
12+
version: 1.0.0
13+
14+
Description: The script sorts contigs by size, starting with shortest contigs it uses Mummer
15+
to find contigs duplicated elsewhere, and then removes duplicated contigs.
16+
17+
Arguments: -i, --input Multi-fasta genome file (Required)
18+
-o, --out Cleaned multi-fasta output file (Required)
19+
-p, --pident Percent identity of overlap. Default = 95
20+
-c, --cov Percent coverage of overlap. Default = 95
21+
-m, --minlen Minimum length of contig to keep. Default = 500
22+
--exhaustive Test every contig. Default is to stop at N50 value.

docs/conda.rst

+48-10
Original file line numberDiff line numberDiff line change
@@ -3,7 +3,10 @@
33

44
Conda mediated Installation
55
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
6-
I'd really like to build a bioconda installation package, but would need some help. You can however install nearly all of the dependencies with conda.
6+
7+
** This is still under construction **
8+
9+
I'd really like to build a bioconda installation package, but would need some help. You can however install quite a few of the dependencies with conda.
710

811
.. code-block:: none
912
@@ -31,8 +34,31 @@ This will automatically install most of the dependencies.
3134

3235
1. Download/install GeneMark-ES/ET: (gmes_petap.pl must be in PATH)
3336
http://exon.gatech.edu/GeneMark/license_download.cgi
37+
38+
2. Download/install Bamtools and Augustus
39+
40+
If you are on linux, you can install the usual way:
41+
42+
.. code-block:: none
43+
44+
#install bamtools
45+
wget https://github.com/pezmaster31/bamtools/archive/v2.5.0.tar.gz && \
46+
tar -zxvf v2.5.0.tar.gz && \
47+
rm v2.5.0.tar.gz && mv bamtools-2.5.0 bamtools && \
48+
cd bamtools && mkdir build && cd build && \
49+
cmake .. && make && sudo make install && \
50+
cd /usr/include && sudo ln -f -s ../local/include/bamtools/ && \
51+
cd /usr/lib/ && sudo ln -f -s /usr/local/lib/bamtools/libbamtools.* .
52+
53+
#install augustus
54+
wget http://bioinf.uni-greifswald.de/augustus/binaries/old/augustus-3.2.3.tar.gz && \
55+
tar -zxvf augustus-3.2.3.tar.gz && rm augustus-3.2.3.tar.gz && \
56+
mv augustus-3.2.3 augustus && cd augustus && make clean && make
57+
58+
If you are on Mac, install using this version: https://github.com/nextgenusfs/augustus
59+
3460

35-
2. Install python modules via PIP or conda:
61+
3. Install python modules via PIP or conda:
3662

3763
.. code-block:: none
3864
@@ -45,26 +71,38 @@ This will automatically install most of the dependencies.
4571
conda install -c etetoolkit ete3 ete_toolchain
4672
4773
48-
3. Install RepeatMasker/RepeatModler and corresponding Libraries if you have not done so already.
74+
4. Install RepeatMasker/RepeatModeler http://www.repeatmasker.org
75+
76+
77+
78+
4b. Download Repbase RepeatMasker Libraries if you have not done so already.
4979

5080
.. code-block:: none
5181
5282
wget --user name --password pass http://www.girinst.org/server/RepBase/protected/repeatmaskerlibraries/RepBaseRepeatMaskerEdition-20170127.tar.gz
53-
tar zxvf RepBaseRepeatMaskerEdition-20170127.tar.gz -C #{HOMEBREW_PREFIX}/opt/repeatmasker/libexec
54-
55-
cd #{HOMEBREW_PREFIX}/opt/repeatmasker/libexec
56-
./configure <config.txt
83+
tar zxvf RepBaseRepeatMaskerEdition-20170127.tar.gz -C /path/to/repeatmasker/location
84+
cd /path/to/repeatmasker/location
85+
./configure
5786
5887
#Soft-link a repeatmasker utility script into the PATH:
59-
ln -s #{HOMEBREW_PREFIX}/opt/repeatmasker/util/rmOutToGFF3.pl #{HOMEBREW_PREFIX}/bin/rmOutToGFF3.pl
88+
ln -s /path/to/repeatmasker/location/repeatmasker/util/rmOutToGFF3.pl /usr/local/bin/rmOutToGFF3.pl
6089
61-
4. Setup funannotate databases:
90+
5. Install Perl modules, i.e. with cpanminus
91+
92+
.. code-block:: none
93+
94+
cpanm Getopt::Long Pod::Usage File::Basename threads threads::shared \
95+
Thread::Queue Carp Data::Dumper YAML Hash::Merge Logger::Simple Parallel::ForkManager \
96+
DBI Text::Soundex Scalar::Util::Numeric Clone JSON LWP::UserAgent DBD::mysql
97+
98+
99+
6. Setup funannotate databases:
62100

63101
.. code-block:: none
64102
65103
funannotate setup -d /path/to/DB
66104
67-
5. Export required ENV variables (your paths might differ slightly):
105+
7. Export required ENV variables (your paths might differ slightly):
68106

69107
.. code-block:: none
70108

docs/docker.rst

+1-1
Original file line numberDiff line numberDiff line change
@@ -37,7 +37,7 @@ in Docker container as the databases were too large.
3737
RepBaseRepeatMaskerEdition-20170127.tar.gz
3838
signalp-4.1c.Linux.tar.Z
3939
40-
Now you can Build the docker containiner, which will setup the remaining tools and then download and format funannotate databases.:
40+
Now you can Build the docker container, which will setup the remaining tools and then download and format funannotate databases.:
4141

4242
.. code-block:: none
4343

0 commit comments

Comments
 (0)