Skip to content

Commit 8eb39f3

Browse files
authored
Update README.md
1 parent ad423aa commit 8eb39f3

File tree

1 file changed

+41
-10
lines changed

1 file changed

+41
-10
lines changed

README.md

+41-10
Original file line numberDiff line numberDiff line change
@@ -15,9 +15,10 @@ GraphBin2 is an extension of [GraphBin](https://github.com/Vini2/GraphBin) which
1515
You will need the following python packages installed.
1616
* [Biopython](https://biopython.org/)
1717
* [python-igraph](https://igraph.org/python/)
18+
* [tqdm](https://github.com/tqdm/tqdm)
1819

1920
### Downloading GraphBin2
20-
You can download the latest release of GraphBin from [Releases](https://github.com/Vini2/GraphBin/releases) or clone the GraphBin repository to your machine.
21+
You can download the latest release of GraphBin from [Releases](https://github.com/Vini2/GraphBin/releases) or clone the GraphBin2 repository to your machine.
2122

2223
```
2324
git clone https://github.com/Vini2/GraphBin2.git
@@ -29,12 +30,31 @@ If you have downloaded a release, you will have to extract the files using the f
2930
unzip [file_name].zip
3031
```
3132

32-
Now go in to the GraphBin folder using the command
33+
Now go in to the GraphBin2 folder using the command
3334

3435
```
35-
cd GraphBin2/src/
36+
cd GraphBin2/
3637
```
3738

39+
### Setting up the environment
40+
We recommend that you use [Conda](https://docs.conda.io/en/latest/) to run GraphBin2. You can download [Anaconda](https://www.anaconda.com/distribution/) or [Miniconda](https://docs.conda.io/en/latest/miniconda.html) which contains Conda.
41+
42+
Once you have installed Conda, make sure you are in the GraphBin2 folder. Now run the following commands to create a Conda environment and activate it to run GraphBin.
43+
44+
```
45+
conda env create -f environment.yml
46+
conda activate graphbin2
47+
```
48+
49+
Now you are ready to run GraphBin2.
50+
51+
If you want to switch back to your normal environment, run the following command.
52+
53+
```
54+
conda deactivate
55+
```
56+
57+
3858
## Preprocessing
3959

4060
Firstly, you will have to assemble your set of reads into contigs. For this purpose, you can use metaSPAdes or SGA.
@@ -129,15 +149,26 @@ For the SGA version of `graphbin2.py` takes in 4 files as inputs (required).
129149

130150
**Note:** abundance.abund is a tab separated file with contig ID and the coverage for each contig in the assembly. metaSPAdes provides the coverage of each contig in the contig identifier of the final assembly. We can directly extract these values to create the abundance.abund file. However, no such information is provided for contigs produced by SGA. Hence, reads should be mapped back to the assembled contigs in order to determine the coverage of SGA contigs.
131151

132-
**Note:** The binning output file should have comma separated values ```(contig_number, bin_number)``` for each contig. The contents of the binning output file should look similar to the example given below. Contigs are named according to their number starting from 0 and the numbering of bins starts from 1.
152+
**Note:** Make sure that the initial binning result consists of contigs belonging to only one bin. GraphBin is designed to handle initial contigs which belong to only one bin. Multiple bins for the initial contigs are not supported.
153+
154+
**Note:** The binning output file should have comma separated values ```(contig_identifier, bin_number)``` for each contig. The contents of the binning output file should look similar to the example given below. Contigs are named according to their original identifier and the numbering of bins starts from 1.
133155

134-
Example binned input
156+
Example metaSPAdes binned input
157+
```
158+
NODE_1,1
159+
NODE_2,1
160+
NODE_3,1
161+
NODE_4,2
162+
NODE_5,2
163+
...
164+
```
165+
Example SGA binned input
135166
```
136-
0,1
137-
1,2
138-
2,1
139-
3,1
140-
4,2
167+
contig-0,1
168+
contig-1,2
169+
contig-2,1
170+
contig-3,1
171+
contig-4,2
141172
...
142173
```
143174

0 commit comments

Comments
 (0)