-
Notifications
You must be signed in to change notification settings - Fork 19
Comparative Genomics Exercise 0: Genome assembly and annotation
Jaime Huerta-Cepas edited this page Nov 15, 2022
·
6 revisions
Login into the GDAV server
$ ssh youruser@IP
Create a directory in your home folder called compgenomics_ex1
$ mkdir compgenomics_ex0
and Enter the directory
$ cd compgenomics_ex0
All the files needed for this exercise are copied in the GDAV server at /home/compgenomics/assembly/. Make sure you can see them and take a few seconds to understand what they contain.
(already installed in the GDAV server)
- Spades
- Prokka
- bwa
Using the program spades, run a basic assembly and ORF prediction out of the two fastq files in /home/compgenomics/assembly.
$ spades.py -1 /home/compgenomics/assembly/SRR292770_1.fastq.gz -2 /home/compgenomics/assembly/SRR292770_2.fastq.gz -o my_assembly
Use prokka, get a list of predicted genes in your assembly.
conda activate roary
$ prokka my_assembly/scaffolds.fasta
bwa index scaffolds.fasta
bwa mem scaffolds.fasta /home/compgenomics/assembly/SRR292770_1.fastq.gz /home/compgenomics/assembly/SRR292770_2.fastq.gz > SRR292770.sam