sbx_gene_family

This extension performs functional mapping, i.e. mapping metagenomic reads to proteins. The database to map against can be UniRef50, all prokaryotic proteins from KEGG, or a more targeted database, e.g. the bai operon or butyrate-producing genes.

Take the UniRef50 database as an example. First, download uniref50.fasta.gz into your sunbeam_output/mapping/sbx_gene_family/database/ directory:

mkdir -p sunbeam_output/mapping/sbx_gene_family/database/
wget ftp://ftp.uniprot.org/pub/databases/uniprot/uniref/uniref50/uniref50.fasta.gz -P sunbeam_output/mapping/sbx_gene_family/database/
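UniRef50 is a large download, so it can be worth checking that the file arrived intact before mapping against it. The following is a minimal sketch, not part of the extension: `check_fasta_gz` is a hypothetical helper name, and it assumes `gzip` and `grep` are available on your system. It tests the archive's integrity and prints the number of FASTA records.

```shell
# Hypothetical helper (not shipped with the extension):
# verify a gzipped FASTA file and print how many sequences it holds.
check_fasta_gz() {
    path="$1"
    # gzip -t checks archive integrity without writing any output.
    if ! gzip -t "$path" 2>/dev/null; then
        echo "corrupt or missing: $path" >&2
        return 1
    fi
    # Every FASTA record begins with a '>' header line.
    gzip -dc "$path" | grep -c '^>'
}
```

For the download above, this would be called as `check_fasta_gz sunbeam_output/mapping/sbx_gene_family/database/uniref50.fasta.gz`.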

Second, update config.yml with the path to the database.

Usage

With your Sunbeam conda environment activated:

  1. Clone into your Sunbeam extensions directory:
git clone https://github.com/sunbeam-labs/sbx_gene_clusters
  2. Add the new config options to your config file:
cat sunbeam/extensions/sbx_gene_clusters/config.yml >> sunbeam_config.yml
  3. Install the requirements:
conda install --file extensions/sbx_gene_clusters/requirements.txt
  4. Run:

By default, mapping uses DIAMOND, but this extension also supports using BLAST.

sunbeam run -- --configfile sunbeam_config.yml all_gene_family