|  | 
| 1 |  | -# adrsm | 
| 2 |  | -Ancient DNA Read Simulator for Metagenomic  | 
|  | 1 | +<img src="./img/logo_adrsm.png" width="300"> | 
|  | 2 | + | 
|  | 3 | +# Introduction | 
|  | 4 | +ADRSM (Ancient DNA Read Simulator for Metagenomics) is a tool designed to simulate the paired-end sequencing of a metagenomic community. ADRSM allows you to control precisely the amount of DNA from each organism in the community, which can be used to benchmark different metagenomics methods. | 
|  | 5 | + | 
|  | 6 | +# Dependencies | 
|  | 7 | +- [Conda](https://conda.io/miniconda.html)   | 
|  | 8 | + | 
|  | 9 | +# Installation | 
|  | 10 | + | 
|  | 11 | +``` | 
|  | 12 | +conda install -c maxibor adrsm | 
|  | 13 | +``` | 
|  | 14 | + | 
|  | 15 | +# Usage | 
|  | 16 | + | 
|  | 17 | +``` | 
|  | 18 | +adrsm -d path/to/genome/directory configFile.txt | 
|  | 19 | +``` | 
|  | 20 | + | 
|  | 21 | +# Help | 
|  | 22 | + | 
|  | 23 | +``` | 
|  | 24 | +maxime@gph:~$ adrsm --help | 
|  | 25 | +usage: ADRSM [-h] [-d DIRECTORY] [-r READLENGTH] [-l LENSTDEV] [-fwd FWDADAPT] | 
|  | 26 | +             [-rev REVADAPT] [-e ERROR] [-o OUTPUT] [-s STATS] | 
|  | 27 | +             confFile | 
|  | 28 | +
 | 
|  | 29 | +Ancient DNA Read Simulator for Metagenomics | 
|  | 30 | +
 | 
|  | 31 | +positional arguments: | 
|  | 32 | +  confFile       path to configuration file | 
|  | 33 | +
 | 
|  | 34 | +optional arguments: | 
|  | 35 | +  -h, --help     show this help message and exit | 
|  | 36 | +  -d DIRECTORY   path to genome directory. Default = . | 
|  | 37 | +  -r READLENGTH  Average read length. Default = 76 | 
|  | 38 | +  -l LENSTDEV    Insert length standard deviation. Default = 10 | 
|  | 39 | +  -fwd FWDADAPT  Forward adaptor. Default = AGATCGGAAGAGCACACGTCTGAACTCCAGTCAC | 
|  | 40 | +                 NNNNNNATCTCGTATGCCGTCTTCTGCTTG | 
|  | 41 | +  -rev REVADAPT  Reverse adaptor. Default = | 
|  | 42 | +                 AGATCGGAAGAGCGTCGTGTAGGGAAAGAGTGTAGATCTCGGTGGTCGCCGTATCATT | 
|  | 43 | +  -e ERROR       Illumina sequecing error. Default = 0.01 | 
|  | 44 | +  -o OUTPUT      Output file basename. Default = ./metagenome.* | 
|  | 45 | +  -s STATS       Statistic file. Default = stats.csv | 
|  | 46 | +
 | 
|  | 47 | +``` | 
|  | 48 | + | 
|  | 49 | +## Genome directory | 
|  | 50 | +Each genome `fasta` file must be names after the name of the organism. (ex: `Bacillus_anthracis_genome.fa`) | 
|  | 51 | + | 
|  | 52 | +## Configuration file (`confFile`) | 
|  | 53 | +The configuration file is a `.csv` file describing, one line per genome, the mean insert size, and the expected genome coverage. | 
|  | 54 | +Example [short_genome_list.csv](./short_genome_list.csv): | 
|  | 55 | + | 
|  | 56 | +``` | 
|  | 57 | +genome, insert_size, coverage | 
|  | 58 | +Agrobacterium_tumefaciens_genome.fa, 47 , 0.1 | 
|  | 59 | +Bacillus_anthracis_genome.fa, 48, 0.2 | 
|  | 60 | +``` | 
0 commit comments