-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Expand file tree
/
Copy pathallthebacteria.yaml
More file actions
48 lines (48 loc) · 2.01 KB
/
Copy pathallthebacteria.yaml
File metadata and controls
48 lines (48 loc) · 2.01 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
Name: AllTheBacteria
Description: All bacterial isolate whole-genome sequencing data from INSDC, uniformly assembled, quality-controlled, annotated, and searchable.
Documentation: https://allthebacteria.org
Contact: https://github.com/AllTheBacteria/AllTheBacteria/issues
ManagedBy: "[European Bioinformatics Institute](https://www.ebi.ac.uk/)"
UpdateFrequency: |
The current release is for all SRA bacterial isolate data up to August 2024. The
colllection will be updated occasionally, with no fixed schedule.
Tags:
- assembly
- bacteria
- bioinformatics
- fasta
- genomic
- life sciences
- microbial genomics
- short read sequencing
- whole genome sequencing
License: "[MIT License](https://opensource.org/license/mit)"
Resources:
- Description: Individual, compressed genome assemblies in .fasta format in a public S3 bucket.
ARN: arn:aws:s3:::allthebacteria-assemblies
Region: eu-west-2
Type: S3 Bucket
Explore:
- Description: Phylogenetically-compressed, batched xz archives of all genome assemblies in .fasta format in a public S3 bucket.
ARN: arn:aws:s3:::allthebacteria-phylogeneticbatches
Region: eu-west-2
Type: S3 Bucket
Explore:
- Description: Metadata for each genome assembly, including taxonomic information, in a public S3 bucket.
ARN: arn:aws:s3:::allthebacteria-metadata
Region: eu-west-2
Type: S3 Bucket
Explore:
- Description: "A [LexicMap](https://github.com/shenwei356/LexicMap) index of all genome assemblies. This can be used for efficient sequence alignment against all genomes."
ARN: arn:aws:s3:::allthebacteria-lexicmap
Region: eu-west-2
Type: S3 Bucket
Explore:
DataAtWork:
Publications:
- Title: AllTheBacteria - all bacterial genomes assembled, available and searchable
URL: https://doi.org/10.1101/2024.03.08.584059
AuthorName: Hunt M, Lima L, Anderson D, Hawkey J, Shen W, Lees J, Iqbal I
AuthorURL: https://researchportal.bath.ac.uk/en/persons/zamin-iqbal
ADXCategories:
- Healthcare & Life Sciences Data