Skip to content
Draft
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
41 commits
Select commit Hold shift + click to select a range
cfc7bf6
Fixes Phase 1 of Issue #27 and Issue #103
Cateline Oct 20, 2024
9615773
Added link to MolEvolvR Case Study report. Fixes Phase 2 of Issue #27
Cateline Oct 21, 2024
9f06bb2
Delete unnecessary files
Cateline Oct 22, 2024
69916b8
Remove unnecessary CARD data files
Cateline Oct 22, 2024
0a3572e
Remove unnecessary CARD data files
Cateline Oct 22, 2024
08ed58f
Remove unnecessary CARD data files
Cateline Oct 22, 2024
a2643f1
Remove unnecessary CARD data files
Cateline Oct 22, 2024
9be2e3b
Remove unnecessary CARD data files
Cateline Oct 22, 2024
b0dbb23
Remove unnecessary CARD data files
Cateline Oct 22, 2024
8ddf883
Remove unnecessary CARD data files
Cateline Oct 22, 2024
b0c5dfa
Remove unnecessary CARD data files
Cateline Oct 22, 2024
2eb20ce
Remove unnecessary CARD data files
Cateline Oct 22, 2024
a532154
Remove unnecessary CARD data files
Cateline Oct 22, 2024
7aa8917
Remove unnecessary CARD data files
Cateline Oct 22, 2024
52ce540
Update case_studies/CARD/Bug-Drug Code.R
Cateline Oct 22, 2024
4177654
Update case_studies/CARD/Bug-Drug Code.R
Cateline Oct 22, 2024
444b520
Update Bug-Drug Code.R
Cateline Oct 24, 2024
e223f86
Add HTML report file to reports folder
Cateline Oct 24, 2024
56addcc
Delete case_studies/CARD/reports/download.htm
Cateline Oct 24, 2024
f2af6f4
Add HTML Report File
Cateline Oct 24, 2024
f590d94
Update case_studies/CARD/CARD_data/CARD-Download-README.txt
Cateline Oct 25, 2024
5d174be
Update case_studies/CARD/CARD_data/CARD-Download-README.txt
Cateline Oct 25, 2024
54e7b5b
Update case_studies/CARD/CARD_data/CARD-Download-README.txt
Cateline Oct 25, 2024
1195e1e
Update case_studies/CARD/CARD_data/CARD-Download-README.txt
Cateline Oct 25, 2024
2d80ab5
Update case_studies/CARD/CARD_data/CARD-Download-README.txt
Cateline Oct 25, 2024
b709416
Update CARD-Download-README.txt
Cateline Oct 25, 2024
eca5d37
Rename Staph_aureus_Daptomycin_sequences5.fasta to Staph_aureus_Dapto…
Cateline Oct 25, 2024
993bc09
Update Bug-Drug Code.R
Cateline Oct 27, 2024
ab67c1c
Update Bug-Drug Code.R
Cateline Oct 27, 2024
13a6e8b
Enhance logic for determining pathogen, gene, and drug fields
Cateline Oct 31, 2024
9a7688d
Enhance data mapping logic
Cateline Nov 1, 2024
14992a3
Add function to fetch and save protein FASTA sequences from Entrez
Cateline Nov 1, 2024
e105319
Update Bug-Drug Code.R
Cateline Nov 1, 2024
f6b87e7
Update case_studies/CARD/Bug-Drug Code.R
Cateline Nov 1, 2024
bbb8c91
Update case_studies/CARD/Bug-Drug Code.R
Cateline Nov 1, 2024
8e68be7
Update Bug-Drug Code.R
Cateline Nov 6, 2024
8afcba8
Update Bug-Drug Code.R
Cateline Nov 6, 2024
bcbd971
Refactor drug-pathogen filtering to support multiple drug classes and…
Cateline Nov 13, 2024
aee86b7
Data Cleanup Comparison
Cateline Nov 24, 2024
1dc5c81
Automate Case-Studies Issue #27
Cateline Nov 24, 2024
4ddc8e1
Rename Bug-Drug Code.R to bug_drug.R
jananiravi Nov 26, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
32 changes: 32 additions & 0 deletions case_studies/CARD/CARD_data/CARD-Download-README.txt
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@Cateline, thanks for adding this README. Out of curiosity, are these descriptions already paraphrased from the original source (CARD), or yet to be?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The descriptions are from the original source (CARD) and have not been paraphrased yet

Original file line number Diff line number Diff line change
@@ -0,0 +1,32 @@
# CARD README

## Source:
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
## Source:
## Source

This dataset was downloaded from the Comprehensive Antibiotic Resistance Database (CARD) in 2024-10 at https://card.mcmaster.ca/download/0/broadstreet-v3.3.0.tar.bz2
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
This dataset was downloaded from the Comprehensive Antibiotic Resistance Database (CARD) in 2024-10 at https://card.mcmaster.ca/download/0/broadstreet-v3.3.0.tar.bz2
This dataset and associated README were downloaded from the Comprehensive Antibiotic Resistance Database (CARD) (2024-10) at https://card.mcmaster.ca/download/0/broadstreet-v3.3.0.tar.bz2.



CITATION:

Alcock et al. 2023. "CARD 2023: expanded curation, support for machine learning, and resistome
prediction at the Comprehensive Antibiotic Resistance Database" Nucleic Acids Research,
51, D690-D699. https://pubmed.ncbi.nlm.nih.gov/36263822/

## CARD SHORT NAMES

The CARD database uses standardized abbreviations, known as CARD Short Names, for AMR gene names associated with Antibiotic Resistance Ontology terms. These names are created for compatibility across data files and outputs from the Resistance Gene Identifier (RGI). Short Names for genes with 15 or fewer characters retain the original gene name, while longer names are abbreviated to uniquely represent each gene or protein. All CARD Short Names replace whitespace with underscores. For pathogen names, CARD follows the convention of capitalizing the first letter of the genus followed by the first three letters of the species in lowercase. Where applicable, CARD Short Names adopt formats such as “pathogen_gene,” “pathogen_gene_drug,” or “gene_drug.” Full lists of these abbreviations are available in the provided files:

shortname_antibiotics.tsv
shortname_pathogens.tsv"


## FASTA

The FASTA files included here contain retrieved sequences of antimicrobial resistance genes.

## Data Files Downloaded
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
## Data Files Downloaded
## Data files downloaded

aro_index.tsv
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
aro_index.tsv
`aro_index.tsv`

This file contains an index of ARO (Antibiotic Resistance Ontology) identifiers with associated GenBank accessions. Each entry includes information used to link antibiotic resistance genes to GenBank sequences.
shortname_antibiotics.tsv
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
shortname_antibiotics.tsv
`shortname_antibiotics.tsv`

Contains standardized abbreviations for antibiotics used in CARD’s short names. These abbreviations, which follow conventions from the American Society for Microbiology (ASM) and additional custom terms, provide a uniform naming system for antibiotics referenced within CARD data.

shortname_pathogens.tsv
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
shortname_pathogens.tsv
`shortname_pathogens.tsv`

Lists standardized abbreviations for pathogens used in CARD. Each abbreviation represents pathogen names in a condensed format, commonly the first letter of the genus followed by the first three letters of the species. This abbreviation system simplifies pathogen referencing in CARD outputs.
5,184 changes: 5,184 additions & 0 deletions case_studies/CARD/CARD_data/aro_categories_index.tsv

Large diffs are not rendered by default.

5,228 changes: 5,228 additions & 0 deletions case_studies/CARD/CARD_data/aro_index.tsv

Large diffs are not rendered by default.

76 changes: 76 additions & 0 deletions case_studies/CARD/CARD_data/shortname_antibiotics.tsv
Original file line number Diff line number Diff line change
@@ -0,0 +1,76 @@
AAC Abbreviation Molecule
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

can use this for short nomenclature, e.g., spp_dru_...

AMG Aminoglycosides
AMK Amikacin
AMU Aminocoumarin
AMX Amoxicillin
ATM Aztreonam
AVI Avibactam
AZM Azithromycin
BDQ Bedaquiline
BLA Beta-lactams
CAP Capreomycin
CEF Ceftazidime
CZA Ceftazidime-Avibactam
CHL Chloramphenicol
CIP Ciprofloxacin
CLI Clindamycin
CLR Clarithromycin
CST Colistin
DAO Dapsone
DAP Daptomycin
DCS D-cycloserine
EDN Edeine
ELF Elfamycin
EMB Ethambutol
EMCM Ethambutol & Capreomycin
ENC Enacyloxin IIa
ENR Enrofloxacin
ERY Erythromycin
ETO Ethionamide
FA Fusidic acid
FLO Fluoroquinolones
FOF Fosfomycin
G418 G418
GE2A GE2270A
GEN Gentamicin
GENC Gentamicin C
HGM Hyrgomycin B
INH Isoniazid
IPM Imipenem
KAN Kanamycin
KAS Kasugamicin
KIR Kirromycin
LEV Levofloxacin
LYS Lysocin (E)
LZD Linezolid
MAC Macrolides
MULT Multiple antibiotics
MUP Mupirocin
MTZ Metronidazole
MXF Moxifloxacin
NEO Neomycin
NIT Nitrofurantoin
OFX Ofloxacin
OXZ Oxazolidinone
PAC Pactamycin
PAR Paromomycin
PAS Para-aminosalicylic acid
PCL Perchlozone
PLM Pleuromutilin
PLV Pulvomycin
PTO Prothionamide
PZA Pyrazinamide
RFB Rifabutin
RIF Rifampicin
SLF Sulfonamides
SPT Spectinomycin
STR Streptomycin
TMP Trimethoprim
TET Tetracycline
TOB Tobramycin
TRC Triclosan
TYL Tylosin
VAN Vancomycin
VIO Viomycin
ZOL Zoliflodacin
CAP capreomycin
94 changes: 94 additions & 0 deletions case_studies/CARD/CARD_data/shortname_pathogens.tsv
Original file line number Diff line number Diff line change
@@ -0,0 +1,94 @@
Abbreviation Pathogen
Abau Acinetobacter baumannii
Acla Alkalihalobacillus clausii
Afab Agrobacterium fabrum
Bado Bifidobacterium adolescentis
Bbac Bartonella bacilliformis
Bbif Bifidobacterium bifidum
Bbur Borreliella burgdorferi
Bdol Burkholderia dolosa
Bhyo Brachyspira hyodysenteriae
Bpse Burkholderia pseudomallei
Bpum Bacillus pumilus
Bsub Bacillus subtilis
Bsui Brucella suis
Ccol Campylobacter coli
Cacn Cutibacterium acnes
Cbut Clostridium butyricum
Cspo Clostridium sporogenes
Cdif Clostridioides difficile
Cgin Capnocytophaga gingivalis
Cjej Campylobacter jejuni
Cmen Chryseobacterium meningosepticum
Cper Clostridium perfringens
Cpsi Chlamydophila psittaci
Crei Chlamydomonas reinhardtii
Cstr Corynebacterium striatum
Ctra Chlamydia trachomatis
Eclo Enterobacter cloacae
Ecol Escherichia coli
Efac Enterococcus faecium
Efae Enterococcus faecalis
Erhu Erysipelothrix rhusiopathiae
Hhal Halobacterium halobium
Hinf Haemophilus influenzae
Hpin Haemophilus parainfluenzae
Hpyl Helicobacter pylori
Hsal Halobacterium salinarum
Kaer Klebsiella aerogenes
Kleb Klebsiella
Kpne Klebsiella pneumoniae
Lhon Laribacter hongkongensis
Lmon Listeria monocytogenes
Lreu Limosilactobacillus reuteri
Mabs Mycobacteroides abscessus
Mavi Mycobacterium avium
Mbov Mycobacterium tuberculosis variant bovis
Mcat Moraxella catarrhalis
Mche Mycobacteroides chelonae
Mfer Mycoplasmopsis fermentans
Mgal Mycoplasma gallisepticum
Mgen Mycoplasma genitalium
Mhom Mycoplasma hominis
Mint Mycobacterium intracellulare
Mkan Mycobacterium kansasii
Mlep Mycobacterium leprae
Mmor Morganella morganii
Mpne Mycoplasma pneumoniae
Msme Mycolicibacterium smegmatis
Mtub Mycobacterium tuberculosis
Ngon Neisseria gonorrhoeae
Nmen Neisseria meningitidis
Nvir Neobacillus vireti
Nfar Nocardia farcinica
Paer Pseudomonas aeruginosa
Pmir Proteus mirabilis
Pmul Pasteurella multocida
Prop Propionibacteria
Pros Planobispora rosea
Rfas Rhodococcus fascians
Rsph Rhodobacter sphaeroides
Saga Streptococcus agalactiae
Samb Streptomyces ambofaciens
Saur Staphylococcus aureus
Scin Streptomyces cinnamoneus
Scoh Staphylococcus cohnii
Sent Salmonella enterica
Sfle Shigella flexneri
Sfra Streptomyces fradiae
Sven Streptomyces venezuelae
Sint Staphylococcus intermedius
Sliv Streptomyces lividans
Smar Serratia marcescens
Smit Streptococcus mitis
Spne Streptococcus pneumoniae
Spyo Streptococcus pyogenes
Sris Streptomyces rishiriensis
Sser Salmonella serovars
Ssui Streptococcus suis
Tthe Thermus thermophilus
Uure Ureaplasma urealyticum
Vcho Vibrio cholerae
Vang Vibrio anguillarum
Vvul Vibrio vulnificus
Yent Yersinia enterocolitica
Loading