Converts metabarcoding FAIRe-formatted sample and experiment run metadata into the required file formats for NCBI BioSample and SRA submissions.
- FAIRe formatted SampleMetadata CSV(s)
- FAIRe formatted ExperimentRunMEtadata CSV(s) that are associated with the samples.
- A Library Preparation BeBOP publicly published in a github repository.
- A filled out config.yaml file to inform the code
- It uses the list of samples in the ExperimentRunMetadata as the source of truth, so if samples exist in the SampleMetadata but not the ExperimentRunMetadata, it will drop those samples in the final NCBI filled out submission template and list those dropped samples in a folder called 'logging' saved to the directory you are saving the ncbi sample excel file to.
- The NCBI submission uses one type of library preparation, and that protocol is standardized as a BeBOP and accessible via a public github repository.
- To create the NCBI
sample_titleand SRAtitleit assumes that the FAIRegeo_loc_name, theassay_name, andsamp_categoryare completed. And for negative and positive controls; it assumes that theneg_cont_type, andpos_cont_typeare completeted. If these FAIRe fields aren't completed, the NCBI SRA and sample titles may look strange or code may fail.
- This tool is really only for metabarcoding, will need to add functionality for qPCR
- This tool has functionlity for OME's OSU runs - will take out after fixing and submitting these as this pipeline is not broadly applicable.
- Need to install Conda
- Clone repo:
git clone https://github.com/NOAA-PMEL/faire-to-ncbi-converter.git
cd faire-to-ncbi-converter
- Create Conda Environment
conda env create -f environment.yaml
conda activate ncbi-converter
-
Edit the config.yaml file
-
Run the code
python main.py