This repository contains the whole-genome CRISPR target detection method as implemented in Sunagawa et al., "Mammalian Reverse Genetics without Crossing Reveals Nr3a as a Short-Sleeper Gene", Cell Reports 14(3.662-677, 2016)
-
Python 2.7
-
FASTA formatted chromosome files (obtainable via UCSC.
-
Place these files in a new directory
mm10_input/chr_sequences. -
Concatenate all sequences into a single file named
all_sequences.txt -
Build Bowtie2 indexes for these
-
-
refGene annotation data (obtainable via UCSC)
- Place this file in the
mm10_inputdirectory and name the filerefGene.txt
- Place this file in the
- Clone this repository
git clone https://github.com/bmds-lab/mm10db.git- Build the findMismatches_threads binary
cd <repo>
make pthread- Prepare gene lists. This will generate file(s) in the
mm10_inputdirectory.
python prepareGeneListsWholeGenome.py - Create list(s) of exons. Repeat for each list file generated in step 1.
python createListExons.py <listFile><listFile> is a file generated by createListExons.py. Use only the filename; strip the directory and file extension.
- Prepare the exon sequences files. Repeat for each list file generated in step 1.
python prepareExonSequences.py <listFile><listFile> is a file generated by createListExons.py. Use only the filename; strip the directory and file extension.
- Prepare list of off-target sites
python prepareListOfftargetSites.py- Run method
python target_identitification_viaC.py nb_threads_C=<int> nb_threads_Bowtie=<int> genes=<listFile>Note: we set nb_threads_C to 128 and nb_threads_Bowtie to 8
<listFile> is a file generated by createListExons.py. Use only the filename; strip the directory and file extension.
If you use this software, please cite the paper.
Sunagawa, G. A., Sumiyama, K., Ukai-Tadenuma, M., Perrin, D., Fujishima, H., Ukai, H., ... & Shimizu, Y. (2016). Mammalian reverse genetics without crossing reveals Nr3a as a short-sleeper gene. Cell reports, 14(3), 662-677.
See LICENSE