RNU4-2_Saturation_Genome_Editing (SGE)

This is a repository containing the source code for analysing the RNU4-2 SGE data from De Jonghe et al. medrxiv (2025).

Primary analysis

The primary analysis is performed on de-multiplexed fastq files (forward and reverse read files) and provides variant read count files for both replicates (RNU42brL41.txt and RNU42brL42.txt). The pipeline runs similarily to Buckley et al. Nature Genetics (2024). It comprises a main script RNU4-2_pipeline.sh which accesses a set of scripts to perform variant counting. It is recommended to generate a custom python environment with the following packages installed:

Python/2.7.18-GCCcore-9.3.0

GCC/5.4.0-2.26

EMBOSS

The scripts and data files can be found in the primary_analysis directory, the sequencing files are available for download at the ENA (project code).

Secondary analysis

The secondary analysis directory takes variant read count files and performs filtering, normalisation and plotting. The main analysis is performed in a jupyter notebook (RNU4_2_SGE.ipynb). Some plots were generated in R and can be found in the RNU4_2_SGE.R script. All files to run the analysis can be found in the secondary_analysis directory.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
primary_analysis		primary_analysis
secondary_analysis		secondary_analysis
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RNU4-2_Saturation_Genome_Editing (SGE)

Primary analysis

Secondary analysis

About

Releases 2

Packages

Languages

FrancisCrickInstitute/RNU4-2_Saturation_Genome_Editing

Folders and files

Latest commit

History

Repository files navigation

RNU4-2_Saturation_Genome_Editing (SGE)

Primary analysis

Secondary analysis

About

Resources

Stars

Watchers

Forks

Releases 2

Packages 0

Languages

Packages