New Generation Sequencing (NGS)

This work focuses on the study of Becnel et al. (2016) who share cancer genomic data. In our work, we are interesting into mutations that could induce pancreatic cancer. Here, we work on data from patient ID TCRBOA7. In order to reduce time computation, we only used data on chromosome 16.

Dependencies

The pipeline runs on bash. Some package are required for launching some commands such as fastqc, trimmomatic, bwa and varscan.

sudo apt install fastqc                 # For using fastqc
conda install -c bioconda trimmomatic   # For using trimmomatic
sudo apt install bwa                    # For using the bwa library
sudo apt install varscan                # For using varscan somatic

Hardware requirements

A machine with at least 8 GB of FREE RAM (to create the index and the mapping on the chromosome 16 of the reference genome).

Executing The Pipeline

The pipeline is used for detecting variants on whole exome sequencing (WES) data using paired-end files: the first two for tumor tissues and the other two for adjacent normal tissues. Here, it is used on a patient suffering from pancreatic cancer. Two files are generated at the end of the exome sequencing pipeline: a file with the single nucleotide polymorphisms (SNP) and a file with the insertions/deletions (indels). These files are available in the repository Data/Variants. The steps for generating these files are the followings.

Clone the Github repository to your machine

git clone https://github.com/Theo-Roncalli/WES-pancreatic-cancer.git
cd WES-pancreatic-cancer

Importation of reads and reference genome

bash install.sh

Creation of the variant files which contains the SNP and indels.

bash variants.sh

Cleaning Repository

For cleaning the repository (i.e. delete Data and Figures folders), please type:

bash clean.sh

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
Data		Data
Figures		Figures
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
clean.sh		clean.sh
install.sh		install.sh
variants.sh		variants.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

New Generation Sequencing (NGS)

Dependencies

Hardware requirements

Executing The Pipeline

Cleaning Repository

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

New Generation Sequencing (NGS)

Dependencies

Hardware requirements

Executing The Pipeline

Cleaning Repository

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages