Skip to content

Repository for "Host origin is a determinant of parallel evolution between influenza virus gene segments" by Jones & Lakdawala. Includes source code for analysis of parallel evolution between gene segments of human and avian influenza viruses and raw data.

Notifications You must be signed in to change notification settings

Lakdawala-Lab/Host-Origin-and-Parallel-Evolution

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Host-Origin-and-Parallel-Evolution

Repository for "Host origin is a determinant of parallel evolution between influenza virus gene segments" by Jones & Lakdawala (bioRxiv 2022). Includes raw data and source code for analysis of parallel evolution between gene segments of human and avian influenza viruses.

How to use this repository:

The code provided in this repository can be used to reconstruct and analyze convergence between phylogenetic trees of gene and protein sequences. Input FASTA files from H9 viruses are provided as examples. Source code for tree reconstruction and analysis is broadly generalizable to any set of alignments and is presented separately from sequence processing and selection.

Folders

'Data'

This folder contains the raw data that were processed and analyzed in this study. This includes H9 virus FASTA files sourced from the Influenza Research Database. FASTA files are provided as raw unprocessed data in the 'Pre-processed FASTA Files' folder as well as fully aligned sequences ready for analysis in the 'Post-processing Alignments' folder. Human H3N2 virus FASTA files were sourced from an earlier study.

'Sequence Selection'

This folder contains source code for the initial processing of raw FASTA files and selection of sequences from different hosts. This code can be run directly with files provided in the Data folder under the sub-folder 'Pre-processed FASTA Files'.

'Tree Reconstruction and Analysis'

This folder contains source code for tree reconstruction and tree distance calculation. This code can be run with avian H9 virus alignments generated using FASTA files provided in the 'Data' folder and source code included in the 'Sequence Selection' folder. Alternatively, this code can be run directly with alignments provided in the 'Post-processing Alignments' folder or with alignments generated by the user.

Happy tree building! 🧬

About

Repository for "Host origin is a determinant of parallel evolution between influenza virus gene segments" by Jones & Lakdawala. Includes source code for analysis of parallel evolution between gene segments of human and avian influenza viruses and raw data.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages