Overview

This repository holds my final project code for UCSD BIOL 40236. I completed this work as part of the course and wrote all code included here.

About This Project

I built this pipeline to annotate proteins using multiple sources of biological evidence. The code builds on my experience with Linux workflows, database interaction, and Python parsing, while extending those skills through a bioinformatics application completed during the course.

Repository Structure

Scripts (sp_bioinformatics_final_project.py) Python scripts I wrote to parse inputs, query the database, and combine evidence into final annotations.

Data (hmmscan.htab, prodigal2fasta.nostars.faa, prodigal2fasta.nostars.tmhmm.short, annot_final.sql) Input files required to run my pipeline.

Output (protein_evidence.txt) Final annotation results produced by my code.

I thank Professor Orvis for his inspiration and for teaching the core material used in this project.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Overview

About This Project

Repository Structure

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
LICENSE		LICENSE
README.md		README.md
annot_final.sql		annot_final.sql
hmmscan.htab		hmmscan.htab
prodigal2fasta.nostars.faa		prodigal2fasta.nostars.faa
prodigal2fasta.nostars.tmhmm.short		prodigal2fasta.nostars.tmhmm.short
protein_evidence.txt		protein_evidence.txt
sp_bioinformatics_final_project.py		sp_bioinformatics_final_project.py

Folders and files

Latest commit

History

Repository files navigation

Overview

About This Project

Repository Structure

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages