Investigating properties of natural and random 5'UTRs in silico for a high-throughput mRNA translation assay

Abstract

Protein synthesis underlies all life, and its regulation, particularly during translation, determines how efficiently proteins are produced. The 5'untranslated region (5'UTR) of mRNA is crucial to regulation, influencing translation efficiency through complex structural and sequence-based mechanisms. Despite extensive research, the multifactorial nature of 5'UTR regulation is still an unsolved problem. Recent experiments are using AI as a tool to explore these mechanisms by predicting translational effieciency from sequence data. Models were trained on randomly generated sequences to combat the sparseness of human data. Introducing this human data, however, vastly increased the performance of the model, revealing distinct structural and distributional differences . This project tries to draw closer to these differences by comparing natural and randomly generated 5'UTRs across different attributes that are inferable from the sequence alone subsequently training a convolutional neural network on sampled sequences to assess how sequence composition affects predictive performance, providing new insights into 5'UTR function and translational regulation.

Datasets

datasets 1, 3-8 can be found in new_dataset.csv datasets 9-14 are named like the thesis

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
shell scripts		shell scripts
src		src
.gitignore		.gitignore
README.md		README.md
model.ipynb		model.ipynb
plots.ipynb		plots.ipynb
positive control.ipynb		positive control.ipynb
test.ipynb		test.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Investigating properties of natural and random 5'UTRs in silico for a high-throughput mRNA translation assay

Abstract

Datasets

About

Uh oh!

Releases

Packages

Languages

jechochamber/Bachelor

Folders and files

Latest commit

History

Repository files navigation

Investigating properties of natural and random 5'UTRs in silico for a high-throughput mRNA translation assay

Abstract

Datasets

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages