Skip to content

Fine-tuned Hugging Face transformer model to predict protein binding behavior in virus-host interactions.

Notifications You must be signed in to change notification settings

olsenben/viral-transformers

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Can we predict whether a mutation in the SARS-CoV-2 spike protein is likely to:

  • Increase ACE2 binding affinity?

  • Escape neutralizing antibodies?

  • Persist in the population (i.e., be evolutionarily fit)?

Data Sources

  • GISAID

  • ProteinGym

  • EMBL EBI PDBe-KB, NExtstrain, or COV-GLUE

Approach

  • Preprocess protien sequences of spike proteins into 3-mer or 6-mer tokens

  • Fine-tune ESM-1b or ProtBERT using

    • classification (beneficial vs neutral vs deleterious mutation)
    • regression (fitness scores, binding energy)
  • Optional: Include position-aware embeddings for mutations (attention to RBD)

  • Evaluate against CNN or MLP baselines

  • Visualize embeddings or attention heatmaps to identify important residues

About

Fine-tuned Hugging Face transformer model to predict protein binding behavior in virus-host interactions.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published