Predict TCR-epitope specificity using deep learning models and database matching to identify antigen-reactive T-cells.
pip install pandas torch scikit-learn
# ERGO-II: git clone https://github.com/IdoSpringer/ERGO-II

Tell your AI agent what you want to do:
- "Predict what antigens this TCR sequence recognizes"
- "Match my TCRs to known epitopes in VDJdb"
- "Cluster TCRs that likely share specificity"
"What epitopes might this CDR3 beta sequence recognize?"
"Predict binding between my TCRs and these candidate epitopes"
"Find matches for my TCRs in VDJdb"
"Identify TCRs recognizing viral epitopes"
"What fraction of my repertoire recognizes known antigens?"
"Cluster TCRs by predicted specificity"
- Parse TCR sequence data (CDR3 alpha/beta)
- Match against VDJdb or other databases
- Run ERGO-II prediction if available
- Cluster similar TCRs
- Report potential epitope specificities
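The database-matching step can be sketched with pandas as an exact join on the CDR3 beta sequence. This is a minimal illustration, not the VDJdb tooling itself: the column names (`clone_id`, `cdr3`, `epitope`) and the helper functions are assumptions, and the rows are invented placeholders rather than real VDJdb entries. A real VDJdb export would need its columns renamed to match.

```python
import pandas as pd


def match_to_reference(tcrs: pd.DataFrame, reference: pd.DataFrame) -> pd.DataFrame:
    """Exact-match CDR3 sequences against a table of known TCR-epitope pairs.

    Both frames are assumed to have a 'cdr3' column; 'reference' also needs
    an 'epitope' column. Unmatched TCRs are kept with epitope = NaN.
    """
    ref = reference[["cdr3", "epitope"]].drop_duplicates()
    return tcrs.merge(ref, on="cdr3", how="left")


def fraction_annotated(matched: pd.DataFrame) -> float:
    """Fraction of clones with at least one reference hit."""
    has_hit = matched.groupby("clone_id")["epitope"].apply(lambda s: s.notna().any())
    return float(has_hit.mean())


# Placeholder data: invented sequences and epitope labels, for illustration only.
tcrs = pd.DataFrame({
    "clone_id": ["c1", "c2", "c3", "c4"],
    "cdr3": ["CASSLGQAYEQYF", "CASSPTGGYEQYF", "CASSQDRGTEAFF", "CASSFSGNTIYF"],
})
reference = pd.DataFrame({
    "cdr3": ["CASSLGQAYEQYF", "CASSLGQAYEQYF", "CASSQDRGTEAFF"],
    "epitope": ["EPITOPE_A", "EPITOPE_B", "EPITOPE_C"],
})

matched = match_to_reference(tcrs, reference)
print(matched)
print("Fraction of clones with a known match:", fraction_annotated(matched))
```

A left join keeps every input TCR, so one clone can appear on several rows when the reference lists more than one epitope for the same CDR3. The fraction is therefore computed per clone rather than per row.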
- CDR3 beta - The most informative region for specificity; including the paired CDR3 alpha adds roughly another 20%
- VDJdb - Curated database of known TCR-epitope pairs
- Clustering - TCRs whose CDR3 sequences are within an edit distance of 1-3 often share specificity
- ERGO-II - Deep learning can give better predictions than simple matching, particularly for TCRs with no exact database hit
- Validation - Predicted specificities should be validated experimentally
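The clustering tip above can be illustrated with a small pure-Python sketch: compute the Levenshtein edit distance between CDR3 sequences and link any pair at or below a threshold (single linkage via union-find). The function names and the `max_dist` parameter are hypothetical, and the sequences are invented placeholders. The all-pairs comparison is quadratic, so this suits small sets only, not full repertoires.

```python
from itertools import combinations


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance (insertions, deletions, substitutions)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def cluster_cdr3(seqs: list[str], max_dist: int = 1) -> list[list[str]]:
    """Single-linkage clustering: link sequences whose distance <= max_dist."""
    parent = list(range(len(seqs)))

    def find(i: int) -> int:
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i, j in combinations(range(len(seqs)), 2):
        if edit_distance(seqs[i], seqs[j]) <= max_dist:
            parent[find(i)] = find(j)

    clusters: dict[int, list[str]] = {}
    for i, seq in enumerate(seqs):
        clusters.setdefault(find(i), []).append(seq)
    return list(clusters.values())


# Invented placeholder sequences, for illustration only.
seqs = ["CASSLGQAYEQYF", "CASSLGQGYEQYF", "CASSLGQGYEQFF", "CASSQDRGTEAFF"]
clusters = cluster_cdr3(seqs, max_dist=1)
for members in clusters:
    print(members)
```

Single linkage chains neighbors together, so two sequences can land in the same cluster even when they are more than `max_dist` apart, as long as intermediate sequences connect them. A stricter grouping would need a different linkage rule.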