Protein Function Prediction Pipeline

This project provides a pipeline for protein function prediction using deep learning and protein language model embeddings.

Quick Start

Prepare Data:
- Use prepare_data.py to process your raw FASTA and TSV files into training-ready data.
Extract Embeddings:
- Generate embeddings for your protein sequences (see plm.py or cluster_embed/).
Train Model:
- Run the main training pipeline:
```
python train_script.py
```
- The script uses configs in configs/ and saves results in outputs/ or runs/.

Name		Name	Last commit message	Last commit date
Latest commit History 149 Commits
LoRA		LoRA
Network		Network
benchmark		benchmark
cluster_embed		cluster_embed
experiments		experiments
graph		graph
structure		structure
text		text
utils		utils
.gitignore		.gitignore
README.md		README.md
metrics.py		metrics.py
myenv.yml		myenv.yml
plm.py		plm.py
prepare_data.py		prepare_data.py
train.py		train.py
train_InterLabelGO.py		train_InterLabelGO.py
train_script.py		train_script.py