A demo of fine-tuning a vision-language model (VLM) with a custom dataset.
- The model: Qwen2.5-VL 7B.
- The dataset: a custom dataset for generating PDDL (HDDL) planning files from images: https://huggingface.co/datasets/shuooru/image-hddl-dataset
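To inspect the dataset before training, the Hugging Face `datasets` library can load it directly. A minimal sketch; the split name and column layout are assumptions, so check the dataset card first:

```python
from datasets import load_dataset

# Dataset ID from the link above; the "train" split is an assumption.
ds = load_dataset("shuooru/image-hddl-dataset", split="train")
print(ds)            # column names and row count
print(ds[0].keys())  # fields of a single example
```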
- Load the needed modules:
  ```bash
  module load virtualenv/20.26.2-GCCcore-13.3.0 matplotlib/3.9.2-gfbf-2024a SciPy-bundle/2024.05-gfbf-2024a
  ```
- Create the virtual environment:
  ```bash
  virtualenv --system-site-packages my_env
  ```
- Activate the virtual environment:
  ```bash
  source my_env/bin/activate
  ```
- Change to the working directory:
  ```bash
  cd /mimer/NOBACKUP/groups/naiss2025-22-933
  ```
- Go to the qwen directory and run the training (a sketch of what ft.py might contain follows this list):
  ```bash
  python ft.py --config_file='cfg/qwen2_5-vl_train_0.yaml' --trainer.num_train_epochs=100
  ```
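The actual ft.py and its YAML config are not reproduced here. Based on the TRL recipes linked below, a fine-tuning script for Qwen2.5-VL might look roughly like the following minimal sketch; the dataset columns (`messages`, `image`), the split name, and the hyperparameters are assumptions, not the repository's actual settings:

```python
# Hedged sketch of a TRL-based fine-tuning script in the spirit of the
# recipes linked below; the dataset columns ("messages", "image") and the
# hyperparameters are assumptions.
from datasets import load_dataset
from transformers import AutoProcessor, Qwen2_5_VLForConditionalGeneration
from trl import SFTConfig, SFTTrainer

model_id = "Qwen/Qwen2.5-VL-7B-Instruct"
model = Qwen2_5_VLForConditionalGeneration.from_pretrained(
    model_id, torch_dtype="auto", device_map="auto"
)
processor = AutoProcessor.from_pretrained(model_id)

train_dataset = load_dataset("shuooru/image-hddl-dataset", split="train")

def collate_fn(examples):
    # Render each conversation to chat-formatted text and pair it with its image.
    texts = [
        processor.apply_chat_template(ex["messages"], tokenize=False)
        for ex in examples
    ]
    images = [ex["image"] for ex in examples]
    batch = processor(text=texts, images=images, padding=True, return_tensors="pt")
    # Causal-LM labels: copy input_ids and ignore padding in the loss.
    labels = batch["input_ids"].clone()
    labels[labels == processor.tokenizer.pad_token_id] = -100
    batch["labels"] = labels
    return batch

args = SFTConfig(
    output_dir="qwen2_5-vl-hddl",
    num_train_epochs=100,                 # matches the CLI override above
    per_device_train_batch_size=1,
    gradient_checkpointing=True,
    remove_unused_columns=False,          # keep the image column for the collator
    dataset_kwargs={"skip_prepare_dataset": True},
)
trainer = SFTTrainer(
    model=model,
    args=args,
    train_dataset=train_dataset,
    data_collator=collate_fn,
    processing_class=processor.tokenizer,
)
trainer.train()
```

A real training script would typically also mask the image placeholder tokens in the labels and add a LoRA/PEFT config, as both linked recipes do.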
- Fine-tuning a VLM with TRL (Hugging Face cookbook): https://huggingface.co/learn/cookbook/fine_tuning_vlm_trl
- Fine-tuning multimodal LLMs with TRL (Phil Schmid): https://www.philschmid.de/fine-tune-multimodal-llms-with-trl
- Qwen2.5-VL 7B model: https://huggingface.co/Qwen/Qwen2.5-VL-7B-Instruct
- ggml-org/llama.cpp#2948
- Ollama, importing a fine-tuned adapter from Safetensors weights: https://github.com/ollama/ollama/blob/main/docs/import.md#Importing-a-fine-tuned-adapter-from-Safetensors-weights
- The Qwen model requires transformers 4.49.0.
- The Gemma model requires transformers>=4.51.3.
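A quick way to verify the installed version before a run. A minimal sketch; whether the Qwen note is an exact pin or a minimum is an assumption here:

```python
# Check the installed transformers version against the notes above.
from packaging import version  # packaging ships as a transformers dependency
import transformers

installed = version.parse(transformers.__version__)
# The Qwen note reads "4.49.0"; treating it as a minimum is an assumption.
print("Qwen2.5-VL ok:", installed >= version.parse("4.49.0"))
print("Gemma ok:", installed >= version.parse("4.51.3"))
```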