Author: Siqi Zhu
This example demonstrates training a vision-language model to solve geometry problems with multi-turn tool calling.
- Complete the Installation steps
- Get your IP address:
hostname -I
bash opentinker/scripts/launch_scheduler.sh --scheduler-port <scheduler_port>python opentinker/environment/geo3k/geo3k_tool_server.py --port <env_port>python opentinker/data_preprocess/geo3k_multiturn_w_interaction.py \
--local_save_dir=data/geo3k_multiturn_w_toolpython opentinker/client/geo3k_tool_rl.py \
tokenizer_path=Qwen/Qwen2-VL-2B-Instruct \
batch_size=16 \
val_batch_size=64 \
data_path=data/geo3k_multiturn_w_tool/train.parquet \
val_data_path=data/geo3k_multiturn_w_tool/test.parquet \
num_epochs=5 \
save_freq=1000 \
test_freq=5 \
scheduler_url=http://<server_endpoint>:<scheduler_port> \
interaction.config.env_port=<env_port> \
interaction.config.env_host=<client_endpoint>See wandb run for training metrics and results.