hi, I have two questions about build tokenized input to model:
-
Why using
in build conversation, why not
-
Are you using defult Qwen3VL chat template? why not creating a chat template(maybe more convenient to describe mutli-view camera info?)?
Thank you!
hi, I have two questions about build tokenized input to model:
Why using
in build conversation, why not
Are you using defult Qwen3VL chat template? why not creating a chat template(maybe more convenient to describe mutli-view camera info?)?
Thank you!