This is the code for the ICLR 2025 paper:
Multimodal Quantitative Language for Generative Recommendation
pytorch==2.1.0
transformers <= 4.45.0
- We found that different versions of transformers significantly impact convergence speed and performance under default parameters.
For newer versions, parameter adjustments are required.
We tested different transformers versions with accelerate==0.28.0:
transformers 4.43.0 and 4.44.0 are incompatible with accelerate==0.28.0; a version change is required.
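The version constraints above can be checked before training with a small script. This is a minimal sketch, not part of the repository: the helper names `parse_version`, `check_environment`, and `installed` are hypothetical, and the rules encoded are only the ones stated above (transformers <= 4.45.0, and 4.43.0 / 4.44.0 not working with accelerate==0.28.0).

```python
import re
from importlib.metadata import PackageNotFoundError, version

# Constraints taken from the notes above; adjust if you retune parameters.
MAX_TRANSFORMERS = (4, 45, 0)
BAD_WITH_ACCELERATE_0_28 = {(4, 43, 0), (4, 44, 0)}


def parse_version(text):
    """Turn '4.44.0', 'v4.44.0' or '2.1.0+cu118' into a tuple of ints."""
    parts = []
    for piece in text.lstrip("v").split("+")[0].split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break
        parts.append(int(match.group()))
    return tuple(parts)


def check_environment(transformers_version, accelerate_version):
    """Return a list of warning strings (empty if nothing looks wrong)."""
    warnings = []
    tv = parse_version(transformers_version)
    av = parse_version(accelerate_version)
    if tv > MAX_TRANSFORMERS:
        warnings.append(
            f"transformers {transformers_version} is newer than 4.45.0; "
            "default parameters may need adjusting."
        )
    if av == (0, 28, 0) and tv in BAD_WITH_ACCELERATE_0_28:
        warnings.append(
            f"transformers {transformers_version} is incompatible with "
            "accelerate 0.28.0; change one of the two versions."
        )
    return warnings


def installed(name):
    """Installed version string of a package, or None if it is missing."""
    try:
        return version(name)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    tf, acc = installed("transformers"), installed("accelerate")
    if tf is None or acc is None:
        print("transformers and/or accelerate is not installed.")
    else:
        for message in check_environment(tf, acc) or ["Versions look fine."]:
            print(message)
```

Running the script inside the training environment prints any mismatch it finds; it does not verify the pytorch pin.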
cd data_process
- Download images
- Process data to ensure each item corresponds to one image and one text description
- Generate text embeddings
- Generate image embeddings
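The second step above (one image and one text description per item) can be sketched as a simple filter. This is an illustration under assumptions, not the repository's script: it assumes images are named `<item_id>.<ext>` and that descriptions are available as a dict keyed by item ID. The function name `pair_items` and the extension list are hypothetical.

```python
import os

# Assumed image extensions; extend as needed for your download step.
IMAGE_EXTS = {".jpg", ".jpeg", ".png"}


def pair_items(texts, image_names):
    """Keep only items that have both a non-empty text and an image.

    texts: dict mapping item_id -> text description
    image_names: iterable of image file names, assumed '<item_id>.<ext>'
    Returns a dict mapping item_id -> (text, image_name).
    """
    images = {}
    for name in sorted(image_names):
        stem, ext = os.path.splitext(name)
        if ext.lower() in IMAGE_EXTS:
            # Keep the first image per item so each item has exactly one.
            images.setdefault(stem, name)

    paired = {}
    for item_id, text in texts.items():
        text = (text or "").strip()
        if text and item_id in images:
            paired[item_id] = (text, images[item_id])
    return paired


if __name__ == "__main__":
    demo_texts = {"a": "Red shoe", "b": "  ", "c": "Blue hat"}
    demo_images = ["a.jpg", "a.png", "b.jpg", "notes.txt"]
    print(pair_items(demo_texts, demo_images))
```

Items missing either modality are dropped here; whether to drop or backfill them is a design choice that depends on the dataset.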
Preprocessed data, pretrained checkpoints, and training logs:
Google Drive Folder
cd index
bash script/run.sh # Run training
bash script/gen_code_dis.sh # Generate code
bash script/pretrain.sh
bash finetune.sh
- Adjust file paths according to your local directory structure
PRs and issues are welcome!