CLIP Score is a metric used to evaluate the similarity between images and texts using the CLIP model. Adding CLIP Score as an evaluation metric will help us better measure the alignment between generated content and reference data. Please integrate CLIP Score into the evaluate.