Many thanks for sharing your impressive work and code. I am reading your code line by line for better understanding. One point I cannot figure out is that, why is text embedding also linearly interpolated (emb_list = auto_lerp(tlist, emb0, emb1) )? In my opinion, this kind of interpolated will change the condition (i.e. prompt), and all initial noisy input should share the same prompt condition. Please help me understand it. Many thanks again!