
@ZVAXEROWS

Implement a Diffusers-compatible pipeline for the First-Last-Frame-to-Video (FLF2V) task. It lets users generate a video from a first frame, a last frame, and a text prompt through the standard diffusers library interface and schedulers, fulfilling the Todo item: "Wan2.1 First-Last-Frame-to-Video > Diffusers integration".

This pipeline will:

Load the WanModel.

Accept first_frame (image), last_frame (image), and prompt (text) as inputs.

Encode the images and text.

Run the diffusion loop using diffusers schedulers (e.g., UniPCMultistepScheduler, as used in the T2V example).

Decode the result into video frames.
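
The steps above can be sketched as a control-flow skeleton. This is only an illustration of the intended call order, not the actual implementation: the class name `FLF2VSketch`, its fields, and the toy components are all hypothetical stand-ins, and no real WanModel, VAE, text encoder, or diffusers scheduler is involved.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence

Latent = List[float]


@dataclass
class FLF2VSketch:
    """Hypothetical skeleton: every component is a stand-in callable."""

    encode_image: Callable[[Sequence[float]], Latent]
    encode_text: Callable[[str], Latent]
    # (latents, first, last, text, timestep) -> updated latents
    denoise_step: Callable[[Latent, Latent, Latent, Latent, int], Latent]
    decode: Callable[[Latent], List[Latent]]

    def __call__(self, first_frame, last_frame, prompt, num_inference_steps=4):
        # Encode the two conditioning images and the text prompt.
        first = self.encode_image(first_frame)
        last = self.encode_image(last_frame)
        text = self.encode_text(prompt)

        # A real pipeline would start from Gaussian noise; zeros keep
        # this toy example deterministic.
        latents = [0.0] * len(first)

        # Diffusion loop: a real pipeline would iterate over
        # scheduler timesteps and call the transformer here.
        for t in reversed(range(num_inference_steps)):
            latents = self.denoise_step(latents, first, last, text, t)

        # Decode latents into a list of video frames.
        return self.decode(latents)


def toy_denoise(latents, first, last, text, t):
    # Toy update: move each value halfway toward the midpoint of the
    # first-frame and last-frame latents. The text embedding is unused.
    return [z + 0.5 * ((a + b) / 2 - z) for z, a, b in zip(latents, first, last)]


pipe = FLF2VSketch(
    encode_image=lambda img: list(img),
    encode_text=lambda prompt: [float(len(prompt))],
    denoise_step=toy_denoise,
    decode=lambda z: [z],  # toy decoder returns a single "frame"
)

frames = pipe(first_frame=[0.0, 0.0], last_frame=[1.0, 1.0], prompt="a cat")
```

Keeping each stage behind its own callable mirrors how a diffusers pipeline separates the encoders, the denoising model, the scheduler, and the decoder, so any stage could be swapped without changing the call order.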
