Skip to content

Causal CD leads to static and blurry results #41

@WJohnnyW

Description

@WJohnnyW

Hi,

Thank you for the great work!

I am training on my own AV-joint model based on Ovi. After the S1 AR training stage, the AR generation quality is slightly worse than the base model, but still acceptable.

However, after the S2 Causal CD training stage, I found that the inference results are problematic: the generated video is almost static, and sometimes becomes very blurry. When the person is talking, there is corresponding audio, but the volume is very low.

So far, I have trained for roughly 2700 steps × 14 samples. Do you think this is simply because the training is still insufficient, or could there be something wrong with my training setup?

Metadata

Metadata

Assignees

No one assigned

    Labels

    questionFurther information is requested

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions