Skip to content

[Question] Poor results when changing action prompt from "flying" to "walking" or "roaring" in demo #14

@ChambinLee

Description

@ChambinLee

Description

While testing the provided demo code, I observed that the algorithm works well for the prompt "The object is flying", producing a smooth and visually plausible animation. However, when I replace "flying" with other action prompts like "walking" or "roaring", the generated results become significantly worse—either the motion is unnatural, the object deforms incorrectly, or it barely moves at all.

Expected Behavior

The algorithm should generalize to different action prompts beyond "flying". At minimum, "walking" or "roaring" should produce recognizable motion patterns, even if not perfectly realistic.

Questions

  1. Is the current model overfitted to the "flying" action used in the demo?
  2. Would fine-tuning on other actions help, or is the model architecture fundamentally limited to certain motion types?

Any insights or suggestions would be greatly appreciated! Happy to provide more details if needed.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions