Does any Emu2 series model support interleaved text and image generation? #93

Open

Opened on Sep 11, 2024

When using Emu2, I only see multimodal input with text output. Is there any way to generate text together with multiple images?
