```build_labels``` includes masked image tokens?

Hi Authors,

in [these lines](https://github.com/Aleph-Alpha/magma/blob/master/magma/utils.py#L334), the function ```build_labels``` masks all the labels in positions up to the sequence length of the embeddings. What difference would it make if one just used the caption?


To be more specific, the code currently builds a label sequence whose first part (with the same sequence length as the image embeddings) is all set to -100, followed by the actual text labels. Why do we need all the -100s? Why couldn't we just use the text label ids?
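To make the layout I am describing concrete, here is a minimal pure-Python sketch. It is not the actual ```build_labels``` code: the names `build_labels_sketch` and `masked_cross_entropy` are hypothetical. I am assuming the loss skips positions labelled -100, as PyTorch's `cross_entropy` does with its default `ignore_index=-100`. The causal shift between logits and labels is left out for brevity.

```python
import math

IGNORE_INDEX = -100  # assumed to match PyTorch's default ignore_index


def build_labels_sketch(num_image_positions, caption_ids):
    """Hypothetical stand-in: mask the image positions, keep the caption ids."""
    return [IGNORE_INDEX] * num_image_positions + list(caption_ids)


def masked_cross_entropy(logits, labels):
    """Mean cross-entropy over positions whose label is not IGNORE_INDEX."""
    # One label per output position, image positions included.
    assert len(logits) == len(labels)
    losses = []
    for row, label in zip(logits, labels):
        if label == IGNORE_INDEX:
            continue  # masked position: contributes nothing to the loss
        log_norm = math.log(sum(math.exp(x) for x in row))
        losses.append(log_norm - row[label])
    return sum(losses) / len(losses)


# Toy example: 3 image positions followed by a 2-token caption, vocabulary of 4.
labels = build_labels_sketch(3, [2, 1])
logits = [[0.0, 0.0, 0.0, 0.0] for _ in labels]
loss = masked_cross_entropy(logits, labels)
```

In this sketch the labels have one entry per output position, so they line up with the logits for the full image-plus-text input, and only the two caption positions contribute to the loss.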


Thanks a lot!


`build_labels` includes masked image tokens? #46