multi-bleu.perl

Hi,

Is this the same script with the Moses's multi-bleu.perl? I've seen that there are some modifications to the original version. I've been investigating that why my baseline model's (Google NIC with VGG-E) BLEU-2-3-4 performance is really low but what I've found is we are not using the same evaluation scripts. I know that this task is different than machine translation task, though. So, my questions are,

- What's the intention behind the BLEU evaluation script modification?
- Is all captioning people evaluate their models with this approach?

Thanks in advance.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

multi-bleu.perl #49

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

multi-bleu.perl #49

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions