On the Issue of Reproducing UITARS 1.5 on OSWorld

I attempted to reproduce the scores for UI-TARS-250705 (41.8%) using the script available at [run_multienv_uitars15_v1.py](https://github.com/xlang-ai/OSWorld/blob/main/run_multienv_uitars15_v1.py).
However, I encountered some bugs, for example, when constructing messages using the last 5 images during the reproduction process. After resolving these issues, I was unable to achieve the scores reported on the leaderboard (our reproduction 31.3%).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

On the Issue of Reproducing UITARS 1.5 on OSWorld #225

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

On the Issue of Reproducing UITARS 1.5 on OSWorld #225

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions