Skip to content

Commit 16021b4

Browse files
authored
docs(spatialtreebench): document TreeBench naming variants (#1189)
Clarify that TreeBench, SpatialTreeBench, and Spatial-TreeBench map to the existing spatialtreebench task so model-card references are traceable.
1 parent ff32722 commit 16021b4

1 file changed

Lines changed: 35 additions & 0 deletions

File tree

Lines changed: 35 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,35 @@
1+
# SpatialTreeBench (`spatialtreebench`)
2+
3+
SpatialTreeBench is a hierarchical benchmark for evaluating spatial capabilities in multimodal models, from low-level perception to higher-level simulation and agentic behavior.
4+
5+
## Name Mapping in lmms-eval
6+
7+
Model cards and reports may use different benchmark names. In lmms-eval, the following name variants all refer to this existing task:
8+
9+
- `TreeBench`
10+
- `SpatialTreeBench`
11+
- `Spatial-TreeBench`
12+
13+
Canonical lmms-eval task name: `spatialtreebench`
14+
15+
Task config: `lmms_eval/tasks/spatialtreebench/spatialtreebench.yaml`
16+
17+
Dataset configured in YAML: `LongfeiLi/SpatialTree-Bench`
18+
19+
## References
20+
21+
- Paper: [SpatialTree: How Spatial Abilities Branch Out in MLLMs](https://arxiv.org/abs/2512.20617)
22+
- Project page: [spatialtree.github.io](https://spatialtree.github.io/)
23+
- Dataset: [LongfeiLi/SpatialTree-Bench](https://huggingface.co/datasets/LongfeiLi/SpatialTree-Bench)
24+
25+
## Usage
26+
27+
```bash
28+
python -m lmms_eval \
29+
--model <model_name> \
30+
--model_args <key=value,...> \
31+
--tasks spatialtreebench \
32+
--batch_size 1
33+
```
34+
35+
Use `--tasks spatialtreebench` even when source materials mention `TreeBench` or `Spatial-TreeBench`.

0 commit comments

Comments
 (0)