Commit 9e8de8a

Update README.md
1 parent b0e8526 commit 9e8de8a

1 file changed

+16
-1
lines changed

README.md

Lines changed: 16 additions & 1 deletion
@@ -110,7 +110,22 @@ During this process, a few settings were adjusted:
 These updates generally lead to improved success rates across most tasks.
 As a result, your observed performance (e.g., **100% on “push button”**) may exceed the numbers reported in the paper.
 
-### Updated Results (Open-Source Version)
+## Experimental Results
+
+### Evaluation over 60 RLBench tasks
+Why do we use 60 tasks for the main evaluation?
+Although the 18 RLBench tasks have been widely adopted as a benchmark since their introduction in Perceiver-Actor: A Multi-Task Transformer for Robotic Manipulation, they are primarily used to evaluate 3D-based hierarchical policies that depend heavily on high-precision 3D inputs and motion planners. Many of these tasks are extremely challenging for RGB-only visuomotor policies, often leading to uniformly low success rates and therefore limited discriminative power.
+
+<img width="1105" height="473" alt="coa_performance" src="https://github.com/user-attachments/assets/b4408c9d-311b-4c42-9cdb-74decfdb91ef" />
+
+
+### Evaluation over 18 RLBench tasks
+
+To enable convenient comparison with 3D-based hierarchical methods such as RVT-2, we also report results on the RLBench-18 benchmark. Please see the appendix for more details.
+
+<img width="706" height="431" alt="coa_rlbench18" src="https://github.com/user-attachments/assets/3b698819-fd0a-4e6e-979e-f64ec108df52" />
+
+## Updated Results (Open-Source Version)
 
 For reference, below are the task-level success rates of the open-source implementation compared with those reported in the paper.
 The open-source version generally achieves higher performance due to the modified training configuration.
