Open
Description
Use the WebArena benchmark for evaluation.
- Set up the standalone WebArena environment.
- Configure the URLs for each website.
- Generate a config file for each test example and obtain the auto-login cookies for all websites.
- Write a script that uses WebArena's environment, based on its run.py.
- Save the task execution results and evaluate them.
- Analyze the evaluation results.
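
A minimal sketch of two of the steps above, the URL configuration check and the result analysis. It does not call WebArena itself. The environment variable names are assumed from the WebArena README and should be verified against the checkout in use. The result record layout (`task_id`, `site`, `score`) and the helper names `missing_site_urls` and `summarize` are hypothetical, chosen only for illustration.

```python
import os
from collections import defaultdict

# Assumed variable names (from the WebArena README); verify before relying on them.
SITE_VARS = ["SHOPPING", "SHOPPING_ADMIN", "REDDIT", "GITLAB", "MAP", "WIKIPEDIA", "HOMEPAGE"]


def missing_site_urls(env=None):
    """Return the site URL variables that are unset or empty."""
    env = os.environ if env is None else env
    return [name for name in SITE_VARS if not env.get(name)]


def summarize(results):
    """Compute the mean score per site.

    Each record is a hypothetical dict such as
    {"task_id": 0, "site": "shopping", "score": 1.0}.
    """
    by_site = defaultdict(list)
    for record in results:
        by_site[record["site"]].append(record["score"])
    return {site: sum(scores) / len(scores) for site, scores in by_site.items()}
```

Running `missing_site_urls()` before generating the per-task config files may catch an unset URL early, and `summarize` could be applied to whatever per-task scores the run script saves.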