Questions about your paper

Hi,
I have a few questions about your paper.
1. Take the task of acquiring a log as an example. You say that after the agent gets close to the tree by directly executing code, the log is acquired in the reinforcement learning phase. How does reinforcement learning start from there? I understand that code from the Slow Agent is inserted into the action space, but I don't know how RL is set up apart from the action space. Is the situation reached before reinforcement learning starts inherited? How is the environment reset? In other words, I would like to know how reinforcement learning begins and ends.
2. You say that reinforcement learning is used for the sub-actions that the Slow Agent decides should be learned. Who determines the configuration of that reinforcement learning: the Slow Agent or the Fast Agent?
3. As far as I can tell, the Slow Agent prompts in the Appendix contain no instructions for deciding whether code should be executed directly or learned. Would it be possible to publish the prompts?
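
To make question 1 concrete, here is a minimal Python sketch of one setup I can imagine. Everything in it is my own hypothetical stand-in and not taken from the paper or its code: `ToyWorld`, `approach_tree`, `run_rl_phase`, the snapshot-based reset, and the step limit are all assumptions, and a random policy stands in for the actual learner.

```python
import copy
import random
from dataclasses import dataclass


@dataclass
class ToyWorld:
    """Hypothetical 1-D stand-in for the game: agent at `pos`, tree at `tree`."""
    pos: int = 0
    tree: int = 5
    logs: int = 0

    def step(self, action: str) -> None:
        if action == "forward":
            self.pos += 1
        elif action == "attack" and self.pos == self.tree - 1:
            # Attacking only works when standing right next to the tree.
            self.logs += 1


def approach_tree(world: ToyWorld) -> None:
    """Stand-in for Slow Agent code that is executed directly, without learning."""
    while world.pos < world.tree - 1:
        world.step("forward")


# Assumed action set; generated code could be added here as extra macro actions.
ACTIONS = ["forward", "attack", "noop"]


def run_rl_phase(start: ToyWorld, episodes: int, max_steps: int,
                 rng: random.Random) -> int:
    """Count successful episodes under a random placeholder policy."""
    successes = 0
    for _ in range(episodes):
        # Guess A: "reset" restores the state left by the scripted phase.
        world = copy.deepcopy(start)
        for _ in range(max_steps):
            world.step(rng.choice(ACTIONS))
            if world.logs > 0:
                # Guess B: an episode ends on success...
                successes += 1
                break
        # ...or when the step budget runs out.
    return successes


world = ToyWorld()
approach_tree(world)  # scripted phase runs once
successes = run_rl_phase(world, episodes=20, max_steps=10, rng=random.Random(0))
print(f"start state: pos={world.pos}, logs={world.logs}; successes={successes}/20")
```

Is this roughly what happens, or is the environment instead reset from scratch and the scripted code re-executed before every episode?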

