From ccb42305abf3f700190822b43849fcb808821cc8 Mon Sep 17 00:00:00 2001
From: ttang911 <8541896+ttang911@users.noreply.github.com>
Date: Fri, 22 May 2026 22:45:31 +0800
Subject: [PATCH] docs: refine Evaluation Loop description and Memory & Sense heading

---
 README.md                | 2 +-
 docs/technical_report.md | 2 +-
 2 files changed, 2 insertions(+), 2 deletions(-)

diff --git a/README.md b/README.md
index f26e287..24bb61e 100644
--- a/README.md
+++ b/README.md
@@ -93,7 +93,7 @@ On top of this runtime, Sico organizes work into **three loops** that together f
 
 - **Execution Loop**: turns an Operator goal into a traced agent run. The Cortex–Action–Memory stack executes inside an observable Sandbox and emits structured trajectories: actions, intermediate states, tool outputs, and environmental feedback.
 - **Evolution Loop**: converts those trajectories into reusable capability. A Reflector → Curator pipeline distills successful strategies and recurring failure patterns into a per-(project, agent) **Playbook** that is injected into the next run's workspace (training-free), while the same signals can also be fed back into base-model training (training-based).
-- **Evaluation Loop** *(planned)*: attributes *why* a task failed using an **L1–L4 taxonomy** (Task Instruction Issue · DW Capability Issue · Environment Issue …), giving the Operator a targeted correction signal and routing structured failure attributions back into the Evolution Loop.
+- **Evaluation Loop** *(planned)*: analyzes failed task trajectories and attributes the root cause using an L1–L4 failure taxonomy, from high-level ownership to concrete failure mode. The results help the Operator provide targeted corrections and feed failure insights back into Experience Learning and future training.
 
 ## Features
 
diff --git a/docs/technical_report.md b/docs/technical_report.md
index f48bcbd..b06e8ef 100644
--- a/docs/technical_report.md
+++ b/docs/technical_report.md
@@ -230,7 +230,7 @@ At the start of each chat, workspace initialization copies all relevant skills i
 
 Sandbox capabilities are exposed as HTTP APIs. For the current Android emulator sandbox, `sandbox_acquire` returns `http_api_base_url`; the agent uses the built-in `curl` tool against that URL, so the sandbox runtime only needs to ship its HTTP server, not per-endpoint agent-side wrappers. See [§4.6](#46-sandbox-observable-execution-environments) for details.
 
-### 3.3 Memory & Sense: Experience and Context
+### 3.3 Memory & Sense: Experience & Contextual Awareness
 
 A Digital Worker needs different kinds of memory at very different time scales. Rather than putting everything into a single vector store, Sico splits memory into five layers, each backed by the storage that best fits its access pattern.