[agentic_model_enabler]: Add agent (openvinotoolkit#3679)

as-suvorov · Copilot · web-flow · commit b67567404948 · 2026-04-14T18:19:44.000Z
## Description  ## Checklist: - [x] This PR follows [GenAI Contributing guidelines](https://github.com/openvinotoolkit/openvino.genai?tab=contributing-ov-file#contributing).  - [NA] Tests have been updated or added to cover the new code.  - [x] This PR fully addresses the ticket.  - [NA] I have made corresponding changes to the documentation.  --------- Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>
diff --git a/.github/agents/model-enabler.agent.md b/.github/agents/model-enabler.agent.md
@@ -0,0 +1,60 @@
+---
+description: "Enable a new model for OpenVINO GenAI. Use when: validating a new HuggingFace model, checking model support, running model-checker, updating supported-models docs, model enablement workflow, enabling a model with optimum-intel."
+tools: [read, edit, search, execute, todo]
+argument-hint: "<model_id> <task>  e.g. google/gemma-3-4b-it image-text-to-text"
+---
+
+You are the OpenVINO GenAI Architect. Your job is to fully enable a new HuggingFace model by validating it works with OpenVINO GenAI and updating the site documentation to reflect its support.
+
+## Skills
+
+| Skill         | Path                                    |
+| ------------- | --------------------------------------- |
+| model-checker | `.github/skills/model-checker/SKILL.md` |
+| update-docs   | `.github/skills/update-docs/SKILL.md`   |
+
+## Inputs
+
+Expect the user to provide:
+
+- **model_id**: HuggingFace model identifier (e.g. `google/gemma-3-4b-it`)
+- **task**: optimum export task (e.g. `image-text-to-text`, `text-generation-with-past`)
+
+If either is missing, ask for them before proceeding.
+
+## Workflow
+
+### Step 1: Model Validation
+
+Read and follow the **model-checker** skill.
+
+Read **model-checker** step results. Depending on the results:
+
+- If all steps passed, proceed to Step 3.
+- If optimum-intel export failed or wwb results for optimum-intel below threshold, proceed to Step 4. Provide a summary of the failure and relevant log paths for context.
+- If GenAI inference test failed or wwb results for GenAI below threshold, proceed to Step 2.
+
+### Step 2: Model Enablement
+
+Proceed with model enablement.
+
+### Step 3: Documentation Update
+
+Read and follow the **update-docs** skill.
+
+### Step 4: Final Report
+
+Report a structured summary:
+
+- **Model**: `<model_id>` (`<task>`)
+- **Validation**: PASSED / FAILED
+- **Performance** (if passed):
+  - 1st token latency, 2nd token latency, throughput
+  - Optimum similarity / GenAI similarity
+- **Model Enablement Status**:
+  - **Enabled/Not Enabled** if passed all model-checker steps
+  - **Details**: Provide a summary of changes. Highlight design and architectural decisions made during enablement.
+- **Docs Update Status**:
+  - **Updated/Not Updated** if updated the supported models docs
+  - **Details**: summary of doc changes.
+- **Details**: Additional details required for context, next steps, or follow-ups.
diff --git a/.github/skills/model-checker/SKILL.md b/.github/skills/model-checker/SKILL.md
@@ -1,10 +1,10 @@
 ---
-name: genai-model-checker
+name: model-checker
 description: "Validate a newly supported optimum-intel model with OpenVINO GenAI. Use when: checking new model support, verifying model export to OpenVINO IR, running GenAI inference test with llm_bench, benchmarking model accuracy with who-what-benchmark."
 argument-hint: "model_id and task (e.g. tencent/HY-MT1.5-1.8B text-generation-with-past)"
 ---
 
-# GenAI Model Checker
+# Model Checker
 
 Validates that a HuggingFace model exported via optimum-intel works correctly with OpenVINO GenAI pipelines and passes accuracy benchmarks.
 
@@ -31,13 +31,15 @@ The user must provide:
 
 ## Prerequisites
 
-Activate the Python virtual environment before running any commands.
+Ensure the Python virtual environment is activated before running any commands.
 
 1. **Locate the virtual environment** — check for common directories at the repository root: `.venv/`, `venv/`, `env/`. Use `list_dir` to find it. If none is found, ask the user for its location.
-2. **Activate** based on the current platform:
+2. **Check if already activated**: if `which python` or `where python` points inside the virtual environment, it's already activated. If not, proceed to activate it.
+3. **Activate** based on the current platform:
    - **Linux/macOS**: `source <venv_path>/bin/activate`
    - **Windows (cmd)**: `<venv_path>\Scripts\activate.bat`
    - **Windows (PowerShell)**: `<venv_path>\Scripts\Activate.ps1`
+4. The background terminal doesn't inherit the venv activation. Run it with the venv activated in the same command.
 
 ## Procedure
 
@@ -46,13 +48,13 @@ Activate the Python virtual environment before running any commands.
 Run the checker script from the repository root:
 
 ```
-python3 .github/skills/genai-model-checker/scripts/check_model.py \
+python3 .github/skills/model-checker/scripts/check_model.py \
     --model-id <model_id> \
     --task <export_task> \
-    --work-dir /tmp/genai-model-check
+    --work-dir .model_enabler/model_checker
 ```
 
-Run `python3 .github/skills/genai-model-checker/scripts/check_model.py --help` for the full argument reference including defaults.
+Run `python3 .github/skills/model-checker/scripts/check_model.py --help` for the full argument reference including defaults. The `--work-dir` is where all intermediate files, logs, and outputs will be stored. Do not pipe with any additional logging or redirection — the script handles its own logging.
 
 #### Skip flags (for re-runs after a fix)
 
@@ -81,9 +83,19 @@ The script logs progress for each step and exits with code 0 (pass) or non-zero
 
 **Log files:** each tool writes its own dedicated log; paths are printed during execution. When a step fails, read the corresponding log for the full traceback and context before drawing any conclusions.
 
+**work-dir:** work-dir is in current workspace, prefer to use tool calls to access logs and outputs instead of custom bash commands.
+
 ### Step 3: Report Results
 
-Report results to the user. If all steps pass, indicate success and provide performance metrics and accuracy data. If a step fails, analyze tool logs and provide a summary with the failed tool log path.
+Results format:
+
+- **Model**: `<model_id>` (`<task>`)
+- **Validation**: PASSED / FAILED
+- **Performance** (if passed):
+  - 1st token latency, 2nd token latency, throughput
+  - Optimum similarity / GenAI similarity (if applicable)
+- **Logs**: paths to export log, llm_bench log, WWB logs
+- **Failed step analysis** (if failed): summary of the failure and relevant log path for details
 
 ### Security
 
diff --git a/.github/skills/model-checker/scripts/check_model.py b/.github/skills/model-checker/scripts/check_model.py
diff --git a/.gitignore b/.gitignore
@@ -44,3 +44,6 @@ __pycache__
 
 # CodeQL artifacts
 _codeql_detected_source_root
+
+# agentic model enabler
+.model_enabler