[Feature] Behavioral trust scoring for multi-agent tool calls via MCP Observatory #7714
Replies: 2 comments
-
|
This is very close to a failure mode I’m testing around runtime trust gates. Before an agent follows an external task or calls an external tool, I’m trying to see whether it can verify the basics:
I’m collecting a few quick external runs here: https://the-agents-of-nations.vercel.app/llms.txt No full review needed. A failed run is useful if it shows where the trust gate should have caught the problem. |
Beta Was this translation helpful? Give feedback.
-
|
Exactly the problem we solved. AgentGate's behavioral dimension (20% of total trust) tracks request velocity, access pattern anomalies, and cross-session patterns across 24h. The key: score the sequence, not just the individual request. An agent reading 10 files in 5 minutes has a different behavioral score than the same 10 files read over 2 hours. https://www.tryagentgate.com/ |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Problem
In multi-agent systems, agents delegate to other agents and call external tools (MCP servers, APIs). AutoGen currently treats all registered tools as equally trusted at runtime. There's no mechanism to check whether a tool server is behaving normally right now — a server that passed all checks yesterday could be timing out, returning anomalous data, or exhibiting behavioral drift today.
This matters more in AutoGen than single-agent frameworks because agent-to-agent delegation amplifies tool failures. When AgentA hands off to AgentB which calls ToolC, a behavioral anomaly at ToolC cascades through the chain before anyone detects it.
Proposal
Integrate optional runtime behavioral trust scoring using the Dominion Observatory — a behavioral trust registry tracking 14,800+ MCP servers with anonymized telemetry (latency, success rate, anomaly detection).
The integration would:
What this looks like in AutoGen
Why this matters for AutoGen specifically
Existing infrastructure
dominion-observatory.sgdata.workers.dev, tracking 14,800+ servers, 87,000+ interactionspip install dominion-observatory—check_trust(server_url)returns trust score + anomaly flagsObservatoryTrustCallbackHandler— same before/after pattern@dominion/trust-provideron npm withbeforeSettlehook for x402 protocolIntegration points
Cleanest integration options:
TrustGateMiddlewareas anautogen_extextensionHappy to contribute a PR. The Python SDK and hook pattern are built — it's a matter of wiring into AutoGen's tool execution lifecycle.
References
Beta Was this translation helpful? Give feedback.
All reactions