(FULL PIPELINE IS IN 'STAGING BRANCH', The main branch is outdated for now)

An autonomous, multi-agent security operations platform that combines deterministic cybersecurity logic with adaptive LLM reasoning to triage alerts at scale.

Overview

The Enterprise Agentic SOC is a state-of-the-art autonomous security operations platform designed to handle the overwhelming volume of security alerts in modern SOCs. By combining deterministic pattern matching (Signal Engine) with adaptive LLM reasoning (Multi-Agent System), it achieves:

Zero hallucinations on foundational security facts
Transparent decision-making with complete audit trails
Scalable 24/7 triage without human fatigue
Dual LLM Support: Local models (Ollama) or external APIs (OpenAI-compatible)

The Problem It Solves

Modern SOCs face:

Alert fatigue: 1000+ alerts per day, 95% false positives
Inconsistent triage: Human analysts vary in experience and approach
Slow response times: Manual investigation takes 15-30 minutes per alert
Lack of transparency: "Black box" AI decisions without audit trails

The Solution

A hybrid intelligence platform that:

Uses deterministic logic to extract 50+ behavioral signals (AMSI bypasses, LOTL binaries, encoded commands)
Employs multi-agent AI to reason about complex attack patterns
Enforces policy guardrails at every decision point
Maintains 100% audit compliance with complete investigation records
Delivers structured triage reports directly to Jira via email (Resend SDK)

Key Features

Hybrid Intelligence & Dual LLM Support

Signal Engine: Deterministic regex/heuristic-based detection of 50+ attack patterns
Multi-Agent System: Specialized AI agents for intake, investigation, reasoning, and decision-making
Flexible LLM Backend: Interactive selection between Local (Ollama) and External APIs
RAG-Enhanced: MITRE ATT&CK knowledge base integration via vector search

Enterprise-Grade Governance & Notifications

Policy Engine: Runtime enforcement of tool permissions, depth limits, and forbidden actions
Dual Confidence Gates: Operational (fact-based) and Analytical (reasoning quality) checks
Duplicate Detection: Tool-call hashing prevents redundant API calls
Internal Triage Notifications: High-fidelity reports sent via Resend SDK

Active Investigation Capabilities

MCP Tool Integration: SIEM queries, VirusTotal lookups, CloudTrail audits (Entra optional)
Evidence-Anchored Queries: Quoted values, narrow time windows, and laddered expansion
Query Deduping: Per-loop suppression of identical tool calls
Iterative Evidence Gathering: Up to 10 investigation loops with intelligent stopping criteria

Deterministic Risk Scoring

Risk Matrix 0–100: Weighted evidence scoring with transparent contributions
Strict thresholds: Benign 0–20, Suspicious 21–60, Malicious 61–100
Reproducible output: No speculative scoring or hidden heuristics

System Architecture

High-Level Component View

graph TB
    %% Styling
    classDef ingestion fill:#1a237e,stroke:#0d47a1,stroke-width:3px,color:#fff
    classDef normalization fill:#283593,stroke:#1976d2,stroke-width:3px,color:#fff
    classDef intel fill:#512da8,stroke:#7b1fa2,stroke-width:3px,color:#fff
    classDef governance fill:#c62828,stroke:#d32f2f,stroke-width:3px,color:#fff
    classDef agents fill:#00695c,stroke:#00897b,stroke-width:3px,color:#fff
    classDef tools fill:#e65100,stroke:#f57c00,stroke-width:3px,color:#fff
    classDef decision fill:#2e7d32,stroke:#43a047,stroke-width:3px,color:#fff
    classDef audit fill:#37474f,stroke:#546e7a,stroke-width:3px,color:#fff
    classDef feedback fill:#455a64,stroke:#607d8b,stroke-width:3px,color:#fff

    Start([Raw Security Alert<br/>Elastic/Splunk]) --> Ingest

    subgraph "Layer 1: Ingestion & Normalization"
        Ingest[Alert Ingestor<br/>intake/ingest.py]:::ingestion
        Ingest --> PreClass{Pre-Classifier<br/>intake/pre_classifier.py}:::ingestion
        PreClass -->|Empty Alert| Exit1[Skip Processing]
        PreClass -->|Valid Alert| SignalEng[Signal Engine<br/>intake/logic.py]:::normalization
        
        SignalEng --> Signals[50+ Behavioral Signals<br/>- AMSI Bypass<br/>- LOTL Binaries<br/>- Encoded Commands<br/>- External Comms]:::normalization
        Signals --> Schema[Normalized Schema<br/>schemas/alert.py<br/>NormalizedSecurityAlert]:::normalization
    end

    Schema --> Policy

    subgraph "Layer 2: Governance & Intelligence"
        Policy[Policy Engine<br/>control/policy_engine.py]:::governance
        Policy --> PolicyCheck{Policy Decision<br/>- Tool Permissions<br/>- Max Depth<br/>- Forbidden Actions}:::governance
        
        PolicyCheck -->|Denied| Exit2[Policy Violation]
        PolicyCheck -->|Approved| MITRE
        
        MITRE[MITRE RAG<br/>context/rag.py<br/>rag/vectordb.py]:::intel
        MITRE --> VectorDB[(Chroma Vector DB<br/>ATT&CK Techniques<br/>D3FEND Controls)]:::intel
        VectorDB --> TechMap[Technique Mapping<br/>context/mitre_map.py]:::intel
    end

    TechMap --> LLMChoice

    subgraph "Layer 3: Multi-Agent Orchestration"
        LLMChoice{LLM Selection<br/>Local vs External}:::agents
        LLMChoice --> IntakeAgent[Intake Agent<br/>agents/intake_agent.py<br/>Role: Gatekeeper]:::agents
        
        IntakeAgent --> IntakeDecision{Decision Logic<br/>Confidence >95%?}:::agents
        IntakeDecision -->|Benign| Close[Auto-Close<br/>No Investigation]
        IntakeDecision -->|Suspicious| InvLoop[Investigation Loop]:::agents
        
        InvLoop --> ConfGate1{Confidence Gate 1<br/>core/confidence.py<br/>Operational Check}:::governance
        ConfGate1 -->|Low Confidence| InvAgent
        ConfGate1 -->|High Confidence| ReasonAgent
        
        InvAgent[Investigation Agent<br/>agents/investigation_agent.py<br/>Role: Tier-1 Investigator]:::agents
        InvAgent --> Intent[Intent Generation<br/>LLM-Driven]:::agents
        
        Intent --> Planner[Deterministic Planner<br/>control/planner.py]:::governance
        Planner --> PlanLogic{Planning Strategy<br/>1. MITRE-Based<br/>2. Intent-Based<br/>3. Signal-Based}:::governance
        
        PlanLogic --> ToolPlan[Tool Execution Plan<br/>List of Tool Calls]:::tools
        ToolPlan --> ToolCheck{Policy Check<br/>Duplicate Detection<br/>Permission Validation}:::governance
        
        ToolCheck -->|Denied| Skip[Skip Tool]
        ToolCheck -->|Approved| Executor
        
        Executor[Tool Executor<br/>tools/executor.py]:::tools
        Executor --> MCP[MCP Tools<br/>mcp_server/tools/]:::tools
        
        MCP --> SIEM[SIEM Query<br/>siem.py]:::tools
        MCP --> VT[VirusTotal<br/>virustotal.py]:::tools
        MCP --> Cloud[CloudTrail<br/>cloudtrail.py]:::tools
        
        SIEM & VT & Cloud --> Formatter[Output Formatter<br/>mcp_server/formatter.py]:::tools
        Formatter --> Summarizer[Result Summarizer<br/>tools/summarizer.py<br/>Smart Context Pruning]:::tools
        
        Summarizer --> Evidence[Evidence Store<br/>schemas/state.py<br/>InvestigationState]:::audit
        Evidence --> LoopAudit[Loop Audit Record<br/>Intent + Tools + Results]:::audit
        
        LoopAudit --> TokenGuard{Token Guard<br/>control/token_guard.py<br/>Iteration Limit Check}:::governance
        TokenGuard -->|Max Iterations| ForceExit[Force Exit to Decision]
        TokenGuard -->|Continue| InvLoop
        
        ReasonAgent[Reasoning Agent<br/>agents/reasoning_agent.py<br/>Role: Tier-2 Analyst]:::agents
        ReasonAgent --> Analysis[Deep Analysis<br/>Connect the Dots<br/>Build Narrative]:::agents
        
        Analysis --> ConfGate2{Confidence Gate 2<br/>Analytical Check}:::governance
        ConfGate2 -->|Uncertain| InvLoop
        ConfGate2 -->|Confident| DecAgent
    end

    ForceExit --> DecAgent
    
    subgraph "Layer 4: Final Authority & Output"
        DecAgent[Decision Agent<br/>agents/decision_agent.py<br/>Role: SOC Manager]:::decision
        DecAgent --> FinalDecision{Final Classification<br/>- Benign<br/>- Suspicious<br/>- Malicious}:::decision
        
        FinalDecision --> JSONOut[Structured Output<br/>Summary + Evidence Table<br/>Score + Action]:::decision
        
        JSONOut --> EmailNotify[Email Notification<br/>Resend SDK Integration<br/>Triage Report]:::decision
        EmailNotify --> AuditExport[Audit Trail Export<br/>Complete Investigation Record]:::audit
    end
    
    AuditExport --> End([Case Resolution])

    subgraph "Feedback Loop"
        FeedbackIn[Jira Automation Webhook<br/>feedback_api/app.py]:::feedback
        FeedbackIn --> FeedbackNormalize[Normalize + Store]:::feedback
        FeedbackNormalize --> FeedbackStore[(Feedback DB)]:::feedback
        FeedbackStore --> FeedbackRAG[Feedback Retrieval<br/>context/feedback_rag.py]:::feedback
        FeedbackRAG --> IntakeAgent
    end

Workflow Pipeline

Phase 1: Ingestion & Normalization

1. Alert Ingestor (intake/ingest.py)

Fetches raw alerts from Elastic using .alerts-security.alerts-*
Maps ECS fields to NormalizedSecurityAlert

2. Pre-Classifier (intake/pre_classifier.py)

Validates alert health and rejects empty/malformed alerts

3. Signal Engine (intake/logic.py)

Deterministic logic: 50+ behavioral signals (AMSI bypass, LOTL, etc.)
Signals are injected into the LLM context as structured flags

Phase 2: Governance & Intelligence

4. Policy Engine (control/policy_engine.py)

Enforces tool permissions, depth limits, forbidden actions

5. MITRE RAG (context/rag.py, rag/vectordb.py)

Semantic search over ATT&CK via ChromaDB

Phase 3: Multi-Agent Orchestration

6. LLM Support Selection

Interactive selection between Local (Ollama) or External (OpenAI-compatible)

7. Intake Agent (agents/intake_agent.py)

Gatekeeper: filters obvious false positives with >95% confidence threshold

8. Investigation Agent (agents/investigation_agent.py)

Tier-1 investigator: generates technical tool intents

9. Deterministic Planner (control/planner.py)

Converts intents into evidence-anchored tool plans with quoted values
Avoids duplicate SIEM queries per loop
Uses narrow time windows (default +/- 3 minutes)
Prefers event.code over free-text message filters

10. Reasoning Agent (agents/reasoning_agent.py)

Tier-2 analyst: synthesizes evidence into narratives

Phase 4: Final Authority & Output

11. Decision Agent (agents/decision_agent.py)

Produces summary + evidence table + risk score + classification
Outputs 4-6 sentence SOC analyst close note with timestamps and IOCs

12. Email Notification (utils/email_notifier.py)

Resend SDK integration for Jira/SOC routing

13. Audit Trail Export

Writes audit_trail_[alert_id].json for full transparency

Module Documentation

Layer 1: Ingestion & Normalization

Module	File	Purpose
Alert Ingestor	`intake/ingest.py`	Fetches raw alerts from Elastic, maps ECS fields
Signal Engine	`intake/logic.py`	Deterministic logic: 50+ behavioral signals

Layer 2: Governance & Intelligence

Module	File	Purpose
Policy Engine	`control/policy_engine.py`	Enforces tool permissions and depth limits
MITRE RAG	`context/rag.py`	Vector search over ATT&CK techniques

Layer 3: Multi-Agent Orchestration

Module	File	Purpose
LLM Client	`llm/client.py`	Unified client supporting Local and External
Intake Agent	`agents/intake_agent.py`	High-confidence gatekeeper
Investigation Agent	`agents/investigation_agent.py`	Hypothesis-driven evidence gathering
Reasoning Agent	`agents/reasoning_agent.py`	Tier-2 analyst narratives
Decision Agent	`agents/decision_agent.py`	Final authority with structured output

Layer 4: Notifications & Reporting

Module	File	Purpose
Email Notifier	`utils/email_notifier.py`	Resend SDK integration
Result Summarizer	`tools/summarizer.py`	Context pruning and evidence shaping

Installation

Prerequisites

Python 3.10+
Elasticsearch (accessible via API)
Ollama (for local models)
Resend API Key (for notifications)

Setup

Clone the repo
pip install -r requirements.txt
Install Ollama and pull your preferred model (e.g., ollama pull llama3.1:8b)
Initialize the vector DB: python rag/ingestion.py

Configuration

Create a .env file from .env.example:

# Elastic SIEM
ELASTIC_BASE_URL=https://your-elastic-url:9200
ELASTIC_API_KEY=your-key

# LLM Configuration
LLM_MODEL="llama3.1:8b" # Local
EXTERNAL_LLM_API_KEY="sk-..."
EXTERNAL_LLM_URL="https://api.openai.com/v1/chat/completions"
EXTERNAL_LLM_MODEL="gpt-5.2"

# VirusTotal Configuration
VT_API_KEY="your-vt-api-key"

# Notifications (Resend API)
RE_SEND_KEY="re_..."
NOTIFY_EMAIL="jira@yourdomain.atlassian.net"
FROM_EMAIL="soc-ai@yourdomain.com"

# Feedback Loop
FEEDBACK_API_KEY="your-key"
FEEDBACK_DB_PATH=feedback_api/feedback.db

Output Format

Decision output is deterministic and structured:

Summary paragraph (4–6 sentences, SOC L2/3 tone, timestamps + IOCs)
Evidence table (category, evidence, weight, confidence, contribution)
Final score (0–100)
Final classification (Benign/Suspicious/Malicious)
Recommended action (Close/Escalate/Contain)

Usage

Run the orchestrator:

python main.py

The system fetches alerts from Elastic.
An interactive prompt asks you to select Local or External LLM.
The multi-agent pipeline executes the investigation.
Final triage reports are logged locally and emailed to your configured recipient.

Project Structure

soc-ai-TriageAgent/
├── agents/             # Multi-agent roles (Intake, Investigation, Reasoning, Decision)
├── control/            # Governance layer (Policy, Planner, Token Guard)
├── intake/             # Ingestion & Signal Engine
├── utils/              # Resend Email Notifier, Pipeline Logging
├── llm/                # Unified Client (Local/External)
├── rag/                # Vector DB and MITRE data ingestion
├── schemas/            # Pydantic state and alert models
├── tools/              # MCP tool orchestration and summarization
├── feedback_api/       # Feedback webhook + storage
└── main.py             # Entry point

Feedback Loop

FastAPI listener that receives Jira webhooks, normalizes payloads, and stores feedback for retrieval in new investigations.

POST https://api.gabessoc.com/webhook/jira
Header: X-API-Key: <FEEDBACK_API_KEY>

Run locally:

python -m uvicorn feedback_api.app:app --host 0.0.0.0 --port 8001

Detailed Documentation

Deep technical details live in the repo wiki:

Scoring engine: docs/wiki/SCORING_ENGINE.md
Signals engine: docs/wiki/SIGNALS_ENGINE.md
MCP + SIEM guardrails: docs/wiki/MCP_AND_SIEM.md
Full pipeline breakdown: docs/wiki/PIPELINE.md
Loop scratch sheet: docs/wiki/PIPELINE_LOOP.md

Built with love for the SOC community

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
Rag-Data		Rag-Data
agents		agents
chroma_db		chroma_db
context		context
control		control
core		core
docs/wiki		docs/wiki
elastic		elastic
feedback_api		feedback_api
intake		intake
llm		llm
mcp_server		mcp_server
rag		rag
schemas		schemas
tools		tools
utils		utils
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
config.yaml		config.yaml
main.py		main.py

Folders and files

Latest commit

History

Repository files navigation

(FULL PIPELINE IS IN 'STAGING BRANCH', The main branch is outdated for now)

Table of Contents

Overview

The Problem It Solves

The Solution

Key Features

Hybrid Intelligence & Dual LLM Support

Enterprise-Grade Governance & Notifications

Active Investigation Capabilities

Deterministic Risk Scoring

System Architecture

High-Level Component View

Workflow Pipeline

Phase 1: Ingestion & Normalization

Phase 2: Governance & Intelligence

Phase 3: Multi-Agent Orchestration

Phase 4: Final Authority & Output

Module Documentation

Layer 1: Ingestion & Normalization

Layer 2: Governance & Intelligence

Layer 3: Multi-Agent Orchestration

Layer 4: Notifications & Reporting

Installation

Prerequisites

Setup

Configuration

Output Format

Usage

Project Structure

Feedback Loop

Detailed Documentation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages