Skip to content

anishgude/AutomationRiskAI

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Human Override Gate

Problem Statement

Risk-gated automation requires routing potentially unsafe or high-risk customer requests to a human reviewer instead of fully automated handling.

Task Definition

Input: a customer message. Output: a JSON decision with override_required, reason_codes, confidence, and recommended_next_step.

Baselines

  • Rules baseline: keyword-based escalation logic.
  • Prompted baseline: base OpenAI model with a strict JSON-only prompt.

Metrics

  • Precision
  • Recall
  • False negative rate (most important)
  • Confusion matrix for override_required

How to Run

pip install -r requirements.txt
$env:OPENAI_API_KEY="YOUR_KEY"; $env:OPENAI_FT_MODEL="YOUR_FT_MODEL"; $env:OPENAI_BASE_MODEL="gpt-4o-mini"
python eval/evaluate.py

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages