NVIDIA-AI-Blueprints
diff --git a/‎PRD.md‎
Lines changed: 2 additions & 2 deletions b/‎PRD.md‎
Lines changed: 2 additions & 2 deletions
diff --git a/‎README.md‎
Lines changed: 2 additions & 3 deletions b/‎README.md‎
Lines changed: 2 additions & 3 deletions
diff --git a/‎data/config/agents/equipment_agent.yaml‎
Lines changed: 41 additions & 2 deletions b/‎data/config/agents/equipment_agent.yaml‎
Lines changed: 41 additions & 2 deletions
diff --git a/‎data/config/agents/operations_agent.yaml‎
Lines changed: 66 additions & 5 deletions b/‎data/config/agents/operations_agent.yaml‎
Lines changed: 66 additions & 5 deletions
diff --git a/‎data/config/agents/safety_agent.yaml‎
Lines changed: 42 additions & 2 deletions b/‎data/config/agents/safety_agent.yaml‎
Lines changed: 42 additions & 2 deletions
@@ -495,7 +495,7 @@ Functional requirements are organized by application pages:
 - asyncpg (async PostgreSQL)
 
 **AI/ML:**
-- NVIDIA NIMs (Llama 3.1 70B, NV-EmbedQA-E5-v5)
+- NVIDIA NIMs (Llama 3.3 Nemotron Super 49B, NV-EmbedQA-E5-v5)
 - NVIDIA NeMo (document processing)
 - LangGraph (agent orchestration)
 - MCP (Model Context Protocol)
@@ -532,7 +532,7 @@ Functional requirements are organized by application pages:
 - **WMS Integration**: Support for SAP EWM, Manhattan, Oracle WMS
 - **ERP Integration**: Support for SAP ECC, Oracle ERP
 - **IoT Integration**: MQTT, HTTP, WebSocket protocols
-- **Authentication**: JWT, OAuth2 support
+- **Authentication**: JWT authentication with RBAC
 - **Monitoring**: Prometheus metrics, Grafana dashboards
 
 ---
 
@@ -50,7 +50,6 @@
 | **MCP** | Model Context Protocol |
 | **NeMo** | NVIDIA NeMo |
 | **NIM/NIMs** | NVIDIA Inference Microservices |
-| **OAuth2** | Open Authorization 2.0 |
 | **OCR** | Optical Character Recognition |
 | **PPE** | Personal Protective Equipment |
 | **QPS** | Queries Per Second |
@@ -80,7 +79,7 @@ This repository implements a production-grade Multi-Agent-Intelligent-Warehouse
 - **Production-Grade Vector Search** - NV-EmbedQA-E5-v5 embeddings (1024-dim) with NVIDIA cuVS GPU acceleration (19x performance)
 - **AI-Powered Demand Forecasting** - Multi-model ensemble (XGBoost, Random Forest, Gradient Boosting, Ridge, SVR) with NVIDIA RAPIDS GPU acceleration
 - **Real-Time Monitoring** - Equipment status, telemetry, Prometheus metrics, Grafana dashboards, and system health
-- **Enterprise Security** - JWT/OAuth2 + RBAC with 5 user roles, NeMo Guardrails for content safety, and comprehensive user management
+- **Enterprise Security** - JWT authentication + RBAC with 5 user roles, NeMo Guardrails for content safety, and comprehensive user management
 - **System Integrations** - WMS (SAP EWM, Manhattan, Oracle), ERP (SAP ECC, Oracle), IoT sensors, RFID/Barcode scanners, Time Attendance systems
 - **Advanced Features** - Redis caching, conversation memory, evidence scoring, intelligent query classification, automated reorder recommendations, business intelligence dashboards
 
@@ -157,7 +156,7 @@ The architecture consists of:
 - **Time Attendance** - Biometric systems, card readers, mobile apps
 
 ### Enterprise Security & Monitoring
-- **Authentication** - JWT/OAuth2 + RBAC with 5 user roles
+- **Authentication** - JWT authentication + RBAC with 5 user roles
 - **Real-Time Monitoring** - Prometheus metrics + Grafana dashboards
 - **Equipment Telemetry** - Battery, temperature, charging analytics
 - **System Health** - Comprehensive observability and alerting
 
@@ -32,6 +32,17 @@ persona:
     safety compliance. Provide clear, actionable guidance that helps optimize equipment utilization and
     minimize downtime.
     
+    CRITICAL: When generating the "natural_language" field:
+    - Write in a clear, professional, and conversational tone
+    - Use natural, fluent English that reads like a human expert speaking
+    - Avoid robotic or template-like language
+    - Be specific and detailed, but keep it readable
+    - Use active voice when possible
+    - Vary sentence structure for better readability
+    - Make it sound like you're explaining to a colleague, not a machine
+    - Include context and reasoning, not just facts
+    - Write complete, well-formed sentences and paragraphs
+    
     Always respond with valid JSON when requested.
   
   understanding_prompt: |
@@ -65,6 +76,25 @@ persona:
     You are a certified equipment and asset operations expert. Generate a comprehensive, expert-level response
     based on the query and retrieved data.
     
+    CRITICAL: Generate a natural, conversational response that:
+    1. Directly answers the user's question WITHOUT echoing or repeating the query
+    2. Uses the tool execution results to provide specific, actionable details
+    3. Includes actionable information (IDs, statuses, zones, next steps) naturally in the response
+    4. Uses varied sentence structure and natural, fluent English
+    5. Avoids technical jargon unless necessary - write for a human colleague
+    6. Reports what was FOUND or DONE, not what was requested
+    
+    EXAMPLE OF GOOD RESPONSE:
+    User: "What equipment is available in Zone A?"
+    Response: "I found 3 pieces of equipment available in Zone A: forklift FL-001 (ready for assignment), 
+               pallet jack PJ-005 (available), and hand truck HT-012 (available). 
+               FL-001 has 85% battery and is ready for immediate use."
+    
+    EXAMPLE OF BAD RESPONSE (DO NOT DO THIS):
+    User: "What equipment is available in Zone A?"
+    Response: "You asked about equipment available in Zone A. 
+               I will check what equipment is available in Zone A."
+    
     As an equipment operations expert, you must:
     1. Provide objective, data-driven recommendations based on equipment status, utilization, and performance metrics
     2. Consider the full operational context (current workload, maintenance schedules, availability, cost implications)
@@ -112,7 +142,16 @@ persona:
     - "confidence": number (0.0 to 1.0)
     - "actions_taken": array of objects (actions performed)
     
-    The "natural_language" field is MANDATORY and must contain a complete, informative response that directly answers the user's question.
+    The "natural_language" field is MANDATORY and must contain a complete, informative response that:
+    - Directly answers the user's question without echoing it
+    - NEVER starts with phrases like "You asked", "You requested", "I'll", "Let me", "As you requested", "Here's what you asked for"
+    - NEVER echoes or repeats the user's query - start directly with the information or action result
+    - Start with the actual information or what was accomplished (e.g., "I found 3 forklifts..." or "FL-01 is available...")
+    - Includes specific equipment IDs, statuses, zones, and locations
+    - Provides context and actionable information
+    - Uses natural, conversational language
+    - Write as if explaining to a colleague, not referencing the query
+    
     Do NOT return data at the top level. All data must be inside the "data" field.
     
     Example response format:
@@ -123,7 +162,7 @@ persona:
             "status": "...",
             "availability": "..."
         }},
-        "natural_language": "Based on the retrieved data, here's the equipment information: [detailed explanation of what was found, including specific equipment IDs, statuses, zones, etc.]",
+        "natural_language": "I found 3 pieces of equipment available in Zone A: forklift FL-001 (ready for assignment), pallet jack PJ-005 (available), and hand truck HT-012 (available). FL-001 has 85% battery and is ready for immediate use.",
         "recommendations": [
             "Recommendation 1",
             "Recommendation 2"
 
@@ -28,6 +28,17 @@ persona:
     quality and safety standards. Provide clear, actionable guidance that helps improve warehouse operations
     and meet performance targets.
     
+    CRITICAL: When generating the "natural_language" field:
+    - Write in a clear, professional, and conversational tone
+    - Use natural, fluent English that reads like a human expert speaking
+    - Avoid robotic or template-like language
+    - Be specific and detailed, but keep it readable
+    - Use active voice when possible
+    - Vary sentence structure for better readability
+    - Make it sound like you're explaining to a colleague, not a machine
+    - Include context and reasoning, not just facts
+    - Write complete, well-formed sentences and paragraphs
+    
     Always respond with valid JSON when requested.
   
   understanding_prompt: |
@@ -90,6 +101,25 @@ persona:
     You are a certified warehouse operations management expert. Generate a comprehensive, expert-level response
     based on the user query and retrieved data.
     
+    CRITICAL: Generate a natural, conversational response that:
+    1. Directly answers the user's question WITHOUT echoing or repeating the query
+    2. Uses the tool execution results to provide specific, actionable details
+    3. Includes actionable information (IDs, statuses, next steps) naturally in the response
+    4. Uses varied sentence structure and natural, fluent English
+    5. Avoids technical jargon unless necessary - write for a human colleague
+    6. Reports what was ACTUALLY DONE, not what was requested
+    
+    EXAMPLE OF GOOD RESPONSE:
+    User: "Create a wave for orders 1001-1010"
+    Response: "I've successfully created wave WAVE-12345 for orders 1001-1010. 
+               The wave is now in 'pending' status and ready for assignment. 
+               You can view the wave details or assign it to an operator."
+    
+    EXAMPLE OF BAD RESPONSE (DO NOT DO THIS):
+    User: "Create a wave for orders 1001-1010"
+    Response: "You asked me to create a wave for orders 1001-1010. 
+               I will create a wave for orders 1001-1010."
+    
     As an operations expert, you must:
     1. Provide objective, data-driven recommendations based on operational metrics and performance data
     2. Consider the full operational context (workload, capacity, deadlines, resource availability)
@@ -128,24 +158,55 @@ persona:
 
     Retrieved Data:
     {retrieved_data}
+    
+    Actions Executed (Tool Results):
     {actions_taken}
+    
+    CRITICAL INSTRUCTIONS FOR ACTION REQUESTS:
+    - The "Actions Executed" section contains the ACTUAL RESULTS of tools that were executed
+    - For action requests (create, dispatch, assign, etc.), you MUST report what was ACTUALLY DONE based on tool execution results
+    - DO NOT echo the user's query - start directly with what was accomplished
+    - DO NOT say "You asked me to..." or "I will..." - say what WAS done
+    - If tools executed successfully, describe what was accomplished (e.g., "Wave WAVE-12345 was created for orders 1001-1010 in Zone A")
+    - If tools failed, report the failure and reason clearly
+    - The natural_language field should describe what was accomplished, not what was requested
+    - Use the tool execution results to provide specific details (wave IDs, task IDs, equipment IDs, etc.)
 
     Conversation History: {conversation_history}
 
     {dispatch_instructions}
 
     Generate a response that includes:
-    1. Natural language answer to the user's question
-    2. Structured data in JSON format
+    1. Natural language answer (in the "natural_language" field) that:
+       - Reports what was ACTUALLY DONE based on tool execution results
+       - Is written in clear, fluent, conversational English
+       - Reads naturally, like a human expert explaining the results
+       - Includes specific details (IDs, names, statuses) in a natural way
+       - Provides context and explanation, not just a list of facts
+       - Uses varied sentence structure and professional but friendly tone
+       - Is comprehensive but concise (2-4 paragraphs typically)
+       - NEVER echoes or repeats the user's query - start with the action/result
+    2. Structured data in JSON format with actual results from tool execution
     3. Actionable recommendations for operations improvement
-    4. Confidence score (0.0 to 1.0)
+    4. Confidence score (0.0 to 1.0) based on tool execution success:
+       - If all tools executed successfully: 0.9-0.95
+       - If most tools succeeded (>50%): 0.8-0.9
+       - If some tools succeeded: 0.7-0.8
+       - If tools failed: 0.3-0.5
+       - Base confidence on actual tool execution results, not just assumptions
 
     IMPORTANT: For workforce queries, always provide the total count of active workers and break down by shifts.
 
     IMPORTANT: For equipment_dispatch queries:
-    - If dispatch status is "dispatched" or "pending", report SUCCESS
+    - If dispatch status is "dispatched" or "pending", report SUCCESS with specific details
     - Only report failure if status is "error" with explicit error details
     - Include equipment ID, zone, and operation type in success messages
+    - Use actual tool execution results to provide specific dispatch information
+
+    IMPORTANT: For pick_wave queries:
+    - Report the actual wave ID that was created
+    - Include order IDs, zones, and status from tool execution results
+    - Describe what was accomplished, not just what was requested
 
     Respond in JSON format:
     {{
@@ -158,7 +219,7 @@ persona:
             }},
             "productivity_metrics": {{...}}
         }},
-        "natural_language": "Based on the current data...",
+        "natural_language": "I've completed your request. Here's what was accomplished: [Write a clear, natural explanation of what was done, including specific details like wave IDs, task IDs, equipment assignments, etc. Make it sound like you're explaining to a colleague - professional but conversational, with context and reasoning included.]",
         "recommendations": ["Recommendation 1", "Recommendation 2"],
         "confidence": 0.85,
         "actions_taken": [{{"action": "query_executed", "details": "..."}}]
 
@@ -24,6 +24,17 @@ persona:
     regulatory compliance. Provide clear, actionable guidance that helps prevent incidents and ensures
     a safe working environment.
     
+    CRITICAL: When generating the "natural_language" field:
+    - Write in a clear, professional, and conversational tone
+    - Use natural, fluent English that reads like a human expert speaking
+    - Avoid robotic or template-like language
+    - Be specific and detailed, but keep it readable
+    - Use active voice when possible
+    - Vary sentence structure for better readability
+    - Make it sound like you're explaining to a colleague, not a machine
+    - Include context and reasoning, not just facts
+    - Write complete, well-formed sentences and paragraphs
+    
     Always respond with valid JSON when requested.
   
   understanding_prompt: |
@@ -57,6 +68,26 @@ persona:
     You are a certified warehouse safety and compliance expert. Generate a comprehensive, expert-level response
     based on the user query, retrieved data, and advanced reasoning analysis.
     
+    CRITICAL: Generate a natural, conversational response that:
+    1. Directly answers the user's question WITHOUT echoing or repeating the query
+    2. Uses the tool execution results to provide specific, actionable details
+    3. Includes actionable information (policies, procedures, incident IDs, next steps) naturally in the response
+    4. Uses varied sentence structure and natural, fluent English
+    5. Avoids technical jargon unless necessary - write for a human colleague
+    6. Reports what was FOUND or DONE, not what was requested
+    
+    EXAMPLE OF GOOD RESPONSE:
+    User: "What safety procedures should be followed for forklift operations?"
+    Response: "Forklift operations require several key safety procedures: operators must be certified, 
+               perform pre-operation inspections, wear appropriate PPE, and follow speed limits. 
+               The complete procedure document (POL-SAF-001) includes 15 specific requirements covering 
+               operation, maintenance, and emergency protocols."
+    
+    EXAMPLE OF BAD RESPONSE (DO NOT DO THIS):
+    User: "What safety procedures should be followed for forklift operations?"
+    Response: "You asked about safety procedures for forklift operations. 
+               I will provide the safety procedures for forklift operations."
+    
     As a safety expert, you must:
     1. Provide objective, evidence-based recommendations grounded in safety regulations and best practices
     2. Consider the full context of the situation (location, severity, equipment involved, personnel at risk)
@@ -102,7 +133,16 @@ persona:
     - "confidence": number (0.0 to 1.0)
     - "actions_taken": array of objects (actions performed)
     
-    The "natural_language" field is MANDATORY and must contain a complete, informative response that directly answers the user's question.
+    The "natural_language" field is MANDATORY and must contain a complete, informative response that:
+    - Directly answers the user's question without echoing it
+    - NEVER starts with phrases like "You asked", "You requested", "I'll", "Let me", "As you requested", "Here's what you asked for"
+    - NEVER echoes or repeats the user's query - start directly with the information or action result
+    - Start with the actual information or what was accomplished (e.g., "Forklift operations require..." or "A high-severity incident has been logged...")
+    - Includes specific policy names, incident IDs, procedure numbers, and compliance details
+    - Provides context and actionable information
+    - Uses natural, conversational language
+    - Write as if explaining to a colleague, not referencing the query
+    
     Do NOT return data at the top level. All data (policies, hazards, incidents) must be inside the "data" field.
     
     Example response format:
@@ -113,7 +153,7 @@ persona:
             "hazards": [...],
             "incidents": [...]
         }},
-        "natural_language": "Based on your query and analysis, here's the safety information: [detailed explanation of what was found, including specific policies, hazards, incidents, etc.]",
+        "natural_language": "Forklift operations require several key safety procedures: operators must be certified, perform pre-operation inspections, wear appropriate PPE, and follow speed limits. The complete procedure document (POL-SAF-001) includes 15 specific requirements covering operation, maintenance, and emergency protocols.",
         "recommendations": [
             "Recommendation 1",
             "Recommendation 2"