
Viz.ChatGrid: add Context Management Module for Large SQL-Backed AI Chat Responses #191

@Walter462

Description


Problem

Viz.ChatGrid currently passes raw SQL query results directly into the LLM prompt when generating chat responses.
When result sets are large (multi-state, multi-year, etc.), this causes:

  • LLM context window overflow (the current limit is 4k tokens)
  • Truncation of SQL output
  • Biased answers caused by SQL ordering (e.g. only the first state like Alabama appears in a multi-state query)

This makes chat responses incorrect, misleading, or incomplete.
The system currently relies on auto-generated SQL filtering and ordering to control what fits into the LLM prompt, which is fragile and unsupervised.

Root Cause

There is no explicit layer responsible for:

  • Token budgeting
  • Result sampling / ranking
  • Summarization
  • Format-specific context shaping

The LLM receives raw database output, not curated context.
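To make "token budgeting" concrete, here is a minimal sketch of a budget check. The helper names and the ~4-characters-per-token heuristic are assumptions for illustration; a real implementation would count tokens with the model's actual tokenizer (e.g. tiktoken).

```python
def estimate_tokens(text: str) -> int:
    """Rough token estimate using a ~4 chars/token heuristic.
    A real implementation would use the model's tokenizer."""
    return max(1, len(text) // 4)

def fits_budget(rows: list[str], budget_tokens: int = 4000) -> bool:
    """Check whether serialized SQL rows fit the LLM context budget."""
    total = sum(estimate_tokens(r) for r in rows)
    return total <= budget_tokens
```

A check like this is the gate that decides whether the module can pass rows through verbatim or must fall back to sampling or aggregation.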

Proposed Solution

Add a Context Management Module between the database and ChatResponsePrep.

```mermaid
flowchart TB
  user>"USER"]
  user --"natural lang query"--> chat
  subgraph "Front End"
    chat[["CHAT"]]
  end
  subgraph "Back End"
    LangChain[\"LangChain"/]
    DB[("DB")]
    LangChain --"SQL request"--> DB
    ChatResponsePrep[\"Generate natural-language AI response for chat"/]
    OpenAIChatResponse["OpenAI"]
    chat --"natural lang query"--> LangChain
    ChatResponsePrep --"context-enriched OpenAI request"--> OpenAIChatResponse
    OpenAIChatResponse --"SQL-grounded, context-powered reply"--> ChatResponsePrep
    ContextManagementModule(("Context
    Management
    Module"))
    DB --"raw data"--> ContextManagementModule
    ContextManagementModule --> ChatResponsePrep
    LangChain --"natural lang query"--> ChatResponsePrep
    ChatResponsePrep --"human-readable reply"-----> chat
  end
  style ContextManagementModule fill:#000,stroke:#f55,stroke-width:4px,color:#fff,stroke-dasharray: 5 5
```

This module should:

  • Enforce token budgets
  • Select representative rows when full data doesn’t fit
  • Perform summarization or aggregation where appropriate
  • Provide structured, format-aware context to the LLM
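The "select representative rows" responsibility could look like the stratified, round-robin sampler below. This is a hypothetical sketch: the function name, the dict-per-row shape, and the `group_key` parameter are all assumptions, not existing Viz.ChatGrid code.

```python
from collections import defaultdict

def select_representative_rows(rows: list[dict], group_key: str, max_rows: int) -> list[dict]:
    """Stratified selection: guarantee every group (e.g. every state)
    is represented before spending budget on extra rows per group."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[group_key]].append(row)

    # Round-robin across groups so no single group (e.g. Alabama,
    # first in SQL ordering) dominates the selected context.
    queues = list(groups.values())
    selected = []
    i = 0
    while len(selected) < max_rows and any(queues):
        q = queues[i % len(queues)]
        if q:
            selected.append(q.pop(0))
        i += 1
    return selected
```

Round-robin selection directly addresses the Alabama-bias failure mode: with a budget of N rows and G groups, each group contributes roughly N/G rows instead of the first group consuming the whole budget.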

Goals / Acceptance Criteria

The system should guarantee that:

  • Multi-group queries (e.g., multi-state, multi-year) always include all groups
  • No single category dominates context due to SQL ordering
  • The LLM never receives raw, unbounded SQL dumps
  • Context fits within LLM token limits while preserving correctness

Possible Implementations (non-binding)

  • MCP-based context governance (schema, budgets, coverage guarantees)
  • Result ranking + sampling
  • Group-aware row selection
  • SQL-native aggregation (GROUP BY, COUNT, SUM, AVG, etc.)
  • Pre-LLM aggregation (per state, per year, etc.)
  • RAG-style retrieval
  • LLM-based summarization before final prompt
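As a sketch of the SQL-native aggregation option, the snippet below collapses per-row data into one summary row per group before anything reaches the LLM. The in-memory SQLite database and the `cases` table schema are stand-ins for illustration, not the real Viz.ChatGrid schema.

```python
import sqlite3

# In-memory stand-in for the real database; schema is illustrative.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE cases (state TEXT, year INTEGER, count INTEGER)")
conn.executemany(
    "INSERT INTO cases VALUES (?, ?, ?)",
    [("Alabama", 2020, 10), ("Alabama", 2021, 12),
     ("Alaska", 2020, 3), ("Alaska", 2021, 5)],
)

# GROUP BY guarantees one bounded row per state, so context size
# scales with the number of groups, not the number of raw rows.
summary = conn.execute(
    "SELECT state, SUM(count), AVG(count) FROM cases "
    "GROUP BY state ORDER BY state"
).fetchall()
```

Pushing aggregation into SQL keeps the coverage guarantee (every group appears) and the token bound (row count equals group count) without any LLM-side sampling logic.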
