Conversation

@rexa0310 rexa0310 commented Dec 19, 2025

Summary by CodeRabbit

  • New Features

    • Enhanced LLM operation logging with additional metadata tracking, including timing, cost, and provider information.
  • Bug Fixes

    • Improved response data handling with refined filtering and parsing.

coderabbitai bot commented Dec 19, 2025

Walkthrough

The logger module refines internal handling of LLM response data by parsing the original_response from raw strings, stripping sensitive fields (model, data, object), and enriching the logging payload with additional metadata including timestamps, costs, and LLM-specific fields (llm_call_id, llm_provider, model_params, metadata).

Changes

Cohort / File(s) Summary
LLM Logger Response Handling
kairon/shared/llm/logger.py
Refined original_response parsing with empty dict defaults; strips sensitive top-level fields from dict responses; updates logging payload to pass cleaned response with enriched metadata (start_time, end_time, cost, llm_call_id, llm_provider, model_params, metadata)

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~10 minutes

  • Focus on understanding the sensitive field stripping logic (model, data, object removal)
  • Verify the response parsing handles edge cases (null, malformed JSON, non-dict responses)
  • Confirm new metadata fields align with expected log saver interface and downstream consumers
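The edge cases in the second bullet can be exercised with a small defensive parser. This is a sketch of the hardened behavior the review asks for, not the code under review; parse_original_response is a hypothetical name.

```python
import json

def parse_original_response(raw):
    """Parse raw, falling back to {} for None or malformed JSON
    (hypothetical hardened version of the reviewed logic)."""
    try:
        parsed = json.loads(raw) if raw else {}
    except (json.JSONDecodeError, TypeError):
        return {}
    # Stripping only applies to dict responses; other JSON values
    # (lists, strings, numbers) pass through untouched.
    if isinstance(parsed, dict):
        for field in ("model", "data", "object"):
            parsed.pop(field, None)
    return parsed

assert parse_original_response(None) == {}            # null input
assert parse_original_response("not json") == {}      # malformed JSON
assert parse_original_response("[1, 2]") == [1, 2]    # non-dict response
assert parse_original_response('{"model": "x", "id": 1}') == {"id": 1}
```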

Poem

🐰 A rabbit hops through logs with care,
Stripping secrets here and there,
Parsing responses, clean and bright,
With metadata enriched just right!

Pre-merge checks and finishing touches

❌ Failed checks (1 warning, 1 inconclusive)

  • Docstring Coverage (⚠️ Warning): Docstring coverage is 0.00%, below the required threshold of 80.00%. Run @coderabbitai generate docstrings to improve docstring coverage.
  • Title check (❓ Inconclusive): The title 'updated llm logs' is vague and generic; it does not convey the specific nature or significance of the changes. Consider a more specific title, such as 'Refine LLM logging with response parsing and metadata enrichment' or 'Add sensitive field filtering and metadata to LLM logs'.

✅ Passed checks (1 passed)

  • Description Check (✅ Passed): Check skipped because CodeRabbit's high-level summary is enabled.

@coderabbitai coderabbitai bot left a comment

Actionable comments posted: 1

📜 Review details

Configuration used: defaults

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between a74964e and 417c8ae.

📒 Files selected for processing (1)
  • kairon/shared/llm/logger.py (1 hunks)
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (3)
  • GitHub Check: Codacy Static Code Analysis
  • GitHub Check: Analyze (python)
  • GitHub Check: Python CI
🔇 Additional comments (1)
kairon/shared/llm/logger.py (1)

38-45: LGTM with dependency on exception handling.

The logging payload correctly passes the cleaned original_response along with enriched metadata. This is consistent with the parsing logic above. However, ensure that the exception handling for JSON parsing (flagged in lines 30-36) is addressed to prevent potential runtime failures.

Comment on lines +30 to +36
raw = kwargs.get("original_response")
original_response = json.loads(raw) if raw else {}

if isinstance(original_response, dict):
original_response.pop("model", None)
original_response.pop("data", None)
original_response.pop("object", None)

⚠️ Potential issue | 🔴 Critical

🧩 Analysis chain

🏁 Script executed:

cat -n kairon/shared/llm/logger.py | head -50

Repository: digiteinfotech/kairon

Length of output: 2497


🏁 Script executed:

grep -n "class LLMLogs" kairon/shared/llm/data_objects.py

Repository: digiteinfotech/kairon

Length of output: 91


🏁 Script executed:

sed -n '4,80p' kairon/shared/llm/data_objects.py

Repository: digiteinfotech/kairon

Length of output: 384


🏁 Script executed:

rg "DynamicField" kairon/shared/llm/data_objects.py -B 5 -A 2

Repository: digiteinfotech/kairon

Length of output: 283


🏁 Script executed:

rg "json\.loads" kairon/ -B 2 -A 2 | head -60

Repository: digiteinfotech/kairon

Length of output: 3699


🏁 Script executed:

rg "json\.loads" kairon/ -B 3 -A 3 | grep -A 3 "try\|except" | head -40

Repository: digiteinfotech/kairon

Length of output: 2378


🏁 Script executed:

grep -r "original_response" kairon/shared/llm/ --include="*.py"

Repository: digiteinfotech/kairon

Length of output: 619


🏁 Script executed:

python3 << 'EOF'
# Check what MongoEngine's DynamicField accepts
print("DynamicField in mongoengine accepts any Python object type.")
print("It can store: dict, list, string, number, boolean, None, etc.")
print("So non-dict values from json.loads are NOT a problem for LLMLogs.")
print("\nHowever, the json.loads exception handling IS a valid concern.")
EOF

Repository: digiteinfotech/kairon

Length of output: 151


Add exception handling for JSON parsing.

json.loads(raw) can raise a JSONDecodeError if raw contains malformed JSON, crashing the logging pipeline and disrupting LLM call tracking. The codebase establishes a pattern of wrapping json.loads calls in try-except blocks.

🔎 Proposed fix with exception handling
 raw = kwargs.get("original_response")
-original_response = json.loads(raw) if raw else {}
+try:
+    original_response = json.loads(raw) if raw else {}
+except (json.JSONDecodeError, TypeError) as e:
+    logger.warning(f"Failed to parse original_response as JSON: {e}")
+    original_response = {}

 if isinstance(original_response, dict):
     original_response.pop("model", None)
     original_response.pop("data", None)
     original_response.pop("object", None)
📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

raw = kwargs.get("original_response")
try:
    original_response = json.loads(raw) if raw else {}
except (json.JSONDecodeError, TypeError) as e:
    logger.warning(f"Failed to parse original_response as JSON: {e}")
    original_response = {}

if isinstance(original_response, dict):
    original_response.pop("model", None)
    original_response.pop("data", None)
    original_response.pop("object", None)
🤖 Prompt for AI Agents
In kairon/shared/llm/logger.py around lines 30 to 36, the direct call to
json.loads(raw) can raise JSONDecodeError and crash the logging pipeline; wrap
the json.loads call in a try/except that catches json.JSONDecodeError (and a
broad Exception fallback) and on failure set original_response = {} (or parse a
best-effort/sanitized value) and log the parsing error for debugging; then
proceed with the existing isinstance(dict) checks and pop calls so malformed
JSON does not break the flow.
