adding a tag on screenshots for mouse_click coordinate actions #240

TLSDC · 2025-04-24T17:34:42Z

Description by Korbit AI

What change is being made?

Add functionality to tag screenshots with action coordinates when the action is a mouse click, by drawing a dot on the screenshot at the specified coordinates.

Why are these changes being made?

The changes enhance the interpretability of screenshots by visually indicating where mouse click actions occurred, thereby aiding in better analysis and debugging during experiment result inspections. This approach provides a clear visual representation without modifying the core logic of action recording or screenshot capturing.

Is this description stale? Ask me to generate a new description by commenting /korbit-generate-pr-description

korbit-ai · 2025-04-24T17:34:47Z

Based on your review schedule, I'll hold off on reviewing this PR until it's marked as ready for review. If you'd like me to take a look now, comment /korbit-review.

Your admin can change your review schedule in the Korbit Console

Copilot

Pull Request Overview

This PR adds functionality to tag screenshots with a red dot when a "mouse_click" action is detected, giving a visual cue for coordinate-based actions.

Introduces the new function tag_screenshot_with_action to parse and render mouse_click coordinates on screenshots.
Updates the update_screenshot and update_screenshot_pair functions to apply the tagging functionality.

src/agentlab/analyze/agent_xray.py

korbit-ai

Review by Korbit AI

Korbit automatically attempts to detect when you fix issues in new commits.

Category	Issue	Status
	Missing Coordinate Parsing Error Handling ▹ view	✅ Fix detected
	Inconsistent Screenshot Pair Annotation ▹ view

Files scanned

File Path	Reviewed
src/agentlab/analyze/agent_xray.py	✅

Explore our documentation to understand the languages and file types we support and the files we ignore.

Check out our docs on how you can make Korbit work best for you and your team.

Loving Korbit!? Share us on LinkedIn Reddit and X

src/agentlab/analyze/agent_xray.py

+        coords = [c.strip() for c in coords]
+        if coords[0].startswith("x="):
+            coords[0] = coords[0][2:]
+        if coords[1].startswith("y="):
+            coords[1] = coords[1][2:]


korbit-ai · 2025-04-24T17:36:17Z

src/agentlab/analyze/agent_xray.py

+    if s1 is not None:
+        s1 = tag_screenshot_with_action(s1, info.exp_result.steps_info[info.step].action)


Inconsistent Screenshot Pair Annotation

Tell me more

What is the issue?

The screenshot annotation logic in update_screenshot_pair() only annotates the first screenshot (s1) but not the second one (s2).

Why this matters

This inconsistency means users won't see click markers on the second screenshot, making it harder to track the progression of actions across screenshot pairs.

Suggested change ∙ Feature Preview

def update_screenshot_pair(som_or_not: str): global info s1 = get_screenshot(info, info.step, som_or_not) s2 = get_screenshot(info, info.step + 1, som_or_not) if s1 is not None: s1 = tag_screenshot_with_action(s1, info.exp_result.steps_info[info.step].action) if s2 is not None and info.step + 1 < len(info.exp_result.steps_info): s2 = tag_screenshot_with_action(s2, info.exp_result.steps_info[info.step + 1].action) return s1, s2

Provide feedback to improve future suggestions

_{💬 Looking for more details? Reply to this comment to chat with Korbit.}

Co-authored-by: Copilot <[email protected]>

Copilot

Pull Request Overview

This PR adds a new function to annotate screenshots with a red marker for mouse click coordinate actions, improving the visual context for debugging. The key changes include:

Introducing tag_screenshot_with_action() to parse mouse click actions and draw a marker on screenshots.
Updating update_screenshot() and update_screenshot_pair() to use the new tagging function.

Copilot · 2025-04-24T17:42:21Z

src/agentlab/analyze/agent_xray.py

+            draw.ellipse((x - radius, y - radius, x + radius, y + radius), fill="red", outline="red")
+        except (ValueError, IndexError) as e:
+            warning(f"Failed to parse action '{action}': {e}")


The function tag_screenshot_with_action does not return the modified screenshot after drawing the marker. Consider adding a return statement at the end of the function (both after a successful draw and outside the if block) so that the updated image is returned.

Suggested change

draw.ellipse((x - radius, y - radius, x + radius, y + radius), fill="red", outline="red")

except (ValueError, IndexError) as e:

warning(f"Failed to parse action '{action}': {e}")

draw.ellipse((x - radius, y - radius, x + radius, y + radius), fill="red", outline="red")

return screenshot

except (ValueError, IndexError) as e:

warning(f"Failed to parse action '{action}': {e}")

return screenshot

recursix

Beautiful!!! Thanks

adding a tag on screenshots for mouse_click coordinate actions

c3e1792

TLSDC requested review from recursix and Copilot April 24, 2025 17:34

TLSDC marked this pull request as ready for review April 24, 2025 17:34

Copilot AI reviewed Apr 24, 2025

View reviewed changes

src/agentlab/analyze/agent_xray.py Outdated Show resolved Hide resolved

korbit-ai bot reviewed Apr 24, 2025

View reviewed changes

TLSDC and others added 2 commits April 24, 2025 13:38

darglint

6569fb1

Update src/agentlab/analyze/agent_xray.py

c6e5cb0

Co-authored-by: Copilot <[email protected]>

TLSDC requested a review from Copilot April 24, 2025 17:41

Copilot AI reviewed Apr 24, 2025

View reviewed changes

darglint

2f893a9

recursix approved these changes Apr 24, 2025

View reviewed changes

TLSDC merged commit 8666268 into main Apr 24, 2025
3 checks passed

TLSDC deleted the tlsdc/xray_image_tagging branch April 24, 2025 18:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

adding a tag on screenshots for mouse_click coordinate actions #240

adding a tag on screenshots for mouse_click coordinate actions #240

Uh oh!

TLSDC commented Apr 24, 2025 •

edited by korbit-ai bot

Loading

Uh oh!

korbit-ai bot commented Apr 24, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

korbit-ai bot left a comment •

edited

Loading

Uh oh!

This comment was marked as resolved.

Uh oh!

korbit-ai bot Apr 24, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Apr 24, 2025

Uh oh!

recursix left a comment

Uh oh!

Uh oh!

Uh oh!

		if s1 is not None:
		s1 = tag_screenshot_with_action(s1, info.exp_result.steps_info[info.step].action)

adding a tag on screenshots for mouse_click coordinate actions #240

adding a tag on screenshots for mouse_click coordinate actions #240

Uh oh!

Conversation

TLSDC commented Apr 24, 2025 • edited by korbit-ai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description by Korbit AI

What change is being made?

Why are these changes being made?

Uh oh!

korbit-ai bot commented Apr 24, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Uh oh!

Uh oh!

korbit-ai bot left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Review by Korbit AI

Korbit automatically attempts to detect when you fix issues in new commits.

Uh oh!

This comment was marked as resolved.

Uh oh!

korbit-ai bot Apr 24, 2025

Choose a reason for hiding this comment

Inconsistent Screenshot Pair Annotation

What is the issue?

Why this matters

Suggested change ∙ Feature Preview

Provide feedback to improve future suggestions

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Uh oh!

Copilot AI Apr 24, 2025

Choose a reason for hiding this comment

Uh oh!

recursix left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

TLSDC commented Apr 24, 2025 •

edited by korbit-ai bot

Loading

korbit-ai bot left a comment •

edited

Loading