[Suggestion] Add a new paper on Multimodal Agentic Reasoning: "Video-Browser" #10

@chrisx599

Hi! Thank you for this comprehensive collection of agentic reasoning works.

I would like to suggest adding a paper that explores reasoning and planning for multimodal agents (specifically in video browsing tasks).

Title: Video-Browser: Towards Agentic Open-web Video Browsing
Link: https://arxiv.org/abs/2512.23044
Code: https://github.com/chrisx599/Video-Browser

We introduce a benchmark and an agent architecture focused on efficiently retrieving open-web information from videos, which aligns with the "Agentic Reasoning" theme of this repo.

Thanks for considering!
