Skip to content

[Feature Request] Video Analysis Toolkit Enhancement #1848

@Aaron617

Description

@Aaron617

Required prerequisites

Motivation

The current video analysis lack the ability to process long-video

Solution

  1. In test hour-long challenge challenge , the best-performing solution is to segment video. can use structured text to aggregate information.
  2. Gemini-2 may be better at video analysis (https://medium.com/@samarrana407/google-video-analyzer-gemini-2-0-b150c6f500fb)
  3. Other reference papers:
    internvideo2.5: https://github.com/OpenGVLab/InternVideo/tree/main/InternVideo2.5
    movie chat:https://arxiv.org/pdf/2307.16449

Alternatives

No response

Additional context

No response

Metadata

Metadata

Assignees

Labels

P1Task with middle level priorityenhancementNew feature or request

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions