Skip to content

Latest commit

 

History

History
39 lines (23 loc) · 815 Bytes

README.md

File metadata and controls

39 lines (23 loc) · 815 Bytes

GPT4V VQA

This repository offers a ROS Node with GPT4V model.

Installation

catkin build gpt4v_vqa

Usage

roslaunch gpt4v_vqa vqa.launch api_key:=<YOUR_API_KEY> VQA_INPUT_IMAGE:=<IMAGE TOPIC>

And from other terminal

$ rosrun gpt4v_vqa vqa_interpreter.py

Nodes

gpt4v_vqa

This node is a ROS wrapper for GPT4V model. Its behavior is similar to VQA node. But there is a difference that this node does not support continuous inference. This node use API only when action server is called.

Subscribed Topics

  • ~image ([sensor_msgs/Image])

    The image used for VQA as default image.

Action Servers

  • ~inference_server ([jsk_recognition_msgs/VQATaskAction])

    The action server for VQA.