Skip to content

vllm + ui-tars的使用方式导致浏览器插件报错 #319

Closed
@Rabbit-bear

Description

配置如下

OPENAI_BASE_URL=http://xxxx:8000/v1
MIDSCENE_MODEL_NAME=ui-tars
OPENAI_API_KEY=empty
MIDSCENE_USE_VLM_UI_TARS=1

报错内容如下

No prompt or id to locate
AssertionError [ERR_ASSERTION]: No prompt or id to locate
    at Object.executor (chrome-extension://gbldofcpkknbggpkmbdaefngejllnief/lib/popup.js:1:4314347)
    at e.Executor.flush (chrome-extension://gbldofcpkknbggpkmbdaefngejllnief/lib/popup.js:1:4271053)
    at async G.actionToGoal (chrome-extension://gbldofcpkknbggpkmbdaefngejllnief/lib/popup.js:1:4321886)
    at async Y.aiAction (chrome-extension://gbldofcpkknbggpkmbdaefngejllnief/lib/popup.js:1:4326926)
    at async Object.onClick (chrome-extension://gbldofcpkknbggpkmbdaefngejllnief/lib/popup.js:1:5609153)

浏览器插件版本

Midscene.js
0.25

请求内容

{
  "model": "ui-tars",
  "messages": [
    {
      "role": "user",
      "content": "\nYou are a GUI agent. You are given a task and your action history, with screenshots. You need to perform the next action to complete the task. \n\n## Output Format\n```\nThought: ...\nAction: ...\n```\n\n## Action Space\nclick(start_box='[x1, y1, x2, y2]')\nleft_double(start_box='[x1, y1, x2, y2]')\nright_single(start_box='[x1, y1, x2, y2]')\ndrag(start_box='[x1, y1, x2, y2]', end_box='[x3, y3, x4, y4]')\nhotkey(key='')\ntype(content='') #If you want to submit your input, use \"\\n\" at the end of `content`.\nscroll(start_box='[x1, y1, x2, y2]', direction='down or up or right or left')\nwait() #Sleep for 5s and take a screenshot to check for any changes.\nfinished()\ncall_user() # Submit the task and call the user when the task is unsolvable, or when you need the user's help.\n\n## Note\n- Use Chinese in `Thought` part.\n- Write a small plan and finally summarize your next action (with its target element) in one sentence in `Thought` part.\n\n## User Instruction\n点击介绍"
    },
    {
      "role": "user",
      "content": [
        {
          "type": "image_url",
          "image_url": {
            "url": "data:image/jpeg;base64,/9j/4AAQSkZJRgABAQAAAQABAAD/4gIoSUNDX1BST0ZJTEUAAQEAAAIYAAAAAAQwAABtbnRyUkdCIFhZWiAH4AABAAEAAAAAAABhY3NwAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAQAA9tYAAQAAAADTLQAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAlkZXNjAAAA8AAAAHRyWFlaAAABZAAAABRnWFlaAAABeAAAABRiWFlaAAABjAAAABR3dHB0AAABoAAAABRyVFJDAAABtAAAAChnVFJDAAABtAAAAChiVFJDAAABtAAAAChjcHJ0AAAB3AAAADxtbHVjAAAAAAAAAAEAAAAMZW5VUwAAAFgAAAAcAEcAbwBvAGcAbABlAC8AUwBrAGkAYQAvAEUANgAwADkAQQA0ADEAMAA2ADYAOQAxAEMAMwAzADcARgA3AEIARABDADIAQQAzADgAQQBGAEQANQBDADUAMVhZWiAAAAAAAABxKAAAOfEAAAJqWFlaIAAAAAAAAF16AAC2rwAAD3BYWVogAAAAAAAAKDMAAA9gAADBUlhZWiAAAAAAAAD21gABAAAAANMtcGFyYQAAAAAABAAAAAJlQgAA8xUAAAyqAAAT5QAAC4MAAAAXAAAAAG1sdWMAAAAAAAAAAQAAAAxlblVTAAAAIAAAABwARwBvAG8AZwBsAGUAIABJAG4AYwAuACAAMgAwADEANv/bAEMACgcHCAcGCggICAsKCgsOGBAODQ0OHRUWERgjHyUkIh8iISYrNy8mKTQpISIwQTE0OTs+Pj4lLkRJQzxINz0+O//bAEMBCgsLDg0OHBAQHDsoIig7Ozs7Ozs7Ozs7Ozs7Ozs7Ozs7Ozs7Ozs7Ozs7Ozs7Ozs7Ozs7Ozs7Ozs7Ozs7Ozs7O//AABEIBuQMCgMBIgACEQEDEQH/xAAcAAEAAgMBAQEAAAAAAAAAAAAAAgMEBQYBBwj/xABoEAABAwMCAwQFBAkQBQkEBgsAAQIDBAURBhITITEHQVFhFCJxgZEVMlKhFiMzQpKxwdHwFzY3Q1VWYmNyc3SCk5Sy0iQ1dbPhJTRTVFeVosLxJieD00VGZISFo6TD4ghER2U4ZuN2/8QAGQEBAQEBAQEAAAAAAAAAAAAAAAIBBAMF/8QANREBAQEAAgEDAgMIAgEDBQEAAAERAhIDITJBBDETUXEUImGBkaGx8MHRQsLh8RUjJFJisv/aAAwDAQACEQMRAD8A+nveUukEjjHe8+hx4vncuSxZCCyFLnkN/mes4PK82TxD3iGLvG8dGd2XxBxDGR56jx1b3ZXEPeIYu893k9W9mTxD1HmOjj3cZ1V2ZG89R5j7j1HGdW9mRvPd5jo4luM6t7L957vKEce7jOrey7ee7yncMmY3su3nu8p3DcMb2XbxvKtw3GdTst3jeVbhuHU7Ld43lW4bh1Oy3eN5VuG4dTst3jeVbhuHU7Ld43lW4bh1Oy3eN5VuG4dTst3jeVbhuHU7Ld43lW4bh1Oy3eN5VuG4dTst3jeVbhuHU7Ld43lW4bh1Oy3eN5VuG4dTst3jeVbhuHU7Ld43lW4bh1Oy3eN5VuG4dTst3jeVbhuHU7Ld43lW4bh1Oy3eN5VuG4dTst3jeVbhuHU7Ld43lW4bh1Oy3eN5VuG4dTst3jeVbhuHU7Ld43lW4bh1Oy3eN5VuG4dTst3jeVbhuHU7Ld43lW4bh1Oy3eN5VShow more

响应内容

{
    "id": "chatcmpl-0f7425ae6ec445ad8cff7b75636d1a1d",
    "object": "chat.completion",
    "created": 1737708523,
    "model": "ui-tars",
    "choices": [
        {
            "index": 0,
            "message": {
                "role": "assistant",
                "content": "从当前页面看,我需要点击左侧导航栏中的\"介绍\"选项,以查看关于 Midscene.js 的基本介绍信息。这个选项位于左侧导航栏的顶部,点击它可以展开更多内容。\nAction: click(start_box='(35,145)')",
                "tool_calls": []
            },
            "logprobs": null,
            "finish_reason": "stop",
            "stop_reason": null
        }
    ],
    "usage": {
        "prompt_tokens": 2947,
        "total_tokens": 3008,
        "completion_tokens": 61,
        "prompt_tokens_details": null
    },
    "prompt_logprobs": null
}

请教下这个是什么原因呢

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions