Skip to content

Gemm、multihead_attn算子校准量化 #6689

@DRTAXI

Description

@DRTAXI

expectation | 诉求 | 期待する

  1. speed
  2. precision

model | 模型 | モデル

  1. model.param and model.bin

detail | 详细描述 | 詳細な説明

目前ncnn2table和ncnn2int8不支持Gemm、multihead_attn算子的校准量化,想问一下什么时候可以对该类算子进行校准量化呢?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions