Skip to content
This repository was archived by the owner on Feb 3, 2025. It is now read-only.
This repository was archived by the owner on Feb 3, 2025. It is now read-only.

No speed improvements after TF-TRT optimizing on a tensorflow BERT model #330

Open
@SohaKhazaeli

Description

@SohaKhazaeli

After optimizing the model with either FP32 or FP16 I don't get any speed improvements.

The optimization is done on tensorflow/tensorflow:2.10.0-gpu docker image. The model uses tensorflow-text and tf-models-official libraries

This is the log from optimization process:

image

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions