Skip to content

Any example to quantise a text embedding model on Intel Gaudi2? #1919

Open
@sleepingcat4

Description

@sleepingcat4

I was looking for example or documentation how I can load or quantise both a HF embedding model on Intel Gaudi2. is there any examples available? I don't want to use docker btw

Metadata

Metadata

Labels

aitceAI TCE to handle it firstly

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions