How to deploy a new model based on triton and fastertransformer_backend? #104

WangYizhang01 · 2023-03-16T08:34:00Z

WangYizhang01
Mar 16, 2023

Hello, I am new to fastertransformer_backend and there are still many things that are not very clear to me. I have some questions to consult with you, mainly about how to deploy a new model.
Now fastertransformer_backend supports several models such as Bert and GPT. If I want to deploy a new model (not included in the Support matrix), such as AlphaFold, what work do I need to do?
Looking forward to your reply！

Answered by byshiue

Mar 16, 2023

FasterTransformer cannot parse the model architecture. So, for a new model, you may need to develop the model (different model architectures and cuda kernels) first, and then encapsulate it by the triton backend. Then, you can call it in triton.

View full answer

byshiue · 2023-03-16T10:33:19Z

byshiue
Mar 16, 2023
Maintainer

FasterTransformer cannot parse the model architecture. So, for a new model, you may need to develop the model (different model architectures and cuda kernels) first, and then encapsulate it by the triton backend. Then, you can call it in triton.

1 reply

siddharth-mavani Jun 12, 2023

Can you please explain further on what you mean by "you may need to develop the model (different model architectures and cuda kernels) first". Secondly, how do I encapsulate it by the triton backend ?

Is there documentation available related to the above queries ?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

How to deploy a new model based on triton and fastertransformer_backend? #104

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment 1 reply

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

How to deploy a new model based on triton and fastertransformer_backend? #104

Uh oh!

WangYizhang01 Mar 16, 2023

Replies: 1 comment · 1 reply

Uh oh!

byshiue Mar 16, 2023 Maintainer

Uh oh!

siddharth-mavani Jun 12, 2023

WangYizhang01
Mar 16, 2023

Replies: 1 comment 1 reply

byshiue
Mar 16, 2023
Maintainer