Name: 🤗 Support request for a new model from huggingface
About: Submit a proposal/request for a new model from huggingface
Labels: new model
Assignees: (none)

Before submitting an issue, please search the existing and past issues to make sure it hasn't already been addressed.

We also highly recommend you read https://nvidia.github.io/TensorRT-LLM/architecture/add-model.html first to understand how to add a new model.

A Hugging Face identifier pointing to the model, e.g. meta-llama/Llama-3.1-8B-Instruct.

Here is the list of models already supported by TensorRT-LLM: https://github.com/NVIDIA/TensorRT-LLM/tree/main/tensorrt_llm/models (TRT backend) and https://github.com/NVIDIA/TensorRT-LLM/tree/main/tensorrt_llm/_torch/models (PyTorch backend). Which of these is most similar to the model you want to add support for?

For example, does the model need any new operators or a new architecture?

Thanks for contributing 🎉!

Before submitting a new issue...