02-new-model.yml
02-new-model.yml
| Name | About | Labels | Assignees |
|---|---|---|---|
| 🤗 Support request for a new model from huggingface | Submit a proposal/request for a new model from huggingface | new model |
Before submitting an issue, please make sure the issue hasn't been already addressed by searching through the existing and past issues.
We also highly recommend you read https://nvidia.github.io/TensorRT-LLM/architecture/add-model.html first to understand how to add a new model.
A huggingface identifier, pointing to the model, e.g. meta-llama/Llama-3.1-8B-Instruct .
Here is the list of models already supported by TensorRT-LLM: https://github.com/NVIDIA/TensorRT-LLM/tree/main/tensorrt_llm/models (TRT backend) and https://github.com/NVIDIA/TensorRT-LLM/tree/main/tensorrt_llm/_torch/models (Pytorch backend) . Which model is the most similar to the model you want to add support for?
For example, any new operators or new architecture?
Thanks for contributing 🎉!