TensorRT-LLM/.github/ISSUE_TEMPLATE/02-new-model.yml at c9fe07ede649d3f9659728362af39883f5f502ce · NVIDIA/TensorRT-LLM

41 lines (39 loc) · 1.96 KB

Name	About	Labels	Assignees
🤗 Support request for a new model from huggingface	Submit a proposal/request for a new model from huggingface	new model

Before submitting an issue, please make sure the issue hasn't been already addressed by searching through the existing and past issues.

We also highly recommend you read https://nvidia.github.io/TensorRT-LLM/architecture/add-model.html first to understand how to add a new model.

The model to consider.*

A huggingface identifier, pointing to the model, e.g. meta-llama/Llama-3.1-8B-Instruct .

The closest model TensorRT-LLM already supports.

Here is the list of models already supported by TensorRT-LLM: https://github.com/NVIDIA/TensorRT-LLM/tree/main/tensorrt_llm/models (TRT backend) and https://github.com/NVIDIA/TensorRT-LLM/tree/main/tensorrt_llm/_torch/models (Pytorch backend) . Which model is the most similar to the model you want to add support for?

What's your difficulty of supporting the model you want?

For example, any new operators or new architecture?

Thanks for contributing 🎉!

Before submitting a new issue...

Make sure you already searched for relevant issues, and checked the [documentation](https://nvidia.github.io/TensorRT-LLM/) and [examples](https://github.com/NVIDIA/TensorRT-LLM/tree/main/examples) for answers to frequently asked questions.*

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Before submitting an issue, please make sure the issue hasn't been already addressed by searching through the existing and past issues.

We also highly recommend you read https://nvidia.github.io/TensorRT-LLM/architecture/add-model.html first to understand how to add a new model.

FilesExpand file tree

02-new-model.yml

Latest commit

History

02-new-model.yml

File metadata and controls

Before submitting an issue, please make sure the issue hasn't been already addressed by searching through the existing and past issues.

We also highly recommend you read https://nvidia.github.io/TensorRT-LLM/architecture/add-model.html first to understand how to add a new model.