Change the repository type filter
All
Repositories list
21 repositories
grps_trtllm
PublicHigher performance OpenAI LLM service than vLLM serve: A pure C++ high-performance OpenAI LLM service implemented with GPRS+TensorRT-LLM+Tokenizers.cpp, supporting chat and function call, AI agents, distributed multi-GPU inference, multimodal capabilities, and a Gradio chat interface.grps
PublicDeep Learning Deployment Framework: Supports tf/torch/trt/trtllm/vllm and other NN frameworks. Support dynamic batching, and streaming modes. It is dual-language compatible with Python and C++, offering scalability, extensibility, and high performance. It helps users quickly deploy models and provide services through HTTP/RPC interfaces.TensorRT-LLM
PublicTensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in performant way.HSTU-Tensorflow
PublicHGCDR
PublicFPARec
PublicControlTalk
Publicgrps_examples
Publicgrps_vllm
Publicnn-infer-opt
Publiceasy-ngo
Publiceasy-ngo-website
Publiceasy-ngo-doc
Public archiveCSCE-Net
Publiceasy-ngo-examples
Publiceasy-ngo-layout
Publiceasy-ngo-tools
Publicngo
Public archivengo-demo
Public archivenetease-media.github.io
PublicPAMM-HiA-T5
Public